Lesson 2 - Using streams in your script

Please go into the lesson-2 folder in your terminal. The relevant files for this lesson are there.

Introduction

In this lesson we are going to cover an alternative way of reading and writing to files using one of the core features of node: streams.

Whenever you've watched a video online, have you noticed you can start watching it even though the whole video hasn't finished loading? That's because it is being 'streamed', bit by bit, so that as every chunk of its data becomes available it is immediately put to use.

A stream in node is simply an interface for working with 'streaming data' like this. Streams can be readable (e.g. reading a file), writable (e.g. writing new content to a file), or both.

Streams in readFile

Let's start with something familiar. Up until now, whenever you've needed to read a file using node's fs module you've likely done something like the following:

fs.readFile(file, function(err, data) {
  if (err) console.error(err);
  // do something with data
});

It turns out that under the hood, readFile actually uses streams! Let's look at a simplified version of the readFile function:

fs.readFile = function(file, cb) {

  var readStream = fs.createReadStream(file);
  // set up a read stream on the target file and store it in a variable

  var fileContent = '';
  // create an empty string we will use to store the contents of the read file

  readStream.on('data', function(chunk) {
    fileContent += chunk;
  });
  // every time a new chunk of the file becomes available, append it to fileContent

  readStream.on('error', function(err) {
    cb(err);
  });
  // if anything goes wrong, pass the error to the callback

  readStream.on('end', function() {
    cb(null, fileContent);
  });
  // when the whole file has been read, pass its content to the callback
};

Now let's go through each bit in more detail, and explain what's happening.

Streams have a method, stream.on('event', function () {}), which subscribes a function to the specified event so that it is executed every time the event occurs.

readStream.on('data', function (chunk) {
  fileContent += chunk;
});

Here data is the name of the event. The target file of readStream is read bit by bit. Every time a new chunk becomes available, the data event fires and the subscribed function is called, receiving the new chunk as its first argument. In this case, we simply append each new chunk to the fileContent variable.

Finally, when the stream has finished reading the file the end event is triggered. At this point, the whole file has been read chunk by chunk, and the variable fileContent should contain all the content of the read file.

Exercise 2a - Read Streams

Inside cat.js re-write your cat command to use a read stream instead of fs.readFile.

To recap, your cat command should be executed like this:

node cat.js file

It should output the contents of file to the terminal. You can try using your command on the example files in the public folder.

Hint: If you see something like this output to your terminal:

<Buffer 68 65 6c 6c 6f 20 77 6f 72 6c 64>

This is called a 'buffer'. It represents the file's raw binary data. Each byte in the sequence (68, 65, 6c, and so on) is the hexadecimal value of a character in the file being read: 68 is h, and 0a is the newline character \n. To convert the buffer into a string you can use its toString() method, or pass 'utf8' as the second argument to fs.createReadStream.
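For instance, here is a minimal sketch of converting a buffer to text (the byte values below are just the word 'hello', chosen for illustration):

```javascript
// A chunk emitted without an encoding is a Buffer of raw bytes.
var chunk = Buffer.from([0x68, 0x65, 0x6c, 0x6c, 0x6f]);

// toString() decodes the bytes (as utf8 by default) into text.
console.log(chunk.toString()); // hello

// Alternatively, ask the stream to decode for you up front:
// fs.createReadStream(file, 'utf8') will emit strings instead of Buffers.
```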

Exercise 2b - Write Streams

If read streams let you read files, write streams let you write content to them!

Inside write.js, type the following (don't copy-paste it):

var fs = require("fs");
var data = 'Simply Easy Learning';

// Create a writable stream
var writeStream = fs.createWriteStream('output.txt');

// Write the data to the stream with utf8 encoding
writeStream.write(data, 'utf8');

// Mark the end of file
writeStream.end();

// When the stream finishes log 'write completed'
writeStream.on('finish', function() {
    console.log("Write completed.");
});

Now try running node write.js. It should log Write completed. to the terminal, and a new file called output.txt containing Simply Easy Learning should have been created.

Did you notice write streams use .write() and .end(), just like the response object of your servers? That's because the response object is a write stream and when you're responding to the client you're 'writing' content to it!

The request object, likewise, is a read stream: when the client makes a request, you're 'reading' the content that has been 'streamed' to the server!

Exercise 2c - Redirection

In Unix, it is possible to take the output of a command that would normally be printed to the terminal (standard output) and redirect it so that the contents of that output are written to a file instead.

The operator we use to accomplish this is > :

cat index.html > example.js

cat index.html will read the html file and output its contents, then > will take this output and redirect it so that it is written to example.js instead.

Go into the public folder and try this:

node path_to_your_cat.js index.html > example.js

Can you see example.js has now been overwritten to contain the contents of index.html?

(Note: this command will overwrite the file's prior content, so be careful using it on your solution scripts.)

Inside redirect.js modify your cat command from the first exercise so that you can give it the following arguments:

node redirect.js path_to_read_file '>' path_to_write_file

If '>' is given as an argument, followed by another file as an argument, it should write the contents of read_file to write_file instead of outputting them to the terminal.

(Note: you need to include the quotes around >, otherwise the shell will interpret it as its built-in redirection operator.)

Hint: If you want to take the output of a read stream and make it the input of a write stream, that is called 'piping'. Piping in node is done with the stream.pipe() method:

readStream.pipe(writeStream);

Every time a new chunk of read_file gets read by readStream it will immediately be redirected to become the input of writeStream. This input will get written to write_file.

Exercise 2d - Appending files

You may not always want to completely re-write the contents of a file. What if you want the content of a file to remain intact but simply append new content onto the end of it?

In Unix you can do this using >> :

cat index.html >> example.js

cat index.html will read the html file and output its contents, then >> will take this output and redirect it so that it is appended to the contents of example.js.

Go into the public folder and try this:

node path_to_your_cat.js index.html >> example.js

Can you see example.js now has the contents of index.html appended onto the end?

Inside append.js modify your cat command from the first exercise so that you can give it the following arguments:

node append.js path_to_read_file '>>' path_to_write_file

If '>>' is provided as an argument, followed by another file as an argument, it should append the contents of read_file to write_file instead of outputting them to the terminal.

Hint: fs.createReadStream and fs.createWriteStream can be passed an options object as a second argument. Look at its flags property.

🏁 That's the end of the workshop! Congratulations! 🏁

If you're hungry for more, please see Bradley's longer version of this workshop.