Read only the necessary bytes when parsing the header #36

DavidBruant · 2019-01-19T12:31:14Z

I use this library in a project i'm currently working on. It works beautifully, thank you very much!

The files i use it on are around 1.5GB. The streaming of the body is super cool, however, i noticed that to parse the header, the entire file was being read in memory

This is due to the use of fs.readFile in src/header.js
In this PR, the code is a bit lower-level to be able to read only the necessary bytes. The idea is:

read on the file only the start field of the header,
this information is used to know how many bytes the header is composed of, so only this number of bytes is read

I'm happy to discuss the change further if what i did here is unclear or if it doesn't adhere to the project standard

DavidBruant · 2019-01-19T12:38:08Z

I can confirm that this change had an amazing perf impact on the header reading in my 1.5GB files. It's pretty much instantaneous now and i can run my script in parallel of my web browser without running out of memory 👌

Read only the necessary bytes when parsing the header

1937787

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Read only the necessary bytes when parsing the header #36

Read only the necessary bytes when parsing the header #36

Uh oh!

DavidBruant commented Jan 19, 2019

Uh oh!

DavidBruant commented Jan 19, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Read only the necessary bytes when parsing the header #36

Are you sure you want to change the base?

Read only the necessary bytes when parsing the header #36

Uh oh!

Conversation

DavidBruant commented Jan 19, 2019

Uh oh!

DavidBruant commented Jan 19, 2019

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant