You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently the file write happens at the end of the process. This means that the entire JSON representation needs to be held in memory until the end of the process. This limits the number of URLs that can be processed in a single run.
Proposed solution
Stream the output to the output file after each request
Manage a SHA of each row processed so we don't add duplicates (this can be stored in memory)
The text was updated successfully, but these errors were encountered:
# This is the 1st commit message:
Rename the merlin namespace.
# This is the commit message #2:
Merlin framework composer project
# This is the commit message #3:
Code consistency.
Currently the file write happens at the end of the process. This means that the entire JSON representation needs to be held in memory until the end of the process. This limits the number of URLs that can be processed in a single run.
Proposed solution
The text was updated successfully, but these errors were encountered: