Add support for WARC creation/writing #2
Comments
ikreymer added a commit that referenced this issue on May 23, 2020
- WARCSerializer for serializing WARC records as is
- WARCEnsureDigestSerializer for serializing WARC records and computing block and payload digests, plus an optional signature (experimental)
- Experimental SWARCSerializer for serializing a streaming WARC with unknown payload size in 'SWARC' format, appending the payload digest, block digest, and signature as trailers. SWARCReader with SWARCParser for converting SWARC -> WARC
Merged
ikreymer added a commit that referenced this issue on Jun 9, 2020
* error handling: improve errors if trying to read an already consumed record
* serialization via async iterator, #2, and digest computation, #6
  - WARCSerializer for serializing WARC records and computing block and payload digests from the full payload (non-streaming). Also computes the total content-length
* warcserializer:
  - support gzipped output
  - add createWARCInfo, support revisit params, add tests
* fix and update dist build
* update README: add writing WARCs, add collapsible sections for examples
* switch to different uuid package, update dist/
* update serializer tests
* README tweaks
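
To illustrate the non-streaming digest computation described in the commit above, here is a minimal sketch. It assumes the record block is the HTTP header bytes followed by the payload bytes, and that digests are written as `sha256:` plus a hex string. The names `computeDigests`, `sha256Label`, and `DigestResult` are hypothetical and are not the library's API; only Node's built-in `crypto` module is used.

```typescript
import { createHash } from "node:crypto";

// Hypothetical result shape, for illustration only.
interface DigestResult {
  blockDigest: string;   // digest over HTTP headers + payload
  payloadDigest: string; // digest over the payload alone
  contentLength: number; // total byte length of the record block
}

// Assumption: digests are labelled "sha256:<hex>".
function sha256Label(data: Uint8Array): string {
  return "sha256:" + createHash("sha256").update(data).digest("hex");
}

// Non-streaming: the full payload must be in memory before digests
// and the content length can be computed.
function computeDigests(httpHeaders: string, payload: Uint8Array): DigestResult {
  const headerBytes = new TextEncoder().encode(httpHeaders);

  // Concatenate headers and payload to form the record block.
  const block = new Uint8Array(headerBytes.length + payload.length);
  block.set(headerBytes, 0);
  block.set(payload, headerBytes.length);

  return {
    blockDigest: sha256Label(block),
    payloadDigest: sha256Label(payload),
    contentLength: block.length,
  };
}

const example = computeDigests(
  "HTTP/1.1 200 OK\r\n\r\n",
  new TextEncoder().encode("abc"),
);
console.log(example.contentLength, example.payloadDigest);
```

Because both digests and the content length depend on the complete block, this approach cannot emit the record headers until the whole payload has been read, which is the limitation the experimental streaming format in the earlier commit appears to address by using trailers.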
No description provided.