Releases: openzim/warc2zim
Releases · openzim/warc2zim
2.0.0
Added
- Allow to specify a scraper suffix for the ZIM scraper metadata at the CLI (#168)
- New test website to test many known situations supposed to be handled (#166)
Changed
- Replace Service Worker approach by scraper-side rewriting of static content (kiwix/overview#95)
- Adopted Python bootstrap conventions (#152)
- Upgrade dependencies, especially move to Python 3.12 (only) and zimscraperlib 3.3.2
- Change wording in logs about the return code 100 (which is not an error code)
- Added checks in
converter.py
to verify output directory existence, logging appropriate error messages and cleanly exit if checks fail. (#106) - Added check for invalid zim file names (#232)
- Changed default publisher metadata from 'Kiwix' to 'openZIM' (#150)
1.5.5
1.5.4
1.5.3
1.5.2
1.5.1
1.5.0
Added
- Don't crash on failure to convert illustration (skip illus instead)
Changed
- Fixed 404 page (#96)
- Dont't crash on missing Location headers on potential redirect
- Fixed incorrect ISO-639-3 --lang not replaced with
eng
- Don't fallback to
eng
if the host doesnt have the matching locale - Using wabac.js 2.15.0 with fix for scope conflict in SW/DB
- Payload entries now uses original ~
text/html
mimetype instead oftext/html;raw=true
- dont't crash on icon link with no href
v1.4.3
1.4.0
Added
- Additional fuzzy matching rules for youtube and vimeo, and additional test cases
- Support for youtube videos, which require POST request handling to work.
- Support for canonicalizing POST request data into URL for fuzzy matching (using cdxj-indexer)
- Support loading custom sw.js from a local file path
Changed
- Updated zimscraperlib to 1.6 using libzim7.2
- Updated warcio to 1.7.4
- Added support for {period} replacement in --zim-file
- Using fixed MarkupSafe version (Jinja2 dependency)