-
-
Notifications
You must be signed in to change notification settings - Fork 286
Issues: adbar/trafilatura
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Fast and full mode yield the same results
bug
Something isn't working
#787
opened Feb 12, 2025 by
adbar
Review input type for New feature or request
is_probably_readerable()
function
enhancement
#749
opened Nov 22, 2024 by
adbar
Review HTML element list and conversion
enhancement
New feature or request
#720
opened Oct 15, 2024 by
adbar
2 tasks
Docs: add page explaining how to run tests
documentation
Docs in need of update or extension
#698
opened Sep 9, 2024 by
adbar
Downloads: add support to switch between proxies
enhancement
New feature or request
#697
opened Sep 9, 2024 by
adbar
Investigate spacing in element tails
question
Further information is requested
#661
opened Jul 26, 2024 by
adbar
utils.decode_file()
: add switch for full detection or GZip only
enhancement
#595
opened May 15, 2024 by
adbar
Make cascade of different content extractors explicit and configurable
enhancement
New feature or request
#538
opened Apr 3, 2024 by
adbar
Add support for Netscape cookies file format
enhancement
New feature or request
#473
opened Jan 11, 2024 by
adbar
Check URLs passed to courlan functions Further information is requested
extract_links
and fix_relative_urls
question
#382
opened Jun 23, 2023 by
adbar
Function to use part of the heuristics on bare HTML fragments
enhancement
New feature or request
#369
opened Jun 14, 2023 by
adbar
Fix XPath expression in subtree
maintenance
Software compability and continuity
#289
opened Jan 19, 2023 by
adbar
Add document language to metadata
enhancement
New feature or request
#224
opened Jul 19, 2022 by
adbar
Simplify handling of nested elements
enhancement
New feature or request
#93
opened Jul 12, 2021 by
adbar
Keeping all valid table information and formatting
bug
Something isn't working
#78
opened Jun 2, 2021 by
adbar
Refactor code to provide a "keep-tags" option
enhancement
New feature or request
#52
opened Jan 12, 2021 by
adbar
3 tasks
List of smaller extraction bugs (text & metadata)
good first issue
Good for newcomers
up for grabs
Good for (first) contributors
#4
opened Jan 9, 2020 by
adbar
ProTip!
What’s not been updated in a month: updated:<2025-02-14.