Pull requests: codelucas/newspaper

Pull requests list

Added Hindi language stopwords
#249 opened May 14, 2016 by parulsethi, updated Apr 10, 2026
Update extractors.py
#500 opened Jan 5, 2018 by sirks, updated Apr 10, 2026
Added reference corpus keyword functionality enhancement
#480 opened Dec 1, 2017 by IngoKl, updated Apr 10, 2026
Convert list items correctly (fixes double text) bug enhancement
#456 opened Oct 7, 2017 by mercuree (Contributor), updated Apr 10, 2026
implementing 3rd method of publish date extraction (issue # 521)
#549 opened Apr 11, 2018 by zachorban, updated Apr 10, 2026
change content tag name for datePublished
#402 opened Jul 20, 2017 by mamoit, updated Apr 10, 2026
Fix typo: langauges -> languages
#345 opened Mar 10, 2017 by jumarko, updated Apr 10, 2026
Convert mthreading to use concurrent.futures
#553 opened Apr 17, 2018 by c0d3d, updated Apr 10, 2026
Adjustments to extract full text of an article from The Atlantic.
#518 opened Feb 12, 2018 by EdwardBetts (Contributor), updated Apr 10, 2026
Added explaination on using exisiting html string
#307 opened Nov 26, 2016 by Bartvds, updated Apr 10, 2026
Scrape og:image:secure_url og:image:url
#404 opened Jul 21, 2017 by mamoit, updated Apr 10, 2026
Fixed problem with empty meta description bug enhancement
#524 opened Feb 28, 2018 by baktakt, updated Apr 10, 2026
Nepali language support enhancement
#544 opened Apr 3, 2018 by ashokpant, updated Apr 10, 2026
Another option to get meta language
#312 opened Dec 25, 2016 by mercuree (Contributor), updated Apr 10, 2026
FIX: issue#283 text dates are not well parsed
#284 opened Sep 2, 2016 by vperilla, updated Apr 10, 2026
many changes
#558 opened Apr 29, 2018 by bung87, updated Apr 10, 2026
Previous siblings were inserted in reverse order
#329 opened Feb 5, 2017 by mercuree (Contributor), updated Apr 10, 2026
Trailing slash for url date regex
#358 opened Apr 6, 2017 by akionakamura, updated Apr 10, 2026
Made the amount of keywords adjustable, defaulting to 10
#561 opened May 7, 2018 by fabiant7t, updated Apr 10, 2026
Fixed raise statement bug enhancement
#550 opened Apr 12, 2018 by ttong1013, updated Apr 10, 2026
Add JSON-LD support by using extruct enhancement
#385 opened Jun 15, 2017 by torbenbrodt (Contributor), updated Apr 10, 2026
modify the last step of title-extracting enhancement
#476 opened Nov 24, 2017 by Ckins, updated Apr 10, 2026
Do not choose top node that will be emptied enhancement needs design decision
#424 opened Aug 30, 2017 by megatron-me-uk (Contributor), updated Apr 10, 2026
Create reference to article_html node
#323 opened Jan 23, 2017 by mercuree (Contributor), updated Apr 10, 2026
customization for french newspaper
#389 opened Jun 28, 2017 by antsafi, updated Apr 10, 2026