Skip to content

CHesketh76/GoogleNews

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

GoogleNews Fixed issue of not pulling past articles, only current articles

Build Status Coverage Status PyPI PyPI - Downloads PyPI - Python Version PyPI - Wheel GitHub contributors GitHub issues GitHub Action GitHub

Install

!pip install git+https://github.com/CHesketh76/GoogleNews.git

or

!pip install --upgrade git+https://github.com/CHesketh76/GoogleNews.git

Usage

  • Initializing
from GoogleNews import GoogleNews
googlenews = GoogleNews()
  • Check version
print(googlenews.getVersion())
  • Enable to throw exception
googlenews.enableException(True)
  • Optional choose language
googlenews = GoogleNews(lang='en')

or

googlenews = GoogleNews(lang='en', region='US')
  • Optional choose period (period and custom day range should not set together)
googlenews = GoogleNews(period='7d')
  • Optional choose custom day range (mm/dd/yyyy)
googlenews = GoogleNews(start='02/01/2020',end='02/28/2020')
  • Optional set encode
googlenews = GoogleNews(encode='utf-8')

or

googlenews.set_lang('en')
googlenews.set_period('7d')
googlenews.set_time_range('02/01/2020','02/28/2020')
googlenews.set_encode('utf-8')
  • news.google.com search sample
googlenews.get_news('APPLE')
  • google.com section news search sample
googlenews.search('APPLE')

Default return first page result, you don't need to get first page again, otherwise you might get duplicate result. To get other page of search results:

googlenews.get_page(2)
  • If you only want to get specific page
result = googlenews.page_at(2)
  • If you want to get the total result number of the search(this is approximate number, not exact number, it is the number showing on the google search page) (Note: this function is not available for googlenews.search())
googlenews.total_count()
  • Get results will return the list, [{'title': '...', 'media': '...', 'date': '...', 'datetime': '...', 'desc': '...', 'link': '...', 'img': '...'}]
googlenews.results()

if googlenews.results(sort=True) the tool will try to order the results in cronologically reversed order

  • Get texts will return the list of news titles
googlenews.get_texts()
  • Get links returns the list of news links
googlenews.get_links()
  • Clear result list before doing another search with the same object
googlenews.clear()

Issue

Image is not working in the latest version, it can only return default google loading gif

The date range is not always working as Google may return the result with random order or out of date range.

Google may recognize the program as automated robots and block the IP, using cloud server and fetching data with high frequency will get higher chance to be blocked.

About

Fixes to the GoogleNews

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%