Article-Scraper

Currently just working for the site Geeks for Geeks, but the idea is to scrape sites I frequent for Python specific articles and return them in it's own html file with the headline, summary, and link to the full article.

Modules Used

BeautifulSoup 4 for web scraping.
Requests library for making HTTP requests.
lxml parser used in the BeautifulSoup object.
os for opening the html file.

To-Do

Scrape more than just the one site for articles.
Work on the over all display of the html file, it's not overly well formatted currently.
Test on Linux, I know it will have trouble when trying to run the os.startfile() line.

Usage

Download the files, go to terminal and simply run:

python geeksScrape.py

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
README.md		README.md
geekScrape.css		geekScrape.css
geeksScrape.html		geeksScrape.html
geeksScrape.py		geeksScrape.py
index.html		index.html
logo.png		logo.png
screenshot.JPG		screenshot.JPG
screenshot02.JPG		screenshot02.JPG

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Article-Scraper

Modules Used

To-Do

Usage

About

Uh oh!

Releases

Packages

Languages

SlyCodePanda/Article-Scraper

Folders and files

Latest commit

History

Repository files navigation

Article-Scraper

Modules Used

To-Do

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages