Skip to content

vinay2k2/Reddit-scaping-in-R

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

Re-Usable Reddit-scaping-in-R

This is a very covenient code to scrape reddit comments

Index

What it is?

Scrape Reddit Data using R.
This is a pretty convenient script for scrapping Reddit comments.

What you require?

Requirements:
you need to have two piece of information:
Requiremnet 1:Go to the page you want to Scrape let say: https://www.reddit.com/r/television/ so here televison is the subreddit.Collect all subreddits manualy.
Requirement 2:Decide what is the minimum length you want to assume before you scrape a comment lets call it threshold_for_comment

Language Used

R

How to Re-use?

How to use this? All you need to do is modify the code at two places: Locate following lines and just plugin the desired values i.e. Subreddits and value for threshold_for_comment subreddits<-c("television","todayilearned","news","worldnews","movies","videos","sports","pics","AskReddit","music")
threshold_for_comment<-100

contributors

Author Working As contact @
Vinay Kumar MS @ IIT-KGP @github
Surjodoy Ghosh Dastider M-Tech @ IIT-KGP @github

Contact:

You can contact [email protected] incase you are not getting the code/logic somewhere.

About

Scrape Reddit Data using R

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages