Tweep is an advanced Twitter scraping tool written in python that allows for scraping Tweets and pictures from Twitter profiles without using Twitter’s API.
Benefits
Some of the benefits of using Tweep vs Twitter API:
- Fast initial setup
- Can be used anonymously
- No rate limitations
- Can fetch all Tweets (Twitter API limits to last 3200 Tweets)
Requirements
- BeautifulSoup4
pip install beautifulsoup4
Usage
-u
The user’s Tweets you want to scrape.-s
Search for Tweets containing this word or phrase.--year
Filter tweets before the specified year.--pics
Download all the pictures from Tweets.--fruit
Display Tweets with “low-hanging-fruit”.--tweets
Display Tweets only.--verified
Display Tweets only from verified users (Use with-s
).--users
Display users only (Use with-s
).
Low-Hanging Fruit
The --fruit
feature will display Tweets that might contain sensitive info such as:
- Profiles from leaked databases (Myspace or LastFM)
- Email addresses
- Phone numbers
Basic Examples and Combos.
A few simple examples to help you understand the basics:
python tweep.py -u username
– Scrape all the Tweets from user‘s timeline.python tweep.py -u username -s pineapple
– Scrape all Tweets from the user‘s timeline containing pineapple.python tweep.py -s pineapple
– Collect every Tweet containing pineapple from everyone’s Tweets.python tweep.py -u username --year 2014
– Collect Tweets that were tweeted before 2014.python tweep.py -u username --pics
– Download all pictures from user‘s timeline.python tweep.py -u username --fruit
– Show Tweets with low-hanging fruit.python tweep.py -s "Donald Trump" --verified --users
– List verified users that Tweet about Donald Trump.
Example String
881653591265746945 2017-07-02 23:19:31 UTC <dqt> i need to drink more water
Add Comment