Plan for Social Network Analysis
Part One
Goal: identify the users in Australia who post on Instagram about #fridaysforfuture the most.
Questions: how many users? 20?
Tools: Instagram-Scraper (https://github.com/arc298/instagram-scraper)
Tested: it worked. Still need to sort the CSV by number of posts per user. More research needed.
- Scrape hashtag #fridaysforfuture
instagram-scraper fridaysforfuture --tag -t none -m 50
- Didn’t work: it only picked up the fridaysforfuture user account, not the hashtag.
instagram-scraper fridaysforfuture --tag --maximum 10
- This one worked better.
- Downloaded media too.
- But this doesn’t actually produce the JSON file.
- New approach. Now using instatouch: https://github.com/drawrowfly/instagram-scraper#installation
instatouch hashtag fridaysforfuture -c 1000 -t csv
- No need to convert to JSON.
- Then sort by user frequency: https://www.extendoffice.com/documents/excel/1551-excel-sort-by-frequency.html
- Then look up the Instagram username for each user ID: https://commentpicker.com/instagram-username.php
- Next step: get the followers list of the top 20 of these.
python grab.py --account accountname --percentage percentageoffollowerstoscrape
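The sort-by-frequency step could also be done in a script rather than in Excel. A minimal sketch, assuming the instatouch CSV has one row per post and a column holding the poster's ID. The column name "owner_id" and the function name top_posters are assumptions for illustration; check the header row of the real file first.

```python
import csv
from collections import Counter


def top_posters(csv_path, user_column="owner_id", n=20):
    """Return the n most frequent values of user_column as (user, post_count) pairs.

    user_column is an assumed column name; adjust it to match the CSV header.
    """
    with open(csv_path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        # Count one post per row, skipping rows where the user column is empty.
        counts = Counter(row[user_column] for row in reader if row.get(user_column))
    return counts.most_common(n)
```

Calling top_posters("fridaysforfuture.csv") (hypothetical file name) would return up to 20 pairs, most active poster first. Those IDs still need to be resolved to usernames as in the step above.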
Part Two
Goal: get the followers list for each of these top 20 users.
Tools: InstaPy and Grab seem to work, but not on private accounts. The output will need to be imported and converted to CSV.
Sample:
6439777056: hackhermannjosef
2366464623: jjstronstad
16568958238: fight_for_a_future
40951851579: pfalz.max
37470761316: i_stand_for_greta
40981685338: freie_religion
36971060809: liquid_vinay
15962751241: earthspaan
12877887674: ecosenpaii
7373687310: chulifacio
16963857961: creativesforfuture
4162034534: captain_planet_daily
17538710007: patriaedecologismo
37585056146: gruene_kamen
1633492834: fantastic_earthpics
39023707441: anafranco.oficial
9600114974: extinctionrebellionbrighton
5561863638: the.lgbt.feminist
323249892: wagner_cc
source env/bin/activate
python quickstart.py (run from the instapy directory)
python3 grab.py -a hackhermannjosef -p 100
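For the convert-to-CSV step, a minimal sketch, assuming the follower output is plain text with one "user_id: username" pair per line, as in the sample above. The function names and the CSV column names are made up for illustration and are not part of InstaPy or Grab.

```python
import csv


def parse_followers(lines):
    """Parse 'user_id: username' lines into (user_id, username) tuples.

    Lines that do not match that shape (blank lines, log output) are skipped.
    """
    rows = []
    for line in lines:
        user_id, sep, username = line.strip().partition(":")
        user_id, username = user_id.strip(), username.strip()
        if sep and user_id.isdigit() and username:
            rows.append((user_id, username))
    return rows


def write_followers_csv(rows, account, out_path):
    """Write one CSV row per follower, tagged with the account they follow."""
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["account", "follower_id", "follower_username"])
        for user_id, username in rows:
            writer.writerow([account, user_id, username])
```

Tagging every row with the account name means the 20 follower files can later be concatenated into one table without losing track of who follows whom.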
Part Three
Goal: build a social network graph showing the connections between these 20 users (nodes).
Instacrawlr might do this. It is meant to work on hashtags, but it might also work on user nodes.
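If no tool does this directly, the connections could be derived from the follower lists gathered in Part Two. A rough sketch, assuming each of the 20 accounts has a set of follower usernames already loaded: it keeps only the follows between the 20 accounts themselves and writes them as an edge list. The function names and the "Source"/"Target" headers are assumptions, chosen so the file can be imported into a graph tool.

```python
import csv


def build_edges(followers_by_account):
    """Build directed (follower, followed) edges among the tracked accounts.

    followers_by_account maps an account username to the set of usernames
    that follow it. Followers outside the tracked accounts are ignored.
    """
    accounts = set(followers_by_account)
    edges = []
    for followed, followers in followers_by_account.items():
        # Keep only followers that are themselves one of the tracked accounts.
        for follower in sorted(accounts & set(followers)):
            if follower != followed:
                edges.append((follower, followed))
    return edges


def write_edge_list(edges, out_path):
    """Write the edges as a two-column CSV with a header row."""
    with open(out_path, "w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        writer.writerow(["Source", "Target"])
        writer.writerows(edges)
```

Each edge points from the follower to the account being followed, so a mutual follow shows up as two rows. Private accounts will have empty follower sets and appear only as targets of no edges.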