Skip to main content

Scrape data of all the episodes of a Tv Series from IMDB

Project description

Tvstats

Scrape data of all the episodes of a Tv Series from IMDB.

Installation

Run

python setup.py install

Dependencies

tvstats is based on Python 2.7. Requires BeautifulSoup4 for parsing, requests for downloading html. Matplotlib is required(optional) for using graph module.

Usage

Run the simple command

tvstats url

to generate json data. URL should point to homepage of a tv series. eg. http://www.imdb.com/title/tt0108778/?ref_=fn_al_tt_1

For options and help run

tvstats -h

Why?

Here are my reasons:

  • I was bored and had time to kill.

  • I love watching Tv Series. Thought it would be good to analyse some data before starting a new one.

  • Graphs are fun.

  • Lastly, I wanted to test out BeautifulSoup4 :).

Issues, Bugs, Graphs?

Let me knwow about the issues at https://github.com/leosartaj/tvstats/issues. Feel free to add new graphs or improve.

Examples

All the datasets can be found here. Graphs were made using graph function in ‘graph.py’.

Friends

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/friends.png

Game Of Thrones

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/gameOfThrones.png

Breaking Bad

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/breakingBad.png

The Big Bang Theory

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/tbbt.png

How I Met Your Mother

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/himym.png

Prison Break

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/prisonBreak.png

Hannibal

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/hannibal.png

Suits

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/suits.png

Dexter

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/dexter.png

Arrow

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/arrow.png

Person Of Interest

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/personOfInterest.png

Homeland

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/homeland.png

House Of Cards

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/houseOfCards.png

How to Get Away With Murder

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/howToGetAwayWithMurder.png

Orange Is The New Black

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/orangeIsTheNewBlack.png

Shameless

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/shameless.png

Sons Of Anarchy

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/sonsOfAnarchy.png

Spartacus

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/spartacus.png

The Walking Dead

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theWalkingDead.png

Vikings

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/vikings.png

Flash

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/flash.png

The Wire

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theWire.png

Continuum

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/continuum.png

Lost

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/lost.png

The Sopranos

https://raw.githubusercontent.com/leosartaj/tvstats/master/data/graphs/theSopranos.png

Releases

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tvstats-0.0.2.tar.gz (5.6 kB view hashes)

Uploaded Source

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page