IMDbPY Changelog

New in IMDbPY 5.0 (May 6, 2014)

  • [general]
  • Spanish, French, Arabic, Bulgarian and German translations.
  • Introduced the list of French articles.
  • fix for GAE.
  • download_applydiffs.py script.
  • fixed wrong handling of encoding in episode titles.
  • renamed README.utf8 to README.unicode.
  • [http]
  • fixed searches (again).
  • search results are always in English.
  • updated the cookies.
  • support for obtaining metacritic score and URL.
  • fixed goofs parser.
  • fixed url for top250.
  • fixes for biography page.
  • fix for quotes.
  • better charset identification.
  • category and spoiler status for goofs.
  • changed query separators from ; to &.
  • fix for episodes of unknown seasons.
  • new cookie.
  • [mobile]
  • fixed searches.
  • [sql]
  • fix for MSSQL.
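
The [http] entry about query separators (from ; to &) can be illustrated with a minimal sketch. It uses only the Python 3 standard library and is not IMDbPY's own code; the build_query helper and the parameter names are assumptions made for the example.

```python
from urllib.parse import quote_plus


def build_query(params, separator="&"):
    """Join URL-encoded key=value pairs with the given separator.

    Hypothetical helper, not part of IMDbPY.
    """
    return separator.join(
        f"{quote_plus(str(key))}={quote_plus(str(value))}"
        for key, value in params.items()
    )


# Hypothetical search parameters, chosen only for illustration.
params = {"q": "the matrix", "s": "tt"}

old_style = build_query(params, separator=";")  # pairs joined with ';'
new_style = build_query(params)                 # pairs joined with '&'

print(old_style)
print(new_style)
```

Only the character joining the pairs differs; the encoding of each key and value is the same in both forms.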

New in IMDbPY 4.9 (Jun 16, 2012)

  • [general]:
  • urls used to access the IMDb site can be configured.
  • helper functions to handle movie AKAs in various languages (code by Alberto Malagoli).
  • renamed the 'articles' module into 'linguistics'.
  • introduced the 'reraiseExceptions' option, to re-raise every caught exception.
  • [http]:
  • fix for changed search parameters.
  • introduced a 'timeout' parameter for connections to the web server.
  • fix for business information.
  • parser for the new style of episodes list.
  • unicode searches handled as iso8859-1.
  • fix for garbage in AKA titles.
  • [sql]:
  • vastly improved the store/restore of imdbIDs; now it should be faster and more accurate.
  • now the 'name' table contains a 'gender' field that can be 'm', 'f' or NULL.
  • fix for nicknames.
  • fix for missing titles in the crazy credits file.
  • handled exceptions creating indexes, foreign keys and executing custom queries.
  • fixed creation of index for keywords.
  • excluded {{SUSPENDED}} titles.
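
As an illustration of the [http] entry "unicode searches handled as iso8859-1", the sketch below shows how the same search string is percent-encoded differently depending on the charset. It uses only the Python 3 standard library; the encode_search_term helper and the sample title are assumptions for the example, not IMDbPY code.

```python
from urllib.parse import quote


def encode_search_term(term, charset="iso-8859-1"):
    """Percent-encode a search string after encoding it with `charset`.

    Hypothetical helper, not part of IMDbPY.
    """
    return quote(term.encode(charset))


title = "Amélie"  # illustrative search string

latin1_query = encode_search_term(title)                # one byte for 'é'
utf8_query = encode_search_term(title, charset="utf-8")  # two bytes for 'é'

print(latin1_query)
print(utf8_query)
```

Characters that cannot be represented in ISO 8859-1 would raise UnicodeEncodeError in this sketch; how the library itself handles that case is not stated in this changelog.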

New in IMDbPY 4.8.2 (Nov 3, 2011)

  • [general]:
  • fixed install path of locales.
  • [http]:
  • removed debug code.

New in IMDbPY 4.8 (Nov 1, 2011)

  • [general]:
  • fix for a problem managing exceptions with Python 2.4.
  • converted old-style exceptions to instances.
  • enhancements for the reduce.sh script.
  • added notes about problems connecting to IMDb's web servers.
  • improvements in the parsers of movie titles.
  • improvements in the parser of person names.
  • [http]:
  • potential fix for GAE environment.
  • handled the new style of "in production" information.
  • fix for 'episodes' list.
  • fix for 'episodes rating'.
  • fix for queries that returned too many results.
  • fix for wrong/missing references.
  • removed the no-longer-available information sets "amazon reviews" and "dvd".
  • fix for cast of tv series.
  • fix for title of tv series.
  • the BeautifulSoup parsers now work again.
  • [httpThin]:
  • removed "httpThin", falling back to "http".
  • [mobile]:
  • fix for missing headshots.
  • fix for rating and number of votes.
  • fix for missing genres.
  • many other fixes to keep up-to-date with the IMDb site.
  • [sql]:
  • fix for a nasty bug parsing notes about character names.
  • fixes for SQLite with SQLObject.

New in IMDbPY 4.2 (Oct 16, 2009)

  • The 'local' data access system is gone. See README.local.
  • the imdb.parser.common package was removed, and its code integrated in imdb.parser.sql and in the imdbpy2sql.py script.
  • fixes for the installer.
  • the helpers module contains the fullSizeCoverURL function, to convert a Movie, Person or Character instance (or a URL in a string) into a URL to the full-size version of its cover/headshot. Courtesy of Basil Shubin.
  • used a newer version of msgfmt.py, to work around a hideous bug generating locales.
  • minor updates to locales.
  • updated the DTD to version 4.2.
  • [http]
  • removed garbage at the end of quotes.
  • fixed problems parsing company names and notes.
  • keys in character's quotes dictionary are now Movie instances.
  • fixed a bug converting character entity references (affected BeautifulSoup).
  • fixed a long-standing bug handling & with BeautifulSoup.
  • top250 is now correctly parsed by BeautifulSoup.
  • [sql]
  • fixed DB2 call for loading blobs/cblobs.
  • information from obsolete files is now used if and only if it refers to still existing titles.
  • the --fix-old-style-titles argument is now obsolete.

New in IMDbPY 4.0 (Mar 13, 2009)

  • General:
  • the installer is now based on setuptools.
  • new functions get_keyword and search_keyword to handle movie's keywords (example scripts included).
  • Movie/Person/... keys (and whole instances) can be converted to XML.
  • two new functions, get_top250_movies and get_bottom100_movies, to retrieve lists of best/worst movies (example scripts included).
  • when searching for movies and persons, the 'akas' key is filled in the results, if present.
  • 'quotes' for movies is now always a list of lists.
  • the old set of parsers (based on sgmllib.SGMLParser) is gone.
  • fixed limitations handling multiple roles (with notes).
  • fixed a bug converting somethingIDs to real imdbIDs.
  • fixed some summary methods.
  • updates to the documentation.
  • HTTP:
  • adapted BeautifulSoup to lxml (internally, the lxml API is used).
  • currentRole is no longer populated for non-cast entries (everything ends up in .notes).
  • fixed a bug searching for too common terms.
  • fixed a bug identifying 'kind', searching for titles.
  • fixed a bug parsing airing dates.
  • fixed a bug searching for company names (when there's a direct hit).
  • fixed a bug handling multiple characters.
  • fixed a bug parsing episode ratings.
  • nicer keys for technical details.
  • removed the 'agent' page.
  • SQL:
  • when searching for a movie, the original titles are returned instead of AKAs.
  • support for Foreign Keys.
  • minor changes to the db's design.
  • fixed a bug populating tables with SQLAlchemy.
  • imdbpy2sql.py shows user time and system time, along with wall time.
  • Local:
  • when searching for a movie, the original titles are returned instead of AKAs.

New in IMDbPY 3.9 (Jan 6, 2009)

  • general:
  • introduced the search_episode method, to search for episodes' titles.
  • movie['year'] is now an integer, and no longer a string.
  • fixed a bug parsing company names.
  • introduced the helpers.makeTextNotes function, useful to pretty-print strings in the 'TEXT::NOTE' format.
  • http:
  • fixed a bug regarding movies listed in the Bottom 100.
  • fixed bugs about tv mini-series.
  • fixed a bug about 'series cast' using BeautifulSoup.
  • sql:
  • fixes for DB2 (with SQLAlchemy).
  • improved support for movies' aka titles (for series).
  • made imdbpy2sql.py more robust, catching exceptions even when huge amounts of data are skipped due to errors.
  • introduced CSV support in the imdbpy2sql.py script.
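
The 'TEXT::NOTE' format mentioned in the general section stores a piece of text and an optional note in one string, separated by '::'. The sketch below shows one way such strings could be split and pretty-printed. The split_text_note and format_text_note functions, the template and the sample strings are assumptions made for illustration; they are not the helpers.makeTextNotes implementation or its signature.

```python
def split_text_note(value, separator="::"):
    """Split a 'TEXT::NOTE' string into (text, note).

    The note is None when the separator is absent. Hypothetical helper.
    """
    text, found, note = value.partition(separator)
    return text, (note if found else None)


def format_text_note(value, template="{text} ({note})"):
    """Render a 'TEXT::NOTE' string; text without a note is returned as-is."""
    text, note = split_text_note(value)
    if note is None:
        return text
    return template.format(text=text, note=note)


# Sample strings invented for the example.
print(format_text_note("Some trivia::anonymous"))
print(format_text_note("Text without a note"))
```

Because partition splits on the first separator only, any later '::' stays inside the note.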