Box-Office Algorithm Predicts Revenue From Movie Screenplay

It seems like every summer, I go to see at least one horrible movie, which forces me to question the sanity of Hollywood. "How could they have thought that was going to do well?" Well, some business scholars want to take the guesswork out of the movie business. They're working on an algorithm that can (prest-o, change-o!) distill a screenplay's text into a box office tally.

Imagine a world where Hollywood producers could predict, with scientific precision, the box office revenue a movie will generate just by reading the screenplay. A new forecasting model devised by a trio of marketing professors from Wharton and NYU promises to deliver something like that. Among their findings: action movies with multidimensional conflicts are the most surefire investments, and horror films the riskiest.

Read the full story at Freakonomics at The New York Times.

The full paper is fascinating [pdf], mostly for the factoids. For example, they use a common natural language processing method called "bag-of-words," which treats each script as a pile of word counts and ignores word order. Here are the 30 most common words (all forms included) in their dataset of movie scripts. The f-word, man, dad, mom: those I can understand. But how about "corridor"? Then start thinking of all the movies in which someone walks/runs/fights down a corridor. (So many!) Note "chamber" and "tunnel" as well. It's like this study discovered a hidden truth about the way Hollywood architecture has to work.

[Image: the 30 most common words in the dataset of movie scripts]
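For the curious, here is roughly what a bag-of-words count looks like in Python. This is a minimal sketch, not the authors' method: the function names and the two tiny "scripts" are made up for illustration, and unlike the paper's list it does not merge different forms of the same word or filter out filler words like "the."

```python
import re
from collections import Counter


def bag_of_words(text):
    """Count how often each lowercase word appears, ignoring word order."""
    tokens = re.findall(r"[a-z']+", text.lower())
    return Counter(tokens)


def top_words(scripts, n=30):
    """Sum the per-script counts and return the n most frequent words."""
    totals = Counter()
    for script in scripts:
        totals.update(bag_of_words(script))
    return totals.most_common(n)


# Hypothetical stand-ins for screenplay text, invented for this example.
scripts = [
    "He runs down the corridor. The corridor is dark.",
    "Dad waits in the tunnel. Mom runs to the corridor.",
]

print(top_words(scripts, n=3))
```

Even on two sentences, "corridor" lands near the top, right behind "the." A real analysis would likely drop such filler words first and group word forms together before ranking anything.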

Alexis C. Madrigal

Alexis Madrigal is the deputy editor of TheAtlantic.com, where he also oversees the Technology Channel. He's the author of Powering the Dream: The History and Promise of Green Technology.

The New York Observer has called Madrigal "for all intents and purposes, the perfect modern reporter." He co-founded Longshot magazine, a high-speed media experiment that garnered attention from The New York Times, The Wall Street Journal, and the BBC. While at Wired.com, he built Wired Science into one of the most popular blogs in the world. The site was nominated for best magazine blog by the MPA and best science Web site in the 2009 Webby Awards. He also co-founded Haiti ReWired, a groundbreaking community dedicated to the discussion of technology, infrastructure, and the future of Haiti.

He's spoken at Stanford, CalTech, Berkeley, SXSW, E3, and the National Renewable Energy Laboratory, and his writing was anthologized in Best Technology Writing 2010 (Yale University Press).

Madrigal is a visiting scholar at the University of California at Berkeley's Office for the History of Science and Technology. Born in Mexico City, he grew up in the exurbs north of Portland, Oregon, and now lives in Oakland.
