Detecting Bursty Terms in Computer Science

burst-detection

Research topics rise and fall in popularity over time, some more swiftly than others. The fastest rising topics are typically called bursts; e.g. "deep learning", "internet of things" and "big data". Being able to detect and track bursty terms in the literature could give insight into how scientific thought evolves over time.

In this repository, we take a trend detection algorithm from technical stock market analysis and apply it to 31 years of computer science research abstracts, treating the prevalence of each term in the dataset like the price of a stock. Unlike previous work in this domain, we use the free text of abstracts and titles, resulting in a finer-grained analysis. We report a list of bursty terms, then use historical data to build a classifier to predict whether they will rise or fall in popularity in the future, obtaining accuracy in the region of 80%. As a consequence, we now have a pipeline that can be applied to any time-ordered collection of text to yield past and present bursty terms and predict their probable fate.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Detecting Bursty Terms in Computer Science.ipynb		Detecting Bursty Terms in Computer Science.ipynb
README.md		README.md
clusters.xlsx		clusters.xlsx

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Detecting Bursty Terms in Computer Science

About

Releases

Packages

Languages

ssr01357/burst-detection

Folders and files

Latest commit

History

Repository files navigation

Detecting Bursty Terms in Computer Science

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages