Skip to content

Latest commit

 

History

History
33 lines (17 loc) · 2.1 KB

README.md

File metadata and controls

33 lines (17 loc) · 2.1 KB

cinematic

This project preserves the information about 732 Chinese films produced in the "Seventeen Years Period" (1949-1966).

Data

Sources

Modelled data of the 656 entries in The Catalogue of Chinese Artistic Films (中国艺术影片编目; China Film Archive, 1982) with romanised title, original title, translated title, release year, production, colour, length in film reels, and recorded special aspects

The staff information and plot summary of the 656 entries, separated from the main file due to the large file sizes

An extra collection of 76 entries not included by the book

Some formatted csv files that can be directly imported to Gephi for social network analysis (7293 nodes and 259622 edges)

OCR source data scanned from The Catalogue of Chinese Artistic Films (the source data may contain some missing attributes, which have been fixed in the main metadata file)

Results of Topic Modelling generated with a Gensim model trained with 9312 stopwords (including 7150 names of fictional characters from the OCR results) and 16 topics

Visualisation

A filmography visualiser in JavaScript for previewing the dataset

A Plotly scatter map of filmmakers/actors/actresses' geographical movement

A simplified 2d graph Jupyter Notebook of film counts by year and region

A Gephi file of a social network generated based on the staff information of all 732 entries

A Jupyer Notebook of topic modelling restuls for plot summaries with PyLDAvis graph

Visualisation of Crowd Size Distribution in All Films