Skip to content

extendi/les-miserables

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 

Repository files navigation

Les Misérables

The detection of the most important nodes in a graph-database (or network dataset) is a problem of wide interest in various scientific disciplines, particularly in the data sciences.

Basically any dataset that can be described through a set of nodes or entities, and a set of links describing how these entities interact or relate between them. In a network graph people are represented by nodes, and links represent the different social ties between them (friendship, collaboration, common hobbies, etc.).

The file les_miserables.gml or les_miserables.json are the network representation of the interactions between major characters in the novel Les Misérables The weight of the edge (attribute value in the file) between each node also indicates how often co-appearances between characters occur throughout the novel. Therefore, a thicker edge informs us that those characters appear together more often than those with a thinner edge.

Project scope

Generate the network graph on Neo4j, you can start a local server using the Neo4j docker image. Create a ruby project/script to import the network and run some analysis on it. If you are more confident with a different programming language, use it!

  1. Add nodes/edges to the graph;
  2. run a clustering algorithm to detect main charachers groups;
  3. detect most central characters;
  4. try to visualize the network and cluster using the library you prefer, could be js, d3, networkx, or whathever library you prefer.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published