This repository contains the code for implementing the algorithms given in Krishnaraj, P. M., Ankith Mohan, and K. G. Srinivasa. "Performance of procedures for identifying influentials in a social network: prediction of time and memory usage as a function of network properties." Social Network Analysis and Mining 7.1 (2017): 34..
To execute the scripts, do the following:
-
In params.csv, edit the values in column 2, the values after each of the commas.
- Do not change column 1, i.e, Input file, File separator, File header, Output path or Seed.
- Do not give a space after the comma.
- Failure to comply with either of the above steps will result in code failure or abnormal behavior.
-
Set the current directory to the
Influence
directory. -
Run
source("scripts/run.R")
The output will be the following directories:
- abscut: An eps file for each of the detected cluster with its influentials identified by the absolute cut score method in red.
- cluster: An eps file for each of the detected cluster.
- fixcut: An eps file for each of the detected cluster with its influentials identified by the fixed percentage of propulation method in blue.
- randhist: A png file for each of the detected cluster with the frequency as function of the in-degree of each of the vertices in this cluster. All vertices with in-degree greater than its random counterpart at 95% level of significance is considered an influential.
- randplot: An eps file for each of the detected cluster with its influentials identified by the random permutation method in chocolate.
- sample: An eps file for each of the samples until the statistically significant sample is found using the stopping criteria. Each eps file shows the clusters in a different color.
- screeplot: An eps file for each of the detected cluster with the screeplot of the indegree of all the vertices corresponding to this cluster:
- A vertical blue line for fixed percentage of population method. All vertices to the left of this line are influentials.
- A horizontal red line for absolute cut score method. All vertices above this line are influentials.
- A horizontal green line for standard deviation method. All vertices above this line are influentials.
- sdcut: An eps file for each of the detected cluster with its influentials identified by the standard deviation method in green.