NetMax - Influence Maximization in Social Networks

NetMax is a Python library that provides the implementation of several algorithms for the problem of Influence Maximization in Social Networks, originally formulated in "Maximizing the Spread of Influence through a Social Network" (Kempe, Kleinberg and Tardos, 2003). NetMax is built upon NetworkX, a popular python library for working with graphs. It also addresses the problem of Competitive Influence Maximization, as an extensive-form strategic game setting in which multiple entities try to maximize their own influence across the network while minimizing the others'. It works with both signed and unsigned networks, implementing progressive, semi-progressive and non-progressive diffusion models.

Table of Contents:

NetMax - Influence Maximization in Social Networks

Installation

To install the library you can run the following command:

pip install netmax

Requirements

NetMax was developed with Python 3.12 and requires the installation of the following libraries:

networkx (version 3.3)
numpy
scipy
tqdm
heapdict

If you have already installed the library using pip, you don't need to install the requirements, otherwise you can simply run the following command:

pip install -r requirements.txt

Overview

This framework wants to be a useful tool for all those people who study the problem of Influence Maximization. The users instantiate the InfluenceMaximization class, setting some basic parameters:

input_graph: a directed graph representing the network (of type networkx.DiGraph)
agents: a dictionary where the key is the agent name (str) and the value is his budget (int)
alg: the algorithm to use for influence maximization (see Algorithms)
diff_model: the diffusion model to use (see Diffusion models)
inf_prob: the probability distribution used to generate (if needed) the probabilities of influence between nodes. The framework implements different influence probabilities, default is None (see Influence probabilities)
endorsement_policy: the policy that nodes use to choose which agent to endorse when they have been contacted by more than one agent. The framework implements different endorsement policies, default is 'random' (see Endorsement policies)
insert_opinion: True if the nodes do not contain any information about their opinion on the agents, False otherwise (or if the opinion is not used)
inv_edges: a bool indicating whether to invert the edges of the graph
first_random_seed: a bool indicating whether to insert a first node (chosen randomly) in the seed set of every agent
r: number of simulations to execute (default is 100)
verbose: if True sets the logging level to INFO, otherwise displays only the minimal information

Important: alg, diff_model, inf_prob and endorsement_policy can be either str or class parameters:

If they are str parameters, they represent the name attribute of the corresponding class already present in the framework. This was done in order to prevent the user from directly importing and instantiating all the specific classes, which could have not been user-friendly. To view all the keywords for these parameters, see the corresponding section
Otherwise, they must extend the corresponding superclass depending on the parameters (Algorithm for alg, DiffusionModel for diff_model, InfluenceProbability for inf_prob, EndorsementPolicy for endorsement_policy). This way, the user can define his own custom classes

After creating the InfluenceMaximization object, the user may call its run() method, which returns:

seed: a dictionary where the key is the agent name and the value is the seed set found
spread: a dictionary where the key is the agent name and the value is the expected spread
execution_time: the total execution time (in seconds)

All these values are also available in the result attribute (which is a dictionary) of the InfluenceMaximization object.

Algorithms

NetMax provides the implementation of many state-of-the-art algorithms.

Simulation-based

Monte-Carlo Greedy: implemented by the class MCGreedy (keyword: mcgreedy)
CELF: implemented by the class CELF (keyword: celf)
CELF++: implemented by the class CELF_PP (keyword: celfpp)

Proxy-based

Highest Out-Degree Heuristic: implemented by the class HighestOutDegree (keyword: outdeg)
Degree Discount: implemented by the class DegDis (keyword: degdis)
Group PageRank: implemented by the class Group_PR (keyword: group_pr)

Sketch-based

StaticGreedy: implemented by the class StaticGreedy (keyword: static_greedy)
RIS: implemented by the class RIS (keyword: ris)
TIM: implemented by the class TIM (keyword: tim)
TIM+: implemented by the class TIMp (keyword: tim_p)

Diffusion models

The supported diffusion models are:

Independent Cascade: implemented by the class IndependentCascade (keyword: ic)
Linear Threshold: implemented by the class LinearThreshold (keyword: lt)
Triggering Model: implemented by the class Triggering (keyword: tr)
Decreasing Cascade: implemented by the class DecreasingCascade (keyword: dc)
Semi-Progressive Friend-Foe Dynamic Linear Threshold: implemented by the class SemiProgressiveFriendFoeDynamicLinearThreshold (keyword: sp_f2dlt)
Non-Progressive Friend-Foe Dynamic Linear Threshold: implemented by the class NonProgressiveFriendFoeDynamicLinearThreshold (keyword: np_f2dlt)

Influence probabilities

The influence probabilities are used to label the edges between the network nodes if they are not already labeled. The user can choose between:

A constant value, set by default at 0.1 (keyword: constant)
A uniform distribution between 0.01 and 0.1 (keyword: uniform)
A distribution based on similarity between nodes computed with SimRank algorithm (keyword: similarity)
A ratio model which distributes the probability uniformly based on the in-degree of the target node (keyword: ratio)
A hybrid approach based on the average degree of the graph (keyword: hybrid)
An opinion-based approach (keyword: opinion) which assigns to each node a vector of opinions (namely, values between 0 and 1) and computes the influence probability comparing the opinions of the two nodes with cosine similarity and taking into account also their SimRank similarity, with the formula:

$p(u,v)=b+k*\left(\frac{1}{outdeg(u)}*similarity(u,v)+cossim(opinion(u),opinion(v))\right)$

Endorsement policies

In the competitive setting it is possible that, in the same time step, multiple agents contact the same node. Therefore, it is necessary an endorsement policy that dictates which agent the node chooses to endorse. Several endorsement policies are implemented:

A random policy, which chooses randomly between the agents that contacted the node in that specific time step (keyword: random)
A voting-based policy, which chooses the most occurring agent between the already activated neighbors of the node (keyword: voting)
A community-based approach, which applies the voting strategy to the community the node belongs to instead of its neighbors (keyword: community)
A similarity-based policy, which essentially is a weighted voting strategy based on the SimRank similarity between the node and its neighbors (keyword: sim_endorsement)

Useful papers

Here is a non-exhaustive list of useful papers which have been studied thoroughly to develop this framework:

Bharathi et al. - Competitive Influence Maximization in Social Networks
Borgs et al. - Maximizing Social Influence in Nearly Optimal Time
Borodin et al. - Threshold Models for Competitive Influence in Social Networks
Budak et al. - Limiting the Spread of Misinformation in Social Networks
Carnes et al. - Maximizing influence in a competitive social network
Chen et al. - Efficient Influence Maximization in Social Networks
Chen et al. - StaticGreedy Solving the Scalability-Accuracy Dilemma in Influence Maximization
Goyal et al. - CELF++ Optimizing the Greedy Algorithm for Influence Maximization in Social Networks
Goyal et al. - Learning Influence Probabilities In Social Networks
Goyal et al. - SimPath An Efficient Algorithm for Influence Maximization under the Linear Threshold Model
Gursoy et al. - Influence Maximization in Social Networks Under Deterministic Linear Threshold Model
Huang et al. - Competitive and complementary influence maximization in social network A follower's perspective
Kempe et al. - Influential Nodes in a Diffusion Model for Social Networks
Kempe et al. - Maximizing the Spread of Influence through a Social Network
Kong et al. - Online Influence Maximization under Decreasing Cascade Model
Leskovec et al. - Cost-Effective Outbreak Detection in Networks
Li et al. - GetReal Towards Realistic Selection of Influence Maximization Strategies in Competitive Networks
Lin et al. - A Learning-based Framework to Handle Multi-round Multi-party Influence Maximization on Social Networks
Liu - Influence maximization in social networks An ising-model-based approach
Liu et al. - Influence Maximization over Large-Scale Social Networks A Bounded Linear Approach
Lu et al. - From Competition to Complementarity Comparative Influence Diffusion and Maximization
Calio, Tagarelli - Complex influence propagation based on trust-aware dynamic linear threshold models
Tang et al. - Influence Maximization in Near-Linear Time A Martingale Approach
Tang et al. - Influence Maximization Near Optimal Time Complexity Meets Practical Efficiency
Wang et al. - Community-based Greedy Algorithm for Mining Top-K Influential Nodes in Mobile Social Networks
Zhou et al. - UBLF An Upper Bound Based Approach to Discover Influential Nodes in Social Networks
Zhu et al. - Minimum cost seed set for competitive social influence

Example

The following is a short example of how to create a simple influence maximization game with two agents. The data is read from a txt file with a function defined in the utils.py file, but alternatively the user can directly input a networkx.DiGraph object with the methods provided by networkx.

import utils
import influence_maximization as im

g = utils.read_adjacency_matrix("../data/network.txt")
# Dictionary <agent_name: agent_budget>
agents = {
    'Agent_0': 10
    'Agent_1': 10
}
im_instance = im.InfluenceMaximization(input_graph=g, agents=agents, alg='tim_p', 
                                        diff_model='ic', inf_prob='random', r=1000,
                                        insert_opinion=False, endorsement_policy='random', verbose=True)
seed, spread, execution_time = im_instance.run()
print(f"Seed sets found: {seed}")
print(f"Spreads: {spread}")
print(f"Total execution time: {execution_time}")

Credits

The creators of NetMax are Lorenzo Bloise and Carmelo Gugliotta.

This project has been developed as a coursework assigned for the class "Social Networks and Media Analysis", Spring 2024 (Instructor: prof. Andrea Tagarelli) at the DIMES Department, University of Calabria, Italy.

Name		Name	Last commit message	Last commit date
Latest commit History 83 Commits
.idea		.idea
__pycache__		__pycache__
build/lib/netmax		build/lib/netmax
data		data
demo		demo
netmax		netmax
tests		tests
.DS_Store		.DS_Store
.gitignore		.gitignore
LICENSE.txt		LICENSE.txt
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NetMax - Influence Maximization in Social Networks

Installation

Requirements

Overview

Algorithms

Simulation-based

Proxy-based

Sketch-based

Diffusion models

Influence probabilities

Endorsement policies

Useful papers

Example

Credits

About

Releases 2

Packages

Contributors 2

Languages

License

lorenzobloise/netmax

Folders and files

Latest commit

History

Repository files navigation

NetMax - Influence Maximization in Social Networks

Installation

Requirements

Overview

Algorithms

Simulation-based

Proxy-based

Sketch-based

Diffusion models

Influence probabilities

Endorsement policies

Useful papers

Example

Credits

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 2

Packages 0

Contributors 2

Languages

Packages