WoS-disambiguation

Project to compare and develop disambiguation solutions for Web fo Science and beyond

Set up

Environment

Install all packages

conda env create -f environment.yml
conda install -y -c bioconda -c conda-forge snakemake # TODO: put this into environement.yml

Create a kernel for the virtual environment that you can use in Jupyter lab/notebook.

python -m ipykernel install --user --name WoS-disambiguation

Password and Username for ElasticSearch

Set up the config.yaml file, i.e.,

cp workflow/config.template.yaml workflow/config.yaml

Then, set your password and username for the ElasticSearch server:

data_dir: "./data/"
es_password: "password for ElasticSearch"
es_username: "usename for ElasticSearch"
es_endpoint: "localhost:9200/wos/_search/"
shared_dir: "/gpfs/sciencegenome/WoS-disambiguation"

Don't worry. The config.yaml is gitignored and won't be pushed to the remote.

Connect your computer to the ElasticSearch server

Establish the ssh tunneling:

username={your username in the server}
privatekey={your private key to ssh to the server}
server=iuni2.carbonate.uits.iu.edu
port=9200
ssh -i $privatekey -N -L $port:$username@localhost:$port $server

Linking to the shared directory

(Updated) Because the IO is the major bottleneck of the Leiden algorithm, I recommend copying the entire shared folder to your local:

cp -r /gpfs/science-genome/WoS-disambiguation/ data/

If the disk space is the concern, use the symbolic link, i.e., under the root of this repository,

ln -s /gpfs/science-genome/WoS-disambiguation/ data/

Packages

numpy
scipy
pandas
networkx
tqdm
snakemake
requests
sqlite3
joblib
hashlib

Name		Name	Last commit message	Last commit date
Latest commit History 117 Commits
data		data
libs/leiden_algorithm/leiden_algorithm		libs/leiden_algorithm/leiden_algorithm
notebooks		notebooks
workflow		workflow
.gitignore		.gitignore
README.md		README.md
Snakefile		Snakefile
environment.yml		environment.yml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

WoS-disambiguation

Set up

Environment

Password and Username for ElasticSearch

Connect your computer to the ElasticSearch server

Linking to the shared directory

Packages

About

Releases

Packages

Contributors 3

Languages

iuni-cadre/WoS-disambiguation

Folders and files

Latest commit

History

Repository files navigation

WoS-disambiguation

Set up

Environment

Password and Username for ElasticSearch

Connect your computer to the ElasticSearch server

Linking to the shared directory

Packages

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Languages

Packages