Skip to content

FunctionLab/sleipnir

Folders and files

NameName
Last commit message
Last commit date
Jan 21, 2022
Nov 9, 2012
Sep 17, 2020
Aug 3, 2010
Apr 12, 2022
Jul 11, 2022
Jul 11, 2022
Jul 11, 2022
Apr 26, 2022
Sep 17, 2020
Nov 14, 2013
Mar 4, 2011
Jan 14, 2022
Apr 8, 2008
Mar 4, 2011
Apr 23, 2013
Apr 8, 2008
Mar 23, 2008
Mar 4, 2011
Apr 23, 2013
Jan 21, 2022
Jun 25, 2013
Nov 27, 2017
Apr 30, 2013
Feb 20, 2015
Oct 11, 2013
Aug 1, 2013
Sep 10, 2015
Jul 31, 2012
Jul 23, 2012
Apr 7, 2008
Mar 29, 2011
Feb 7, 2015
Apr 7, 2008
Apr 7, 2008
Apr 7, 2008

Repository files navigation

Overview:

Sleipnir is a C++ library enabling efficient analysis, integration, mining, and machine learning over genomic data. This includes a particular focus on microarrays, since they make up the bulk of available data for many organisms, but Sleipnir can also integrate a wide variety of other data types, from pairwise physical interactions to sequence similarity or shared transcription factor binding sites.

Docs:

Main documentation: https://functionlab.github.io/sleipnir-docs/

The Sleipnir wiki and bug reporting system are at: (TBD)

The file README.developer has notes for Sleipnir developers.

Sleipnir also includes the code to compile SEEK (the human coexpression search engine). See the link http://seek.princeton.edu/installation.jsp for information on its installation.

Code:

The latest version of Sleipnir software can be obtained by issuing the following command:

git clone https://github.com/FunctionLab/sleipnir.git

Build:

  1. Install g++, cmake

  2. Install libraries

    • On Mac:
      • brew install libsvm
      • brew install libomp
      • brew install thrift
      • brew install gsl
      • brew install boost
    • On CentOS Linux:
      • sudo yum install libsvm
      • sudo yum install libgomp
      • sudo yum install thrift-devel
      • sudo yum install gsl
      • sudo yum install boost
    • On Ubuntu Linux:
      • apt-get update
      • apt-get install build-essential
      • apt-get install libsvm-dev
      • apt-get install libomp-dev
      • apt-get install libthrift-dev
      • apt-get install libgsl-dev
      • apt-get install libboost-dev
      • apt-get install libboost-graph-dev
      • apt-get install libboost-regex-dev
      • apt-get install libreadline-dev
  3. Clone repository

  4. Prep make files with cmake

    • mkdir Debug
    • cd Debug/
    • cmake -DCMAKE_BUILD_TYPE=Debug ..
    • Alternately replace 'Debug' with 'Release' in all the above commands to make the release build
  5. Build the code

    • (On Mac) - Edit sleipnir/src/libsvm.h
      • Replace: #include <libsvm/svm.h>
      • With: #include <svm.h>
    • cd Debug/
    • make
      • In case of errors:
        • make clean
        • make VERBOSE=1
  6. [Optional] Install SVM_PERF libraries to build: Data2SVM, SVMperfer, SVMperfing, SVMfe, SVMer

Tests:

  1. One-time prep: create the conda environment (by default this will create the 'genomics' conda env)

    • conda env create --file scripts/seek/conda_environment.yml
  2. Run the c++ unit tests

    • Debug/tests/unit_tests
  3. Test the scripts for building and merging SEEK database compendiums

    • conda activate genomics
    • python -m pytest -s -v scripts/seek/tests
  4. Run the SEEK system tests (test SeekMiner and SeekRPC)

    • conda activate genomics
    • python -m pytest -s -v tests/
  5. Run Seek DB tests (test that the database gives expected bio-informative results). These tests can only be run where the full SEEK database is installed.

    • cd tests/bioinform_tests
    • PREP: Install and init Git LFS (Large File Storage)
      • On Mac: brew install git-lfs
      • On Centos: yum install git-lfs
      • On Ubuntu: apt-get install git-lfs
      • Initialize git-lfs: git lfs install
      • Refresh the gold standard tgz files (should be multipe MB in size)
        • rm gold_standard_results/*
        • git restore gold_standard_results/*
    • Run the tests: (The bioinform test has an option for different lengths of test, i.e. how many queries are run)
      • bash run_paramtest.sh -v -s <path_to_seek_db> -b <path_to_seek_binaries>
      • bash run_querysize.sh -v -s <path_to_seek_db> -b <path_to_seek_binaries>
      • bash run_bioinform.sh -v -s <path_to_seek_db> -b <path_to_seek_binaries> -t [tiny,short,medium,long]

About

No description, website, or topics provided.

Resources

License

Unknown, Unknown licenses found

Licenses found

Unknown
LICENSE
Unknown
COPYING

Stars

Watchers

Forks

Packages

No packages published