lbs-tools

sequence

MMseq2 clustering

from lbs.sequences import MMSeqsClusterer

mmseq = MMSeqsClusterer(mmseqs_loc='path_to_mmseq2_bin', tmp_dir='/tmp/')

list of sequences to fasta file

from lbs.sequences import seq_list_to_fasta

seq_list_to_fasta(sequence_list: List[str], path: str = 'file.fasta')

run psiblast

from lbs.sequences import run_psiblast

run_psiblast(path_psiblast: str, #path to psiblast directory
             fasta_file: str, #input .fasta file
             max_target_seqs: int =2, dbtype: str ='prot')

md

from lbs.md import Params, OpenMM

params = Params() # MD params object
path_pdb = 'example.pdb'

mm = OpenMM(params)
mm.prepare_components() # initialize force field and langevin integrator
pdb = mm.prepare_pdb(path_pdb) # fix pdb issues & add solvent

df = mm.run(pdb, 'result.pdb') # run simulation store structure in 'result.pdb'
                               # df - energy etc. over time

coloring sequence

from lbs.utils.color_sequence import DivColorScaling
from IPython.display import HTML, display
clr = DivColorScaling()
# change color map
clr.cmap = plt.cm.Greens

sequence = "LBS is the best!"
# use float 0-1 range or int 0-255
seq_importance = [0, 125, 130, 140, 150, 200, 0, 50, 250, 100, 100, 100, 60, 60, 60, 60, 60]
# iterate over sequence and importance
html_string = ''
for letter, importance in zip(sequence, seq_importance):
    html_string += clr.html_colored_letter(letter, importance)
# it may not be viewed correctly :)
HTML(html_string)

LBS is the best!

scripts

calculate protein sequence embeddings

Script to create embeddings from sequences via prot_t5_xl_half_uniref50 stored in dataframe by default seq column in used as embedder input. Records are stored as list maintaining dataframe order. Example use

python scripts/embeddings.py -i data.csv -o data.emb

In python load via:

import torch
torch.load(..)

or

import pickle
with open(.., 'rb') as f:
    embs = pickle.load(f)

Name		Name	Last commit message	Last commit date
Latest commit History 138 Commits
.ipynb_checkpoints		.ipynb_checkpoints
lbs		lbs
scripts		scripts
.gitignore		.gitignore
README.md		README.md
__init__.py		__init__.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

lbs-tools

sequence

MMseq2 clustering

list of sequences to fasta file

run psiblast

md

coloring sequence

scripts

calculate protein sequence embeddings

About

Releases

Packages

Contributors 8

Languages

labstructbioinf/lbs-tools

Folders and files

Latest commit

History

Repository files navigation

lbs-tools

sequence

MMseq2 clustering

list of sequences to fasta file

run psiblast

md

coloring sequence

scripts

calculate protein sequence embeddings

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 8

Languages

Packages