Skip to content

This repository contains the link for all Position Specific Scoring Matrices(PSSMs) of all protein in SwissProt(UniProtKB)

License

Notifications You must be signed in to change notification settings

gozsari/PSSMs-SwissProt

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

17 Commits
 
 
 
 
 
 
 
 

Repository files navigation

PSSMs-SwissProt

This repository contains the link for all Position-Specific Scoring Matrices(PSSMs) of all protein in SwissProt(UniProtKB). You can download PSSM files of all proteins in SwissProt from here. These PSSMs are created by using PSI BLAST and UniRef50 database from UniProtKB.

What is a PSSM?

A PSSM, or Position-Specific Scoring Matrix, is a type of scoring matrix used in protein BLAST searches in which amino acid substitution scores are given separately for each position in a protein multiple sequence alignment. Thus, a Tyr-Trp substitution at position A of an alignment may receive a very different score than the same substitution at position B. This is in contrast to position-independent matrices such as the PAM and BLOSUM matrices, in which the Tyr-Trp substitution receives the same score no matter at what position it occurs.

PSSM scores are generally shown as positive or negative integers. Positive scores indicate that the given amino acid substitution occurs more frequently in the alignment than expected by chance, while negative scores indicate that the substitution occurs less frequently than expected. Large positive scores often indicate critical functional residues, which may be active site residues or residues required for other intermolecular interactions.

PSSMs can be created using PSI-BLAST, which finds similar protein sequences to a query sequence, and then constructs a PSSM from the resulting alignment. Alternatively, PSSMs can be retrieved from the NCBI CDD database, since each CD is represented by a PSSM that encodes the observed substitutions in the seed alignments. These CD records can be found either by text searching in Entrez Conserved Domains or by using RPS-BLAST (Reverse Position-Specific BLAST), also known as CD-Search, to locate these domains on an input protein sequence.

This information is taken from here.

The following is a screenshot of a sample PSSM.

alt text

What is PSI BLAST?

PSI-BLAST (Position-Specific Iterative Basic Local Alignment Search Tool) derives a position-specific scoring matrix (PSSM) or profile from the multiple sequence alignment of sequences detected above a given score threshold using protein–protein BLAST. This PSSM is used to further search the database for new matches, and is updated for subsequent iterations with these newly detected sequences. Thus, PSI-BLAST provides a means of detecting distant relationships between proteins. In this chapter, we discuss practical aspects of using PSI-BLAST and provide a tutorial on how to uncover distant relationships between proteins and use them to reach biologically meaningful conclusions.

This information is taken from here.

alt text

About

This repository contains the link for all Position Specific Scoring Matrices(PSSMs) of all protein in SwissProt(UniProtKB)

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published