<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP18839

Description TATA-box binding protein
SequenceMDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTSSLSILEEQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQVAQQSTSQQAAQVASGQAPPLFHSQTLTTAPLPGTTPLYPSPLTPMTPITPATPASESSGIVPQLQNIVSTVNLGCKLDLKTIALRARNAEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAKSEEQSRLAARKYARVVQKLGFPAKFLDFKIQNMVGSCDVKFPIRLEGLVLTHQQFSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTGAKVRAEIYEAFENIYPILKGFRKTT
Length335
PositionTail
OrganismPropithecus coquereli (Coquerel's sifaka) (Propithecus verreauxi coquereli)
KingdomMetazoa
LineageEukaryota> Metazoa> Chordata> Craniata> Vertebrata> Euteleostomi> Mammalia> Eutheria> Euarchontoglires> Primates> Strepsirrhini> Lemuriformes> Indriidae> Propithecus.
Aromaticity0.07
Grand average of hydropathy-0.476
Instability index65.29
Isoelectric point9.80
Molecular weight37348.35
Publications

Function

Annotated function
GO - Cellular Component
cytoplasm	GO:0005737	IEA:Ensembl
euchromatin	GO:0000791	IEA:Ensembl
female pronucleus	GO:0001939	IEA:Ensembl
male pronucleus	GO:0001940	IEA:Ensembl
transcription factor TFIIA complex	GO:0005672	IEA:Ensembl
transcription factor TFIID complex	GO:0005669	IEA:Ensembl
transcription preinitiation complex	GO:0097550	IEA:Ensembl
GO - Biological Function
aryl hydrocarbon receptor binding	GO:0017162	IEA:Ensembl
enzyme binding	GO:0019899	IEA:Ensembl
RNA polymerase II cis-regulatory region sequence-specific DNA binding	GO:0000978	IEA:Ensembl
RNA polymerase II core promoter sequence-specific DNA binding	GO:0000979	IEA:Ensembl
RNA polymerase II general transcription initiation factor activity	GO:0016251	IEA:Ensembl
RNA polymerase II repressing transcription factor binding	GO:0001103	IEA:Ensembl
RNA polymerase III general transcription initiation factor activity	GO:0000995	IEA:Ensembl
TFIIB-class transcription factor binding	GO:0001093	IEA:Ensembl
GO - Biological Process
RNA polymerase II preinitiation complex assembly	GO:0051123	IEA:Ensembl

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP18839
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|     100.12|      16|      16|      58|      73|       1
---------------------------------------------------------------------------
   58-   73 (35.81/10.74)	QQQQQQQQQQQQQQQQ
   75-   89 (33.45/ 9.64)	QQQQQQQQQQQQQQ.Q
   90-  105 (30.87/ 8.43)	QQQQQQQVAQQSTSQQ
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     213.35|      62|      88|     160|     221|       2
---------------------------------------------------------------------------
  160-  221 (107.88/68.28)	QLQNIVSTVNLGCKLDLKTIALRARN.AEYNPKRFAAVIMRIREPRTTALIFSSGKMVCTGAK
  250-  312 (105.47/66.62)	KIQNMVGSCDVKFPIRLEGLVLTHQQfSSYEPELFPGLIYRMIKPRIVLLIFVSGKVVLTGAK
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      51.33|      15|      16|     123|     137|       3
---------------------------------------------------------------------------
  123-  137 (28.75/16.30)	LT.TAPLPGTTPLYPS
  139-  154 (22.58/11.27)	LTpMTPITPATPASES
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      65.08|      19|      21|       9|      28|       4
---------------------------------------------------------------------------
    9-   28 (33.55/18.13)	PYAQGLaSPQG.AMTPGIPIF
   33-   52 (31.52/13.22)	PYGTGL.TPQPiQNTSSLSIL
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP18839 with Med15 domain of Kingdom Metazoa

Intrinsically Disordered Regions

IDR SequenceStartStop
1) MDQNNSLPPYAQGLASPQGAMTPGIPIFSPMMPYGTGLTPQPIQNTSSLSILEEQQRQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQQVAQQSTSQQAAQVASGQAPPLFHSQTLTTAPLPGTTPLYPSPLTPMTPITPATPASESSGIV
1
158

Molecular Recognition Features

MoRF SequenceStartStop
NANANA