<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP23954

Description Uncharacterized protein
SequenceAQLQPEARGRIVDKIMDTLKNHLPVSVSLAPEALSELKKIAARFEEKVYTEATSQYDYLRKISVKLLSMETQRQQAAGNAQLIPNQNNPAPGLHPQGSNQAQTSAVPLMSQQQARQPNASTSVQASSLTNIRQNLPGVNQTSTMQTASVIPQNTMNNDLAQGTSQDGYAAQRQMAGRQQQQQSQQLIYHQHQQPSLQSQQPNIPLQQQQQQLMGQQPNLQQNQLIGQQNGAVMMQQQQRLAVQSNNLLNVQQTHQMLNQQYIPLYQPQQLGSQANMSSLQQHQQNQQQQQLFGTVPNVSSMQWMHMQQRKAQQPQQQHAQQPPMGLMQPQFQHNQLQQLQHLMPQFQSQPNQLQEQLRMQQQSSMQQRLQTSGAMLLQQNNMDQQNQFIQAPRGLQEVSSSTSADSTAQTGHASAGDWQEEIHQMIKRLKDQYFAELSELFNKMCVKLQHVDSIIPPQISSEQYDKMKSFKIMLERILQMLQIGKSSVQPSMMDNVPRYEKQIISILNSQRKPVQPQIHQQFQPPAG
Length527
PositionTail
OrganismAegilops tauschii subsp. strangulata (Goatgrass)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> Liliopsida> Poales> Poaceae> BOP clade> Pooideae> Triticodae> Triticeae> Triticinae> Aegilops.
Aromaticity0.04
Grand average of hydropathy-0.870
Instability index67.64
Isoelectric point9.46
Molecular weight59921.83
Publications
PubMed=25035499
PubMed=29158546

Function

Annotated function
GO - Cellular Component
nucleus	GO:0005634	IEA:UniProtKB-SubCell
GO - Biological Function
chromatin DNA binding	GO:0031490	IEA:InterPro
transcription coactivator activity	GO:0003713	IEA:InterPro
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP23954
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     641.29|     124|     126|     161|     286|       1
---------------------------------------------------------------------------
   51-  157 (132.20/28.85)	E..ATSQYDYL.........R.KISvkllsmetQRQQAAGNAQL..IPNQnnpapglHPQGSNQAQTSAVPLMS.QQ.Q...AR...QPNAST....SVQASSLT......NIRQ........NLPGV....NQTSTMQTASVIP.........Q...NTM.....NN
  161-  286 (219.78/56.76)	Q..GTSQDGYAAQ.......R.QMA........GRQQQQQSQQL..IYHQ.......HQQPSLQSQQPNIPLQQ.QQ.QQLMGQ...QPNLQQNQLIGQQNGAVM......MQQQQRLAVQSNNLLNV....QQTHQMLNQQYIPLYQPQQLgSQANmSSLQQHQQNQ
  290-  402 (157.32/35.80)	QlfGTVPNVSSMQ.......WmHMQ........QRKAQQPQQQ.....HA........QQPPMGLMQPQFQHNQlQQlQHLMPQfqsQPNQLQEQLRMQQQSSM..........QQRL..............QTSGAMLLQQN.NMDQQNQF.IQAP.RGLQEVSSST
  405-  523 (131.99/28.79)	D..STAQTGHASAgdwqeeiH.QMI........KRLKDQYFAELseLFNK........MCVKLQHVDSIIP.................PQISSEQYDKMKSFKIMlerilqMLQIGKSSVQPSMMDNVpryeKQIISILNSQRKPV.QPQ............IHQQFQ
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP23954 with Med15 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) NNMDQQNQFIQAPRGLQEVSSSTSADSTAQTGHASAGDWQEE
2) QQLQHLMPQFQSQPNQLQEQLRMQQQSSMQQRLQTSGAMLL
3) RQQAAGNAQLIPNQNNPAPGLHPQGSNQAQTSAVPLMSQQQARQPNASTSVQASSLTNIRQNLPGVNQTSTMQTASVIPQNTMNNDLAQGTSQDGYAAQRQMAGRQQQQQSQQLIYHQHQQPSLQSQQPNIPLQQQQQQLMGQQPNLQQNQLIGQQ
4) SSMQWMHMQQRKAQQPQQQHAQQPPMGLMQPQFQHNQ
380
337
73
299
421
377
228
335

Molecular Recognition Features

MoRF SequenceStartStop
1) YEKQIISILN
499
508