<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP31959

Description Pre-mRNA-processing protein 40C
SequenceMEGENTTDPPYTTAASSGQSIFVRPPPIAPVLATTSNFSQSELKELHSMSIASTGFVSQSVPYSVTAQWGTNAAASSNVNPIPQASPMLANAPFGRPGTLAPPGLMTSPPAFPGSNPFSTTPRPGMSAGPAQMNPGIHPHMYPPYHSLPGTPQGMWLQPPSMGGIPRAPFLSHPTTFPGSYPFPVRGISPNLPYSGSHPLGASPMGSVGNVHALPGRQPDISPGRKTEELSGIDDRAGSQLVGNRLDAWTAHKSEAGVLYYYNSVTGQSTYEKPPGFGGEPDKVPVQPIPVSMESLPGTDWALVSTNDGKKYYYNNKTKVSSWQIPAEVKDFGKKLEERAMESVASVPSADLTEKGSDLTSLSAPAISNGGRDAASLKTTNFGSSALDLVKKKLHDSGMPVSSTITSEANSGKTTEVTPSGESGNSTGKVKDAPGAGALSDSSSDSEDEDSGPSKEECSKQFKEMLKERGIAPFSKWEKELPKIIFDPRFKAIPSHSVRRSLFEQYVKTRAEEERREKRAAHKAAIEGFRQLLDDASTDIDQHTDYRAFKKKWGNDLRFEAIERKEREGLLNERVLSLKRSAEQKAQEIRAAAASDFKTMLREREISINSHWSKVKDSLRNEPRYRSVAHEDREVFYYEYIAELKAAQRGDDHEMKARDEEDKLRERERELRKRKEREVQEVERVRQKIRRKEASSSYQALLVEKIRDPEASWTESKPILERDPQKRASNPDLEPADKEKLFRDHVKSLYERCVHDFKALLAEALSSEAATLQTEDGKTALNSWSTAKQVLKPDIRYSKMPRQDREVVWRRYVEDISRKQRHENYQEEKQRDYKT
Length835
PositionUnknown
OrganismArabidopsis thaliana (Mouse-ear cress)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> rosids> malvids> Brassicales> Brassicaceae> Camelineae> Arabidopsis.
Aromaticity0.07
Grand average of hydropathy-0.837
Instability index47.15
Isoelectric point8.32
Molecular weight92805.61
Publications
PubMed=10819329
PubMed=27862469
PubMed=11910074
PubMed=19467629
PubMed=22021418

Function

Annotated function Binds the phosphorylated C-terminal domain (CTD) of the largest subunit of RNA polymerase II and functions as a scaffold for RNA processing machineries (Probable). May be involved in pre-mRNA splicing (Probable).
ECO:0000305	PubMed:19467629
GO - Cellular Component
nucleus	GO:0005634	IBA:GO_Central
GO - Biological Function
RNA polymerase binding	GO:0070063	IBA:GO_Central
transcription coregulator activity	GO:0003712	IBA:GO_Central
GO - Biological Process
mRNA processing	GO:0006397	IEA:UniProtKB-KW
RNA splicing	GO:0008380	IEA:UniProtKB-KW

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP31959
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     108.74|      22|      49|      93|     114|       1
---------------------------------------------------------------------------
   93-  112 (32.07/12.64)	...........PF.GRPGTL.APP..GLMTSPPAF
  135-  162 (30.12/11.44)	PG..ihphmypPYhSLPGT...PQ..GMWLQPPSM
  163-  194 (25.27/ 8.46)	GGiprapflshPT.TFPGSY.PFPvrGISPNLP.Y
  195-  222 (21.28/ 6.01)	SG..shplgasPM.GSVGNVhALP..G..RQPDIS
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     214.18|      35|     126|     674|     708|       2
---------------------------------------------------------------------------
  512-  539 (29.37/19.92)	.....................EE.ERR.......EKRAA.....H..KAAIEGFRQLLDDASTD
  564-  603 (32.41/22.98)	RK...EREGL...........LN.ERVlslkrsaEQKAQ...eiR..AAAASDFKTML....RE
  604-  643 (37.18/27.78)	RE...ISINS...........HW.SKV.......KDSLRnepryR..SVAHEDREVFYYEYIAE
  674-  708 (54.12/44.84)	RK...EREVQ...........EV.ERV.......RQKIR.....R..KEASSSYQALLVEKIRD
  716-  767 (23.62/14.14)	SKpilERDPQkrasnpdlepaDK.EKL......fRDHVK.....SlyERCVHDFKALLAEALSS
  802-  832 (37.48/28.08)	RQ...DREV.............VwRRY.......VEDIS.....R..KQRHENYQE...EKQRD
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     231.93|      52|      53|     249|     300|       3
---------------------------------------------------------------------------
  228-  287 (62.72/37.55)	......EELSGIDdragsqlvgnrldaWTAHKSEA.GVLYYYNSVTG..QSTYEKPP...GFGGE.P.....DKVPVQ
  288-  342 (78.81/49.24)	PIPVSMESLPGTD..............WALVSTND.GKKYYYNNKTK..VSSWQIPAevkDFGKK.L.....EERAME
  343-  397 (48.55/27.24)	....SVASVPSAD..............LTEKGSDL.TSLSAPAISNGgrDAASLKTT...NFGSS.AldlvkKKLHDS
  398-  433 (41.84/22.37)	GMPVS..............................sTITSEANS..G..KTTEVTPS...GESGNsT.....GKVKDA
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      57.73|      18|      54|      10|      33|       4
---------------------------------------------------------------------------
   10-   29 (30.30/21.16)	PYTTAASSGQSifVRP........PPIA
  117-  142 (27.43/10.42)	PFSTTPRPGMS..AGPaqmnpgihPHMY
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP31959 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) KKLEERAMESVASVPSADLTEKGSDLTSLSAPAISNGGRDAASLKT
2) LPYSGSHPLGASPMGSVGNVHALPGRQPDISPGRKTEELSGIDDRAGS
3) NAAASSNVNPIPQASPMLANAPFGRPGTLAPPGLMTSPPAFPGSNPFSTTPRPGMSAGPAQMNPGIHPHMYPPY
4) SGMPVSSTITSEANSGKTTEVTPSGESGNSTGKVKDAPGAGALSDSSSDSEDEDSGPSKEECSKQFK
334
192
72
397
379
239
145
463

Molecular Recognition Features

MoRF SequenceStartStop
1) KPDIRY
2) LPKIIFDPRF
3) VFYYEYIA
792
481
635
797
490
642