Detailed information on MDP09747

Description Pre-mRNA-processing protein 40C
Sequence MATLASAVSDVGVEEPSPAKAADPKEPAAAVEEPAAAAAELAGASIPSPVAAADAGDASSGPALATTPPASPATSAAPPPVSPAPLVSPAPPAEPGPPRSQFAGSLSYIAPGTPSPSAAFSYNVLPRAPPAPQVGGAAASLQPCSSPALMVAPIPASALQPPAPGQYFGNRPSFSYNVVSHANARLPTGQQFQPVTGANLAGPISRFVPPGSLQPPTPGHITRPSTAFPGSMAPNPPGSIQLPFSVPRPSNIPFGAIAQQGSSDINNLKSDSPRAPEVTPQAMQLSTGMPSKSPSTIASASGSPSIPIQTLTNSSVPPRPEVFGATRPSVPAQPSATVSNPTGFLGRPIVPPAAPLPQTPPPIATQGGTPQNSQRPFYPSYPSGPGIVPPQPLWPHPHPPQPTGFQQPPFQYYPAGPVGSLGRPITGASAATMAFANVQPPGVSTGGDRKVQASTNAGSEQSTHAAAEPDSTGHGGQVTEQLEDNRNTGVQDSDAWSAHKTETGVVYYYNALTGESTYQKPTGYKGELEKVATEPVPVSWDKLAGTNWSIVTTSDGKKYYYDNKQKVSSWQLPPEVCEILKNAESGSLKEGSTSLQDAATIENKGVISIDASTPAIQTGGRDSLPLRQTVAPASPSPLDLIKKKLQDAGASSAPSALATSSATSELNGSKPADAALKGQLVANNGEKLKDNNGDVNISDSSSDSDDEEHGPSKEDCIRQFKVMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSTRRAIFDHYVRTRAEEERKEKRAALKAAVEAYKELLEEASEDINQKTDYQEFKRKWGADTRFEALDRKEREILFSEKVKAVQEKVQSMRKAVIANFKSMLRESKDITSTSRWAKVKENFRSDPRYKAMKHEERETIFNEYIVELKSAEQEAEQAAKAKVDEQAKLKERERETRKRKEREEQEMERVKMKIRRKEAVSSYQALLVEMIKDPKASWTESKPKLEKDPQGRARNPDLGQGDAEKLFRDHVKDLYERCVRDFRALLSEVITPEVAARTTAEGKTAINSWSEAKGHLRSDLRYNKLPSKDKESIWRRYADDLTRKLRQSDTKEKDKSDTDGKQPRSSDPPRRR
Length 1103
Position Unknown
Organism Zea mays (Maize)
Kingdom Viridiplantae
Lineage Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliopsida > Liliopsida > Poales > Poaceae > PACMAD clade > Panicoideae > Andropogonodae > Andropogoneae > Tripsacinae > Zea
Aromaticity 0.06
Grand average of hydropathy -0.697
Instability index 55.10
Isoelectric point 8.76
Molecular weight 118497.12
Publications
PubMed=19965430
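
The aromaticity and grand average of hydropathy (GRAVY) values listed above can be recomputed from the sequence. The sketch below assumes the standard definitions: aromaticity as the fraction of F, W and Y residues, and GRAVY as the mean Kyte-Doolittle hydropathy per residue. The entry does not state which tool or scale produced its values, so this is an illustration rather than the database's own method. The function names are hypothetical.

```python
# Kyte-Doolittle hydropathy scale (assumed; the entry does not name its scale).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}
AROMATIC = frozenset("FWY")


def aromaticity(sequence: str) -> float:
    """Fraction of residues that are Phe, Trp or Tyr."""
    seq = sequence.upper()
    return sum(res in AROMATIC for res in seq) / len(seq)


def gravy(sequence: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = sequence.upper()
    return sum(KYTE_DOOLITTLE[res] for res in seq) / len(seq)


if __name__ == "__main__":
    toy = "AFWY"  # toy peptide, not taken from the entry
    print(aromaticity(toy), gravy(toy))
```

Passing the full Sequence field to both functions and rounding should allow a comparison with the reported values, provided the assumptions above hold.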

Function

Annotated function
GO - Cellular Component
nucleus	GO:0005634	IBA:GO_Central
GO - Molecular Function
RNA polymerase binding	GO:0070063	IBA:GO_Central
transcription coregulator activity	GO:0003712	IBA:GO_Central
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP09747
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      54.99|      15|      16|     210|     224|       2
---------------------------------------------------------------------------
  210-  224 (28.76/ 9.49)	PGSLQPPTPGHITRP
  229-  243 (26.22/ 7.93)	PGSMAPNPPGSIQLP
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      43.31|      13|      16|     130|     143|       3
---------------------------------------------------------------------------
   83-   95 (23.97/ 8.50)	PAPLVSPAPPA.EP
  130-  143 (19.34/ 8.90)	PAPQVGGAAASlQP
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     383.26|      59|      97|     787|     845|       4
---------------------------------------------------------------------------
  686-  758 (47.34/25.50)	..EKLKDNNGDVNisdssSDSDDEEhgpskedcirqFKvmLKERGVAPFSK....WEKELPKIVFDP...RFKAIPS....HSTR...R
  787-  845 (92.32/57.28)	YKELLEEASEDIN.....QKTDYQE...........FK..RKWGADTRFEA....LDRKEREILFSE...KVKAVQEK.V.QSMR...K
  851-  911 (72.67/43.39)	FKSMLRE.SKDIT.....STSRWAK...........VK..ENFRSDPRYKA....MKHEERETIFNEyivELKSAEQE.A.EQAA...K
  913-  948 (36.89/18.12)	...................KVDEQA...........KL..KERERETR........KRKEREEQEME...RVKM...K.I.R..R...K
  954- 1018 (64.42/37.57)	YQALLVEMIKD.......PKASWTE...........SK..PKLEKDPQGRArnpdLGQGDAEKLFRD...HVKDLYERcV.RDFRallS
 1026- 1082 (69.63/41.25)	AARTTAEGKTAIN........SWSE...........AK..GHLRSDLRYNK....LPSKDKESIWRR...YADDLTRK.LrQSDT...K
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     147.57|      29|      30|     347|     376|       5
---------------------------------------------------------------------------
  304-  327 (41.33/13.33)	PSIPIQTLT.NSSVPP.............RPEV.....F.GATR
  328-  360 (31.81/ 8.51)	PSVPAQ..........psatvsnptgflgRPIVPPAAPL.PQTP
  361-  398 (38.51/15.63)	PPIATQGGTpQNSQRP.....fypsypsgPGIVPP.QPLwPHPH
  614-  637 (35.92/10.59)	PAI..QTGG.RDSLPL.............RQTVAPASPS.P...
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     240.56|      51|      51|     469|     519|       6
---------------------------------------------------------------------------
  412-  446 (42.57/20.26)	............YYPAGPVGSLGR.............P.ITG.ASAATMAFANV...QPPGVSTG..
  447-  495 (63.51/34.09)	.GDRKVQAST....NAGSEQSTHA..........aaePDSTGHGGQVTEQLEDN...RNTGVQDSDA
  496-  544 (74.95/41.65)	WSAHKTETGVVYYYNALTGESTYQ............kP..TGYKGEL.EKVATE...PVPVSWDKLA
  548-  611 (59.53/31.46)	WSIVTTSDGKKYYYDNKQKVSSWQlppevceilknaeSGSLKEGST...SLQDAatiENKGVISIDA
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      58.85|      17|      51|     107|     128|       7
---------------------------------------------------------------------------
  110-  128 (30.47/22.23)	APGT...PSPSaaFSYNVLPRA
  163-  182 (28.38/ 7.78)	APGQyfgNRPS..FSYNVVSHA
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      63.50|      19|      24|      24|      42|       9
---------------------------------------------------------------------------
   24-   42 (30.99/14.66)	PKEPAAAVEEPAAAAAELA
   47-   65 (32.51/15.81)	PSPVAAADAGDASSGPALA
---------------------------------------------------------------------------




Explanation of Stockholm format

The "Stockholm" format is a system for marking up features in a multiple alignment. Each mark-up annotation is preceded by a 'magic' label, of which there are four types (#=GF, #=GS, #=GR and #=GC). The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines may contain any characters except whitespace; an underscore ("_") is used in place of a space.

#=GR (seqname) PP is generic per-sequence and per-column mark-up, with exactly one character per column. PP is the posterior probability, encoded with the characters [0-9*] (0 = 0.00-0.05; 1 = 0.05-0.15; * = 0.95-1.00).

The #=GC PP_cons line is the Stockholm-format consensus posterior probability annotation for an entire column. It is calculated as the arithmetic mean of the per-residue posterior probabilities in that column. This is useful in phylogenetic inference, for example, where it is common to mask columns of a multiple alignment that are not confidently aligned. The PP_cons line provides an objective measure of the confidence assigned to each column.
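
As an illustration of the posterior probability encoding and the column masking described above, the sketch below decodes a single PP character into a probability range and marks which columns of a PP_cons line meet a minimum confidence level. Only the ranges for 0, 1 and * are given in the text. The ranges for the digits 2 to 9 are assumed to continue the same pattern (digit d covers d/10 ± 0.05), and gap characters are assumed to carry no probability. The function names are hypothetical.

```python
def pp_range(ch: str):
    """Return the (low, high) probability range for one PP character.

    '0' and '*' follow the ranges stated in the text; digits 1-9 are
    assumed to cover d/10 - 0.05 up to d/10 + 0.05. Any other character
    (for example a gap) returns None.
    """
    if ch == "*":
        return (0.95, 1.0)
    if ch == "0":
        return (0.0, 0.05)
    if len(ch) == 1 and ch in "123456789":
        d = int(ch)
        return (round(d / 10 - 0.05, 2), round(d / 10 + 0.05, 2))
    return None


def confident_columns(pp_cons: str, min_level: int = 9):
    """Flag columns whose PP_cons character is at least min_level.

    '*' is treated as level 10; non-PP characters are never kept.
    """
    keep = []
    for ch in pp_cons:
        if ch == "*":
            level = 10
        elif ch.isdigit():
            level = int(ch)
        else:
            level = -1
        keep.append(level >= min_level)
    return keep


if __name__ == "__main__":
    print(pp_range("1"))
    print(confident_columns("*97.", min_level=9))
```

The threshold of 9 is an arbitrary choice for the example; a suitable cut-off depends on the downstream analysis.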

The #=GC RF line is the Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.
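
A minimal sketch of how the #=GR and #=GC mark-up lines described above could be separated from the aligned sequences. The toy alignment, the sequence name `seq1` and the function name `parse_stockholm` are hypothetical, and #=GF and #=GS lines are simply skipped. This is not a full Stockholm parser.

```python
EXAMPLE = """# STOCKHOLM 1.0
seq1          ACDE.FG
#=GR seq1 PP  9987.65
#=GC PP_cons  9987.65
#=GC RF       xxxx.xx
//
"""


def parse_stockholm(text: str):
    """Split a Stockholm block into sequences, #=GR and #=GC mark-up.

    Returns three dicts: name -> aligned sequence,
    (name, feature) -> per-residue mark-up, feature -> per-column mark-up.
    Lines for the same key are concatenated to support wrapped blocks.
    """
    seqs, per_residue, per_column = {}, {}, {}
    for raw in text.splitlines():
        line = raw.rstrip()
        if not line or line == "//":
            continue
        if line.startswith("#=GR"):
            _, name, feature, markup = line.split(None, 3)
            key = (name, feature)
            per_residue[key] = per_residue.get(key, "") + markup
        elif line.startswith("#=GC"):
            _, feature, markup = line.split(None, 2)
            per_column[feature] = per_column.get(feature, "") + markup
        elif line.startswith("#"):
            continue  # header, #=GF and #=GS lines are ignored here
        else:
            name, aligned = line.split(None, 1)
            seqs[name] = seqs.get(name, "") + aligned
    return seqs, per_residue, per_column


if __name__ == "__main__":
    seqs, gr, gc = parse_stockholm(EXAMPLE)
    print(seqs, gr, gc)
```

Because mark-up lines contain no whitespace, splitting on whitespace with a limited number of fields is enough to recover each annotation string.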

Alignment of MDP09747 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR Sequence	Start	Stop
1) AEQAAKAKVDEQAKLKERERETRKRKEREEQEMERVKMKI	906	945
2) PISRFVPPGSLQPPTPGHITRPSTAFPGSMAPNPPGSIQLPFSV	203	246
3) PSNIPFGAIAQQGSSDINNLKSDSPRAPEVTPQAMQLSTGMPSKSPSTIASASGSPSIPIQTLTNSSVPPRPEVFGATRPSVPAQPSATVSNPTGFLGRPIVPPAAPLPQTPPPIATQGGTPQNSQRPFYPSYPSGPGIVPPQPLWPHPHPPQPTGFQQPPFQYYPAGPVGSLGRPITGASAATMAFANVQPPGVSTGGDRKVQASTNAGSEQSTHAAAEPDSTGHGGQVTEQLEDNRNTGVQDSDAWS	249	497
4) SSAPSALATSSATSELNGSKPADAALKGQLVANNGEKLKDNNGDVNISDSSSDSDDEEHGPSKEDCI	651	717
5) SWTESKPKLEKDPQGRARNPDLGQGDAEKL	968	997
6) TLASAVSDVGVEEPSPAKAADPKEPAAAVEEPAAAAAELAGASIPSPVAAADAGDASSGPALATTPPASPATSAAPPPVSPAPLVSPAPPAEPGPPRSQFAGSLSY	3	108

Molecular Recognition Features

MoRF Sequence	Start	Stop
1) ADPKEPAAAVEEPAAAAAELAGASIPSPVA	22	51
2) GKKYYY	556	561
3) LDLIKKKLQ	638	646
4) SIWRRYA	1063	1069
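
The start and stop values listed for the IDR and MoRF regions read as 1-based, inclusive residue positions. This is an assumption, but it is consistent with the region lengths: for example, GKKYYY has 6 residues and 561 - 556 + 1 = 6. The sketch below shows how such coordinates map onto Python's 0-based slicing and how a region can be checked against its coordinates. The function names and the toy sequence are hypothetical.

```python
def region_length(start: int, stop: int) -> int:
    """Number of residues in a 1-based, inclusive interval."""
    return stop - start + 1


def extract_region(sequence: str, start: int, stop: int) -> str:
    """Slice a region out of a sequence using 1-based, inclusive positions."""
    return sequence[start - 1:stop]


def coordinates_consistent(region: str, start: int, stop: int) -> bool:
    """True when the region's length matches its start/stop coordinates."""
    return len(region) == region_length(start, stop)


if __name__ == "__main__":
    # MoRF entries as listed in this record: (sequence, start, stop).
    morfs = [
        ("ADPKEPAAAVEEPAAAAAELAGASIPSPVA", 22, 51),
        ("GKKYYY", 556, 561),
        ("LDLIKKKLQ", 638, 646),
        ("SIWRRYA", 1063, 1069),
    ]
    for seq, start, stop in morfs:
        print(seq, coordinates_consistent(seq, start, stop))
    # Toy sequence to show the slicing convention.
    print(extract_region("ABCDEFGH", 3, 5))
```

Applying `extract_region` to the full Sequence field with a listed start and stop should return the corresponding region if the coordinate convention assumed here is the one the database uses.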