Detailed information on MDP09750

Description Pre-mRNA-processing protein 40C
Sequence MAPNPPGSIQLPFSVPRPSNIPFGAIAQQGSSDINNLKSDSPRAPEVTPQAMQLSTGMPSKSPSTIASASGSPSIPIQTLTNSSVPPRPEVFGATRPSVPAQPSATVSNPTGFLGRPIVPPAAPLPQTPPPIATQGGTPQNSQRPFYPSYPSGPGIVPPQPLWPHPHPPQPTGFQQPPFQYYPAGPVGSLGRPITGASAATMAFANVQPPGVSTGGDRKVQASTNAGSEQSTHAAAEPDSTGHGGQVTEQLEDNRNTGVQDSDAWSAHKTETGVVYYYNALTGESTYQKPTGYKGELEKVATEPVPVSWDKLAGTNWSIVTTSDGKKYYYDNKQKVSSWQLPPEVCEILKNAESGSLKEGSTSLQDAATIENKGVISIDASTPAIQTGGRDSLPLRQTVAPASPSPLDLIKKKLQDAGASSAPSALATSSATSELNGSKPADAALKGQLVANNGEKLKDNNGDVNISDSSSDSDDEEHGPSKEDCIRQFKVMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSTRRAIFDHYVRTRAEEERKEKRAALKAAVEAYKELLEEASEDINQKTDYQEFKRKWGADTRFEALDRKEREILFSEKVKAVQEKVQSMRKAVIANFKSMLRESKDITSTSRWAKVKENFRSDPRYKAMKHEERETIFNEYIVELKSAEQEAEQAAKAKVDEQAKLKERERETRKRKEREEQEMERVKMKIRRKEAVSSYQALLVEMIKDPKASWTESKPKLEKDPQGRARNPDLGQGDAEKLFRDHVKDLYERCVRDFRALLSEVITPEVAARTTAEGKTAINSWSEAKGHLRSDLRYNKLPSKDKESIWRRYADDLTRKLRQSDTKEKDKSDTDGKQPRSSDPPRRR
Length 872
Position Unknown
Organism Zea mays (Maize)
Kingdom Viridiplantae
Lineage Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliopsida > Liliopsida > Poales > Poaceae > PACMAD clade > Panicoideae > Andropogonodae > Andropogoneae > Tripsacinae > Zea.
Aromaticity 0.06
Grand average of hydropathy -0.876
Instability index 49.17
Isoelectric point 8.99
Molecular weight 96115.30
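
The aromaticity and grand average of hydropathy (GRAVY) values are simple per-residue statistics. As a minimal sketch, assuming the record uses the common definitions (aromaticity as the fraction of F, W and Y residues; GRAVY as the mean Kyte-Doolittle hydropathy), they could be recomputed from the sequence as follows. The function names are illustrative and are not part of the database.

```python
# Kyte-Doolittle hydropathy scale (standard published values).
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq):
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)
```

A negative GRAVY value, such as the one listed for this entry, indicates a protein that is hydrophilic on average.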
Publications
PubMed=19965430

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP09750
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     358.41|      58|      60|     564|     621|       1
---------------------------------------------------------------------------
  514-  557 (31.15/14.90)	......................DPRFKA....IpSHSTRRAIFDHYVRTrAEEE.R.KEKRaalKAAVEAYK
  564-  621 (89.83/58.05)	SED...INQKTDYQEFKRKWGADTRFEA....L.DRKEREILFSEKVKA.VQEK.V.QSMR...KAVIANFK
  627-  678 (69.02/42.74)	SKD...ITSTSRWAKVKENFRSDPRYKA....M.KHEERETIFNEYIVE.LKSA.E.QEAE...QA......
  680-  724 (54.27/31.90)	.......KAKVDEQAKLKERERETR.........KRKEREEQEMERVKM....K.I.R..R...KEAVSSYQ
  731-  792 (60.17/36.24)	IKD.....PKASWTESKPKLEKDPQGRArnpdL.GQGDAEKLFRDHVKD.LYERcV.RDFRallSEVITP..
  800-  851 (53.98/31.68)	AEGktaIN...SWSEAKGHLRSDLRYNK....L.PSKDKESIWRRYADD.LTRK.LrQSDT...K.......
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     262.97|      48|      49|      97|     145|       2
---------------------------------------------------------------------------
    4-   39 (41.63/14.25)	.............NPPGSIQ...LPFSV...PR.P.SNIPfgAIAQQGSSD........INN..LKS
   42-   89 (40.00/13.61)	PRAPEvTPQA.MQLSTG...............M.PsKSPS..TIASASGSPsipiqtltNSSvpPRP
   97-  145 (89.84/42.48)	PSVPA.QPSAtVSNPTGFLGRPIVPPAA...PL.P.QTPP..PIATQGGTP........QNS..QRP
  149-  177 (46.92/17.19)	....S.YPS..........GPGIVPP.Q...PLwP.HPHP..P......QP........TGF..QQP
  178-  216 (44.59/15.98)	.PFQY.YPAG....PVGSLGRPITGASAatmAF.A.NVQP..PGVSTGG..................
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     256.72|      49|      49|     242|     290|       3
---------------------------------------------------------------------------
  225-  271 (51.52/25.04)	.......NAGSEQSTHaaAE..P......dstGHGGQVT......E...QLEDN...R.NTGVQDSDA...WSAHKTE
  272-  323 (66.12/34.20)	TGVVYYYNALTGESTY..QK..P........tGYKGEL.......E...KVATE...P.VPVSWDKLAgtnWSIVTTS
  324-  386 (53.88/26.52)	DGKKYYYDNKQKVSSW..QL..PpevceilknAESGSLK......EgstSLQDAatiE.NKGVISIDA....STPAIQ
  387-  434 (58.09/29.17)	TG...GRDSLPLRQTV..APasP.........SPLDLIK......K...KLQDA...G.ASSAPSALA...TSSATSE
  436-  473 (27.12/ 9.74)	NGSKPADAALKGQ..........................lvanngE...KLKDN...NgDVNISDSSS...DS.....
---------------------------------------------------------------------------




Explanation of the Stockholm format

The "Stockholm" format is a system for marking up features in a multiple alignment. Mark-up annotations are preceded by a 'magic' label, of which there are four types: #=GF (per-file), #=GS (per-sequence), #=GR (per-residue) and #=GC (per-column). The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines may include any characters except whitespace; an underscore ("_") is used instead of a space.

#=GR (seqname) PP is generic per-sequence and per-column markup with exactly one character per column, where PP is the posterior probability, encoded as [0-9*] (0 = 0.00-0.05; 1 = 0.05-0.15; and so on up to * = 0.95-1.00).

The #=GC PP_cons line is the Stockholm-format consensus posterior probability annotation for an entire column. It is calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This is useful in phylogenetic inference applications, for example, where it is common to mask out columns of a multiple alignment that are not confidently aligned. The PP_cons line provides an objective measure of the confidence assigned to each column.
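
The column averaging can be sketched in a few lines. This sketch works from the PP characters rather than the underlying probabilities, which is an assumption made for illustration: each character is decoded to a representative value (digit d to d/10, `*` to 1.0), gap columns (`.`) are skipped, and the column mean is re-encoded with the PP bins. The function names are hypothetical, and the software that writes PP_cons may average the unrounded probabilities instead.

```python
def pp_to_prob(ch):
    """Decode one PP character to a representative probability."""
    if ch == "*":
        return 1.0
    return int(ch) / 10.0


def prob_to_pp(p):
    """Encode a probability using bins 0=0.00-0.05, 1=0.05-0.15, ..., *=0.95-1.00."""
    if p >= 0.95:
        return "*"
    return str(int(p * 10 + 0.5))


def pp_cons(pp_rows):
    """Column-wise arithmetic mean of PP annotation rows; '.' marks a gap."""
    out = []
    for column in zip(*pp_rows):
        probs = [pp_to_prob(ch) for ch in column if ch != "."]
        # A column with no aligned residues gets no confidence value.
        out.append(prob_to_pp(sum(probs) / len(probs)) if probs else ".")
    return "".join(out)
```

Columns whose consensus character falls below a chosen threshold can then be masked before downstream analysis.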

The #=GC RF line is the Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.
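
As an illustration of how the RF line can be used, the sketch below collects the #=GC annotations from a Stockholm-format text and keeps only the alignment columns marked with x. It is a minimal example, not a full Stockholm parser, and the function names are hypothetical.

```python
def read_gc_lines(stockholm_text):
    """Collect per-column (#=GC) annotations, keyed by feature name."""
    gc = {}
    for line in stockholm_text.splitlines():
        if line.startswith("#=GC"):
            _, feature, markup = line.split(None, 2)
            # Interleaved alignments repeat the feature once per block.
            gc[feature] = gc.get(feature, "") + markup.strip()
    return gc


def consensus_columns(rf_line, aligned_seq):
    """Keep only the residues in columns that the RF line marks with 'x'."""
    if len(rf_line) != len(aligned_seq):
        raise ValueError("RF line and aligned sequence must have equal length")
    return "".join(res for mark, res in zip(rf_line, aligned_seq) if mark == "x")
```

For example, with an RF line of `xx.xx` and an aligned row `ACdE-`, the third column is an insertion relative to the profile and is dropped.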

Alignment of MDP09750 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR Sequence	Start	Stop
1) AEQAAKAKVDEQAKLKERERETRKRKEREEQEMERVKMKI	675	714
2) MAPNPPGSIQLPFSVPRPSNIPFGAIAQQGSSDINNLKSDSPRAPEVTPQAMQLSTGMPSKSPSTIASASGSPSIPIQTLTNSSVPPRPEVFGATRPSVPAQPSATVSNPTGFLGRPIVPPAAPLPQTPPPIATQGGTPQNSQRPFYPSYPSGPGIVPPQPLWPHPHPPQPTGFQQPPFQYYPAGPVGSLGRPITGASAATMAFANVQPPGVSTGGDRKVQASTNAGSEQSTHAAAEPDSTGHGGQVTEQLEDNRNTGVQDSDAWS	1	266
3) SSAPSALATSSATSELNGSKPADAALKGQLVANNGEKLKDNNGDVNISDSSSDSDDEEHGPSKEDCI	420	486
4) SWTESKPKLEKDPQGRARNPDLGQGDAEKL	737	766

Molecular Recognition Features

MoRF Sequence	Start	Stop
1) GKKYYY	325	330
2) LDLIKKKLQ	407	415
3) SIWRRYA	832	838
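
The Start and Stop values for the disordered regions and MoRFs appear to be 1-based and inclusive: for example, the MoRF GKKYYY is listed with 325 and 330, and 330 - 325 + 1 = 6 matches its length. A small sketch of that convention, with hypothetical helper names:

```python
def region_length(start, stop):
    """Length of a region given 1-based, inclusive start and stop positions."""
    return stop - start + 1


def extract_region(seq, start, stop):
    """Slice a region out of a sequence using 1-based, inclusive coordinates."""
    return seq[start - 1:stop]
```

Applied to the full protein sequence, `extract_region` with a listed Start and Stop should return the corresponding region sequence if this reading of the coordinates is right.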