<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP09955

Description Pre-mRNA-processing protein 40A
SequenceMASNMQASGLPQPPRPPMMGSSAQPQNLGPPMPMQFRPVIPSQPPPQFVPPAAQQFRSVGEPMPGANVGMPGQMPHFPQPGQHMPHSNQVPPVSQGVPMVYQPARPMSSAPMQPQQQAAYAGGHLLTMGAPMQPLTYTYQPTSIPPVVQPWSTGPGQSVTHVPPLVQSGHQQVSAPTTLPPVNLSEPSSSDWQEHTAAEGKKYYYNKKTRQSSWEKPVELMTPLERADASTEWKEFTTPEGRKYYFNKVTKQSKWTIPDELKVARELAEKASNQQPDQESGIATSALVRSAAFEPSTAPANQSSSAVGIIASSTHDGSSNSVLSGAPLPHNVENTSSSIVGMQNGGSSTAVVPVAASTEVPLVATDAGSSRNNDENSSLTTGADAEDGTSAEDLEEAKKTMPVAGKINVTPVEEKTSEEEPVVYATKMEAKNAFKSLLESVNVESDWTWDQTMRVIINDKRYGALKTLGERKQAFNEYLNQRKKFEAEEKRIKQRKARDDFLAMLEERKELTSSTRWSKAILMFEDDERFKAVERPREREDLFENYLVELHKKEKAKAAEEHKRYVAEYRAFLESCDFIKASTQWRKVQERLEDDERYSRLEKFDRLDIFQEYIRHLEKEEEEQKRVQKDQVRRQERKNRDGFRKMLEEHVADGTLNARTRWRDYCAQIKDSQSYLAVASNTSGSTPKELFDDVIEELDKQYQEDKTQIKEVVKSGKIPMTTSWTLEEFQTAILEDDALKGISTINIKLIYDDQLERLKEKEQKEAKKRQRLGENFSDLLYSIKEISASSTWDDSKQLFEDSQEFSFE
Length808
PositionUnknown
OrganismZea mays (Maize)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> Liliopsida> Poales> Poaceae> PACMAD clade> Panicoideae> Andropogonodae> Andropogoneae> Tripsacinae> Zea.
Aromaticity0.07
Grand average of hydropathy-0.843
Instability index53.01
Isoelectric point5.49
Molecular weight91124.85
Publications

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP09955
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     122.01|      34|      39|     181|     219|       1
---------------------------------------------------------------------------
  181-  219 (61.16/38.97)	PvnlseP.....SSSDWQEHTAAEGKKYYYNKKTRQSSWEKPVE
  222-  260 (60.85/30.37)	T.....PleradASTEWKEFTTPEGRKYYFNKVTKQSKWTIPDE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     400.67|      66|      66|     470|     535|       2
---------------------------------------------------------------------------
  414-  467 (55.19/34.19)	.............EKTSEEEPVV.......YATKmeAKNAFKSLL.ES...VNVESDWTWDQTMRVIINDKRYGAL...KT
  470-  535 (102.82/70.55)	ERKQAFNEYLNQRKKFEAEEKRI.......KQRK..ARDDFLAMLEER...KELTSSTRWSKAILMFEDDERFKAV...ER
  538-  603 (90.98/61.51)	EREDLFENYLVELHKKEKAKAAE.......EHKR..YVAEYRAFLESC...DFIKASTQWRKVQERLEDDERYSRL...EK
  605-  683 (84.29/56.40)	DRLDIFQEYIRHLEKEEEEQKRVqkdqvrrQERK..NRDGFRKMLEEHvadGTLNARTRWRDYCAQIKDSQSYLAVasnTS
  748-  805 (67.39/43.50)	..KLIYDDQLERLKEKEQKEAKK.......RQRL..G.ENFSDLLYSI...KEISASSTWDDSKQLFEDSQEF........
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     143.73|      17|      18|     133|     149|       3
---------------------------------------------------------------------------
   43-   54 (22.92/ 8.24)	QP.......PP.QFVPPAAQ
   61-   79 (23.74/ 8.80)	EPMpgANVGMP.GQMPHFPQ
   82-   95 (24.98/ 9.63)	QHM..PHSNQ....VPPVSQ
  102-  113 (21.12/ 7.03)	QP......ARP.MSSAP.MQ
  133-  149 (30.71/13.51)	QPL..TYTYQP.TSIPPVVQ
  152-  167 (20.26/ 6.45)	S....TGPGQSvTHVPPLVQ
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      48.28|      14|      18|       1|      17|       4
---------------------------------------------------------------------------
    1-   16 (23.37/19.17)	MASNMQASGLpqPPRP
   19-   33 (24.91/ 7.91)	MGSSAQPQNL.gPPMP
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      96.90|      24|      32|     301|     329|       5
---------------------------------------------------------------------------
  301-  326 (37.35/32.61)	NQSSSAVGIIAssTHDGSS......NSVLSGA
  334-  355 (36.57/16.94)	NTSSSIVGM....QNGGSS......TAVVPVA
  356-  383 (22.97/ 6.51)	..ASTEVPLVA..TDAGSSrnndenSSLTTGA
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP09955 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) IPDELKVARELAEKASNQQPDQESGIATSALVRSAAFEPSTAPANQSSSAVGIIASSTHDGSSNSVLSGAPLPHNVENTSSSIVGMQNGGSSTAVVPVAASTEVPLVATDAGSSRNNDENSSLTTGADAEDGTSAEDLEEAKKTMPVAGKINVTPVEEKTSEEEPVVYA
2) MASNMQASGLPQPPRPPMMGSSAQPQNLGPPMPMQFRPVIPSQPPPQFVPPAAQQFRSVGEPMPGANVGMPGQMPHFPQPGQHMPHSNQVPPVSQGVPMVYQPARPMSSAPMQPQQQAAYAGGHLLT
3) SIPPVVQPWSTGPGQSVTHVPPLVQSGHQQVSAPTTLPPVNLSEPSSSDWQEHTAAEGKKYYYN
257
1
143
425
127
206

Molecular Recognition Features

MoRF SequenceStartStop
1) EGKKYYYNKK
2) FSDLLYSIKEISAS
3) GRKYYFNK
4) QLFEDSQ
199
776
241
797
208
789
248
803