<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP09966

Description Pre-mRNA-processing protein 40A
SequenceMASNMQSSGLPQPPRPPMMGSSAQPQNLGPPMPMQFRPVMPSQHPPQFMPPAAQQFRPVGEPMAGANVGMPGQMPHFPQPGQHLPHSNQVPPVSQGVPMVYQPARPMSSAPMQQQQQTAYAGGHLPTMGAPMQPLTYTYQPTSIPPVAQPWSTGPGQSVHHVPPLVPSGHQPVSAPTTLPPVNLSEPSSSDWQEHTAAEGKKYYYNKKTRQSSWEKPVELMTPLERADASTEWKEFTTPEGRKYYFNKVTKQSKWTIPDELKVARELAENASNQQPDRESGIATSALVRSAAFEPSTAPANQSSSAVGIIASSAHDGSSNSVLSGPPLPHNVENTSSSIVGMQNGGSSTAVVPVAASTEVPLVATDAGSSRNNDENSSLTTGADAEDGTSAEDLVEAKKTMPVAGKINVTPVEEKTSEEEPVVYATKMEAKNAFKSLLESVNVESDWTWDQTMRVIINDKRYGALKTLGERKQAFNEYLNQRKKFEAEEKRIKQRKARDDFLAMLEESKELTSSTRWSKAILMFEDDERFKAVERPREREDLFENYLVELHKKEKAKAAEEHKRYVAEYRAFLESCDFIKASTQWRKVQERLEDDERYSRLEKFDRLDIFQEYIRHLEKEEEEQKRVQKDQVRRQERKNRDGFRKMLEEHVADGTLNARTRWRDYCAQIKDSQSYLAVASNTSGSTPKELFDDVIEELGKQYQEDKIQIKEVVKSGKIPMTTSWTLEEFQTATLEDDALKGISTINIKLVYDDQLERLKEKEQKDAKKRQRLGENFSDLLYSIKEISASSTWDDSKQLFEDSQEFRALDSETYARELFEECVVHLKERLKEKERLREEEKV
Length841
PositionUnknown
OrganismZea mays (Maize)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> Liliopsida> Poales> Poaceae> PACMAD clade> Panicoideae> Andropogonodae> Andropogoneae> Tripsacinae> Zea.
Aromaticity0.07
Grand average of hydropathy-0.861
Instability index53.74
Isoelectric point5.61
Molecular weight95093.33
Publications

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP09966
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     421.54|      66|      66|     470|     535|       1
---------------------------------------------------------------------------
  429-  493 (78.81/48.32)	..EAKNAFKSLL.E...SVNVESDWTWDQTMRVIINDKRYGAL...KTlgER....KQAFNEYLNQRKKFEAEEKRI.......K
  494-  561 (95.55/60.13)	QRKARDDFLAMLEE...SKELTSSTRWSKAILMFEDDERFKAV...ERprER....EDLFENYLVELHKKEKAKAAE.......E
  562-  635 (85.05/52.72)	HKRYVAEYRAFLES...CDFIKASTQWRKVQERLEDDERYSRL...EK.fDR....LDIFQEYIRHLEKEEEEQKRVqkdqvrrQ
  636-  706 (73.53/44.60)	ERKNRDGFRKMLEEhvaDGTLNARTRWRDYCAQIKDSQSYLAVasnTS....gstpKELFDDVIEELGKQYQEDK..........
  770-  836 (88.60/55.22)	QRLG.ENFSDLLYS...IKEISASSTWDDSKQLFEDSQEFRAL...DSetYA....RELFEECVVHLKERLKEKERL.......R
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             7|     295.19|      39|      39|     181|     219|       2
---------------------------------------------------------------------------
   25-   46 (23.47/ 7.68)	.......................PQNL........G.PPMPM...QFRPVMPSQHPP
   47-   74 (37.45/17.63)	QFMPPAAQQ...FRPV....................gEPMAG......ANVGMPGQM
   75-  124 (49.69/26.33)	PHFPQPGQHLPHSNQVP......PVSQgvpmvyqpA.RPMSSapmQQQQQTAYAGGH
  137-  158 (32.31/13.97)	YTY.....QPTSIPPV....................aQP.......W...STGPGQS
  159-  202 (54.07/29.44)	VHHVPPLVPSG.HQPVSapttlpPVNL........S.EPSSS...DWQEHTAAEGKK
  203-  243 (56.17/30.94)	YYYNKKTRQSSWEKPVE...lmtPLER........A.D.AST...EWKEFTTPEGRK
  244-  281 (42.04/20.89)	YYFNKVTKQSKWTIPDE...lkvAREL........A.ENASN.....QQPDRESG..
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      67.17|      24|      26|     310|     333|       3
---------------------------------------------------------------------------
  324-  352 (33.15/20.57)	SGPPLPHNVEntsssIVGMQNGGSSTAVV
  356-  379 (34.02/21.31)	ASTEVPLVAT.....DAGSSRNNDENSSL
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      42.14|      14|      26|     386|     399|       4
---------------------------------------------------------------------------
  386-  399 (23.22/16.41)	EDGTSAED.LVEAKK
  413-  427 (18.92/11.87)	EEKTSEEEpVVYATK
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP09966 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) IPDELKVARELAENASNQQPDRESGIATSALVRSAAFEPSTAPANQSSSAVGIIASSAHDGSSNSVLSGPPLPHNVENTSSSIVGMQNGGSSTAVVPVAASTEVPLVATDAGSSRNNDENSSLTTGADAEDGTSAEDLVEAKKTMPVAGKINVTPVEEKTSEEEPVVYA
2) MASNMQSSGLPQPPRPPMMGSSAQPQNLGPPMPMQFRPVMPSQHPPQFMPPAAQQFRPVGEPMAGANVGMPGQMPHFPQPGQHLPHSNQVPPVSQGVPMVYQPARPMSSAPMQQQQQTAYAGGHLPTMGAPMQPLTYTYQPTSIPPVAQPWSTGPGQSVHHVPPLVPSGHQPVSAPTTLPPVNLSEPSSSDWQEHTAAEGKKYYYN
257
1
425
206

Molecular Recognition Features

MoRF SequenceStartStop
1) GKKYYY
2) GRKYYFN
3) LLYSIKEI
200
241
779
205
247
786