<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP10898

Description Pre-mrna-processing protein 40a
SequenceMANNSHFTGMQPPIPPLAGPPGPHQGAIPSMSLQFQPMLPPQQAQPYVSGPSQQFQPLGHTNVAMLPQPSQIQFPQPMQQVAGRPVVGGHSMPQGPPTPHDFQRNRPVTPVQAPLQPNLPMSNNHMPGAGGPSLPLSSSYNQVNADSSSSQYQMQIHDHSFPSGGQPWMSTSNHNVNSVTNMQKTGELAAHLAVPEAVYGVDSVETALSDWIEHTSRNGKKYYYNRRTRISSWEKPLELMTEMERADASTDWREFTSPQGKKYYYNKVTRKSKWKMPDEVKLARQSMKVDLVKGLGKERDSISHASDFGSISGVKTSSPGANGSPVSAQGAMSSPIAVAPVSNLPAIVASESSLSGNISSLTIGAVEMQNSLEPASPAVATSEKNGTAVTLENSVATPVTSSEFPSAQDSVVYEDGVSLENTEEVKKDAMISETGSVTPSEEKTVEPGPLVYESKAEAKSAFKILLESANIGSDWTWDQAMRAIINDRRYGALKSLGERKQAFNEYLSHKKKLEAEERRIKQKKAREDFRIMLEECKELAPSTRWSKAISIFEHDERFKAVERAKDREDLFEDYMEELEKKERARALEEQKRNRVEYLEFLKSCDFIKASSQWRKVQDRLEDDERCSRLEKIDRLEIFQEYIRDLEREEEEHRKLRMDEMRKAERKNRDEFRKLMEEHVAAGILNAKTNWRDYCIKVKDLPAYLAVSSNTSGPTAKDLFQDVFDELEKQYLDDKSRIRDAVRMAEIGLTSTWMLEDFKVAIAKDISSPPISDTNLKFVFEELLERAREREEKEAKRRKRLADEFYELLHTSKEITASSKWEDCKSIFGDRIMGEESFLLEIFDKFISELKEKAKEKERKRQEDKSRKEKERKDREKKKEKHRRDKDRGDKSRKERERTKKDGTDSDKADTYSFEEIKRLGSDRDKKHRKRHMSSFDDNENEKDHSRNSYRHDNDHKKSKQVDQHVWSSEVNSEGQHKKQKRDHRSGSLRDGDNEDHKDGEFGEDGEVR
Length1008
PositionUnknown
OrganismNicotiana attenuata (Coyote tobacco)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> asterids> lamiids> Solanales> Solanaceae> Nicotianoideae> Nicotianeae> Nicotiana.
Aromaticity0.07
Grand average of hydropathy-1.024
Instability index49.19
Isoelectric point6.51
Molecular weight115172.33
Publications

Function

Annotated function
GO - Cellular Component
nucleus	GO:0005634	IEA:EnsemblPlants
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP10898
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      66.09|      19|      19|     857|     875|       1
---------------------------------------------------------------------------
  857-  871 (18.92/ 6.13)	.......ERKRQEDKSRKEKER
  872-  892 (24.12/ 9.85)	KDREkkkEKHRR.DKDRGDKSR
  921-  941 (23.05/ 9.08)	SDRD.kkHRKRHMSSFDDNENE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     133.08|      34|      38|     205|     238|       2
---------------------------------------------------------------------------
  205-  238 (68.20/38.96)	ET..ALSDWIEHTSRNGKKYYYNRRTRISSWEKPLE
  244-  279 (64.88/36.73)	ERadASTDWREFTSPQGKKYYYNKVTRKSKWKMPDE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     188.81|      37|      38|      24|      60|       3
---------------------------------------------------------------------------
   25-   60 (64.63/36.45)	QGAIPSMS.LQ..F.QPMLP....PQQAQ.PYVSGPSQQFQPLGH...
   66-  100 (47.87/25.11)	...LPQPSqIQ..FpQPM.Q....QVAGR.PVVGGHSMPQGPPTP..H
  103-  141 (46.30/24.05)	QRNRPVTP.VQapL.QPNLP....MSNNHmPGAGGPS...LPLSSsyN
  142-  170 (30.01/13.03)	QVNADSSS.SQ..Y.QMQIHdhsfPSGGQ.PWMS..............
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      70.38|      15|      38|     942|     956|       4
---------------------------------------------------------------------------
  942-  956 (29.40/14.56)	KDHSRNS.YRH.DN.DHK
  962-  977 (21.62/ 9.00)	DQHVWSSeVNS.EG.QHK
  981-  997 (19.36/ 7.38)	RDHRSGS.LRDgDNeDHK
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     465.52|      66|      66|     517|     582|       5
---------------------------------------------------------------------------
  457-  514 (73.03/42.85)	........EAKSAFKILL.E...SANIGSDWTWDQAMRAIIND.RRYGAL...KSLGE..RK....QA........FNEYLSHKKKLE
  517-  582 (106.78/66.17)	ERRIKQK.KAREDFRIMLEE...CKELAPSTRWSKAISIFEHD.ERFKAV...ERAKD..RE....DL........FEDYMEELEKKE
  585-  649 (94.05/57.38)	RALEEQK.RNRVEYLEFLKS...CDFIKASSQWRKVQDRLEDD.ERCSRL...EKI.D..RL....EI........FQEYIRDLEREE
  659-  729 (71.79/42.00)	EMRKAER.KNRDEFRKLMEEhvaAGILNAKTNW.RDYCIKVKDlPAYLAVssnTSGPT..AK....DL........FQDVFDELEKQ.
  731-  790 (48.58/25.96)	...LDDKsRIRDAVR..MA......EIGLTSTW.....MLE.D...FK.V...AIAKDisSP....PIsdtnlkfvFEELLERARERE
  793-  852 (71.30/41.66)	EAK.RRK.RLADEFYELLHT...SKEITASSKWEDCKSIF.........G...DRIMG..EEsfllEI........FDKFISELKEK.
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      50.78|      18|      19|     408|     426|       6
---------------------------------------------------------------------------
  415-  433 (24.83/15.54)	DGvSLENTEE..VKKDAMISE
  434-  453 (25.94/11.96)	TG.SVTPSEEktVEPGPLVYE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      45.32|      17|      20|     364|     383|       7
---------------------------------------------------------------------------
  364-  383 (21.72/21.46)	G.AVEMQNSLepASPaVATSE
  386-  403 (23.60/11.42)	GtAVTLENSV..ATP.VTSSE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      56.05|      18|      19|     319|     336|       8
---------------------------------------------------------------------------
  319-  336 (31.79/19.80)	PGAN.GSPVSAQGAMSSPI
  340-  358 (24.27/13.31)	PVSNlPAIVASESSLSGNI
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP10898 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) AKEKERKRQEDKSRKEKERKDREKKKEKHRRDKDRGDKSRKERERTKKDGTDSDKADTYSFEEIKRLGSDRDKKHRKRHMSSFDDNENEKDHSRNSYRHDNDHKKSKQVDQHVWSSEVNSEGQHKKQKRDHRSGSLRDGDNEDHKDGEFGEDGEVR
2) MANNSHFTGMQPPIPPLAGPPGPHQGAIPSMSLQFQPMLPPQQAQPYVSGPSQQFQPLGHTNVAMLPQPSQIQFPQPMQQVAGRPVVGGHSMPQGPPTPHDFQRNRPVTPVQAPLQPNLPMSNNHMPGAGGPSLPLSSSYNQVNADSSSSQYQMQIHDHSFPSGGQPWMSTSNHNVNSVTNMQKTGE
3) QNSLEPASPAVATSEKNGTAVTLENSVATPVTSSEFPSA
4) SDFGSISGVKTSSPGANGSPVSAQGAMSSP
5) TEEVKKDAMISETGSVTPSEEKTVEPGPLV
853
1
369
306
422
1008
187
407
335
451

Molecular Recognition Features

MoRF SequenceStartStop
1) YSFEE
911
915