<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP12288

Description pre-mRNA-processing protein 40A-like isoform X3
SequenceMPDEVKLARQSMKVDLVKGLGKERDSISHASDFGSISGVKTSPLSANGSPVSAQGAMSSPIAVAPVSNLPTIVASESSSLSGNISSLTIGAVEMQNSLEPASPAVATSEKNGTAVTLENSVATPVTSSEFPSAQDSVVYEDGVSLENTEEVKKDATVSETGSATPSEEKTVEPGPLVYESKAEAKSAFKILLESANIGSDWTWDQAMRAIINDRRYGALKSLGERKQAFNEYLSHKKKLEAEERRIKQKKAREDFRIMLEECKELAPSTRWSKAISIFEHDERFKAVERAKDREDLFEDYMEELEKKERARALEEQKRNRVEYLEFLKSCDFIKASSQWRKVQDRLEDDERCSCLEKIDRLEIFQEYIRDLEREEEEHRKLRMDEMRKAERKNRDEFRKLMEEHVAAGILNAKTNWRDYCIKVKDLPAYLAVSSNTSGPKAKDLFQDVFDELEKQFVFEELLERAREREEKEAKRRKRLADEFYELLHTSKEITASSKWEDCKSIFGDRIMGEESFLLEIFDKFISELKEKAKEKERKRQEDKARKEKERKDREKKKEKHRRDKDRGDKSRKERERTKKDGTDSEKADTYSFEEIKRLGSDRDKKHRKRHMSSFDDNENEKDHSRNSYRHDNDHKKSKQVDQHVWSSEVNSEGQHKKQKRDHRSGSLRDGDNEDHKDGEFGEDGEVR
Length687
PositionUnknown
OrganismNicotiana tabacum (Common tobacco)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> asterids> lamiids> Solanales> Solanaceae> Nicotianoideae> Nicotianeae> Nicotiana.
Aromaticity0.06
Grand average of hydropathy-1.156
Instability index47.63
Isoelectric point6.06
Molecular weight79508.60
Publications
PubMed=24807620

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP12288
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     228.41|      40|      40|     305|     344|       1
---------------------------------------------------------------------------
  220-  255 (36.41/17.06)	KSLGER...K.QAFN..E.YLSHKK.......KLEAEERRIkQ.K.........................K....A.REDF
  256-  284 (34.51/15.74)	RIM....................LEEC..KELAPSTRWSKA.I.S.......................ifE....H.DERF
  311-  352 (63.08/35.73)	RALEEQ...K.RNRV..E.YLEFLKSC..DFIKASSQWRKV.Q.D.......................rlE....D.DERC
  353-  391 (31.38/13.55)	SCLE.....K.IDRL..EiFQEYIR....DLEREEEEHRKL.RmD.......................emR....K.AER.
  466-  536 (31.34/13.52)	REREEKeakR.RKRLadE.FYELLHTS..KEITASSKWEDC.K.SifgdrimgeesflleifdkfiselkE....KaKEKE
  537-  575 (31.70/13.76)	RKRQED...KaRKEK..E.........rkDREKKKEKHRRD.K.D.........................RgdksR.KERE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      69.94|      21|      32|     602|     630|       2
---------------------------------------------------------------------------
  602-  623 (34.68/24.86)	RDKKHRKRHMSSFDDNeNEKDH
  635-  655 (35.26/10.35)	KKSKQVDQHVWSSEVN.SEGQH
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      33.23|      12|      17|      66|      80|       3
---------------------------------------------------------------------------
   66-   79 (14.84/10.35)	VSnlPTIVASESSS
  101-  112 (18.39/ 7.71)	AS..PAVATSEKNG
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      59.73|      19|      38|     118|     140|       4
---------------------------------------------------------------------------
  118-  136 (33.12/15.52)	ENS..VATPVTSSEFPSAQDS
  146-  166 (26.60/16.17)	ENTeeVKKDATVSETGSATPS
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      34.89|      10|      17|      35|      44|       5
---------------------------------------------------------------------------
   35-   44 (17.02/11.00)	SISGVKTSPL
   52-   61 (17.87/11.90)	SAQGAMSSPI
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP12288 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) AKEKERKRQEDKARKEKERKDREKKKEKHRRDKDRGDKSRKERERTKKDGTDSEKADTYSFEEIKRLGSDRDKKHRKRHMSSFDDNENEKDHSRNSYRHDNDHKKSKQVDQHVWSSEVNSEGQHKKQKRDHRSGSLRDGDNEDHKDGEFGEDGEVR
2) QNSLEPASPAVATSEKNGTAVTLENSVATPVTSSEFPSA
3) TEEVKKDATVSETGSATPSEEKTVEPGPLV
532
95
148
687
133
177

Molecular Recognition Features

MoRF SequenceStartStop
1) ADTYSFEEIKR
2) DQHVWSSEVN
3) KDGEFGEDGEVR
4) KKEKHRRDKDR
5) QHKKQKR
587
641
676
556
654
597
650
687
566
660