<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP12662

Description pre-mRNA-processing protein 40A-like
SequenceMAARWPTRRQNFAGKFSSDRYTTAGDFHWACLRAVFLDAYVILTHCTHASMFHPRLYSWKLLISIPEIRWGQRADASTDWREFTSPQGKKYYYNKVTRKSKWKMPDEVKLARQSMKVDLVKGLGKERDSISHASDFGSISGVKTSPLSANGSPVSAQGAMSSPIAVAPVSNLPTIVASESSSLSGNISSLTIGAVEMQNSLEPASPAVATSEKNGTAVTLENSVATPVTSSEFPSAQDSVVYEDGVSLENTEEVKKDATVSETGSATPSEEKTVEPGPLVYESKAEAKSAFKILLESANIGSDWTWDQAMRAIINDRRYGALKSLGERKQAFNEYLSHKKKLEAEERRIKQKKAREDFRIMLEECKELAPSTRWSKAISIFEHDERFKAVERAKDREDLFEDYMEELEKKERARALEEQKRNRVEYLEFLKSCDFIKASSQWRKVQDRLEDDERCSCLEKIDRLEIFQEYIRDLEREEEEHRKLRMDEMRKAERKNRDEFRKLMEEHVAAGILNAKTNWRDYCIKVKDLPAYLAVSSNTSGPKAKDLFQDVFDELEKQYLDDKSRIRDAIRMAEIGLTSTWTLEDFKVAISKDISSPPISDTNLKFVFEELLERAREREEKEAKRRKRLADEFYELLHTSKEITASSKWEDCKSIFGDRIMGEESFLLEIFDKFISELKEKAKEKERKRQEDKARKEKERKDREKKKEKHRRDKDRGDKSRKERERTKKDGTDSEKADTYSFEEIKRLGSDRDKKHRKRHMSSFDDNENEKDHSRNSYRHDNDHKKSKQVDQHVWSSEVNSEGQHKKQKRDHRSGSLRDGDNEDHKDGEFGVDGEVR
Length837
PositionUnknown
OrganismNicotiana sylvestris (Wood tobacco) (South American tobacco)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> asterids> lamiids> Solanales> Solanaceae> Nicotianoideae> Nicotianeae> Nicotiana.
Aromaticity0.08
Grand average of hydropathy-1.041
Instability index45.87
Isoelectric point7.60
Molecular weight97064.51
Publications
PubMed=23773524

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP12662
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             7|     453.53|      65|      65|     345|     409|       1
---------------------------------------------------------------------------
  270-  336 (61.56/29.63)	.........EEK.TV..EpgplvyesKAEAK..................SAFKIL.L.E...SANIG.SDWTWDQAMRAIIN.D.RRYGAL...K..SL.GERKQAFNEYL
  340-  404 (104.40/55.56)	KKL.EA...EER.RI..K........QKKAR..................EDFRIM.LEE...CKELA.PSTRWSKAISIFEH.D.ERFKAV...E..RA.KDREDLFEDYM
  405-  471 (83.74/43.05)	EEL.EK...KERaRAleE........QKRNR..................VEYLEF.LKS...CDFIK.ASSQWRKVQDRLED.D.ERCSCL...E..KI..DRLEIFQEYI
  472-  552 (60.57/29.03)	RDL.EReeeEHR.KL..R.mdemrkaERKNR..................DEFRKL.MEEhvaAGILN.AKTNW.RDYCIKVK.DlPAYLAVssnT..SG.PKAKDLFQDVF
  553-  634 (44.62/19.37)	DEL.EK.................qylDDKSRirdairmaeigltstwtlEDFKVA.I.....SKDISsPPISDTNLKFVFEElL.ERARER...EekEA.KRRKRLADEFY
  635-  675 (51.70/23.66)	E......................................................lLHT...SKEIT.ASSKWEDCKSIF...G.DRIMGE...E..SF...LLEIFDKFI
  676-  733 (46.93/20.77)	SELkEKakeKER.KR..Q........EDKAR.................kEKERKD.REK...KKE.K.H..RRDKD....RG.D.KSRKER...E..RTkKDGTD......
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      79.56|      19|      19|     771|     789|       2
---------------------------------------------------------------------------
  752-  769 (23.95/ 9.86)	RD.KKHR.KRHMSSFDDNEN
  771-  789 (33.91/16.77)	KDHSRNS.YRHDNDHKKSKQ
  791-  808 (21.69/ 8.29)	DQHVWSSeVNSEGQHKK..Q
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|     135.99|      33|      33|     120|     152|       3
---------------------------------------------------------------------------
  120-  152 (54.70/31.27)	VKGLGKERDSISHA..SDFGSISGVKTSPLSANGS
  154-  188 (46.03/25.36)	VSAQGAMSSPIAVApvSNLPTIVASESSSLSGNIS
  191-  215 (35.26/18.04)	TIGAVEMQNSLEPA..S.....PAVATS..EKNG.
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      70.97|      22|      27|     218|     239|       5
---------------------------------------------------------------------------
  218-  239 (38.50/27.84)	VTLENS..VATPVTSSEFPSAQDS
  246-  269 (32.48/22.30)	VSLENTeeVKKDATVSETGSATPS
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP12662 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) AKEKERKRQEDKARKEKERKDREKKKEKHRRDKDRGDKSRKERERTKKDGTDSEKADTYSFEEIKRLGSDRDKKHRKRHMSSFDDNENEKDHSRNSYRHDNDHKKSKQVDQHVWSSEVNSEGQHKKQKRDHRSGSLRDGDNEDHKDGEFGVDGEVR
2) QNSLEPASPAVATSEKNGTAVTLENSVATPVTSSEFPSA
3) TEEVKKDATVSETGSATPSEEKTVEPGPLV
682
198
251
837
236
280

Molecular Recognition Features

MoRF SequenceStartStop
1) GEFGVDGEVR
2) KADTYSFEEIKRL
3) KKYYYNKVTR
4) QHKKQKRDHR
5) QHVWSSEVN
6) RKDREKKKEKHRRDKDR
7) RKERER
828
736
89
804
792
700
721
837
748
98
813
800
716
726