<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP12296

Description pre-mRNA-processing protein 40A-like
SequenceMTDMERADASTDWREFTSPQGKKYYYNKVTRKSKWKMPDEVKLARQGMKVDLVKGLGKERDSISHASDFGSISGVKTSSPSANGSPVSAQGAMSSPIAVAPVSNLPTIVASESSSLSGNISSLTIGAVEMQNSLEPASPAVATSEKNGTAVTLENSVATPVTSSEFPSAQDSIVYEDGVSLENTEEVKKDATISETGSVTPSEEKTVEPGPLVYESKAEAKSAFKILLESANIGSDWTWDQAMRAIINDRRYGALKSLGERKQAFNEYLSHKKKLEAEERRVKQKKAREDFRIMLEECKELAPSTRWSKAISIFEHDERFKAVERAKDREDLFEDYMEKLEKKERARALEEQKRNRVEYLEFLKSCDFIKASSQWRKVQDRLEDDERCSCLEKIDRLEIFQEYIRDLEREEEEHRKLRMDELRKAERKNRDEFRKLMEEHLAAGILNAKTNWRDYCIKVKDLPAYQAVSSNTSGPTAKDLFQDVFDELEKQYLDDKSRIRDAVRMAEIGLTSTWTLEDFKVAIAKDISSPPISDTNLKFVFEELLERAREREEKEAKRRKRLADEFYELLHTSKEITASSKWEDCKSIFGDRIMGEESFLLEIFDKFSSELKEKAKEKERKRQEDKARKEKERKDREKKKEKHRRDKDRGDKSRKERERTKKDGTDSDKADTYSFEEIKRLGSDRDKKHRKRHMSSFDDNENEKDHSRNSYRHDNDHKKSKQVDQHVWSSEVNSEGQHKKQKRDHRSGSHRDGDSEDHKDGEFGEDGEVR
Length770
PositionUnknown
OrganismNicotiana tabacum (Common tobacco)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> asterids> lamiids> Solanales> Solanaceae> Nicotianoideae> Nicotianeae> Nicotiana.
Aromaticity0.07
Grand average of hydropathy-1.147
Instability index45.28
Isoelectric point6.38
Molecular weight89114.22
Publications
PubMed=24807620

Function

Annotated function
GO - Cellular Component
U1 snRNP	GO:0005685	IBA:GO_Central
U2-type prespliceosome	GO:0071004	IBA:GO_Central
GO - Biological Function
RNA binding	GO:0003723	IBA:GO_Central
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro
mRNA splicing, via spliceosome	GO:0000398	IBA:GO_Central

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP12296
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     111.74|      19|      19|     619|     637|       1
---------------------------------------------------------------------------
  414-  431 (27.71/11.38)	HRKLRMDELRKA.E......RKNRD
  619-  637 (32.68/14.79)	ERKRQEDKARKEKE......RKDRE
  641-  658 (29.66/12.72)	EKHRR.DKDRGDKS......RKERE
  692-  716 (21.69/ 7.26)	RHMSSFDDNENEKDhsrnsyRHDND
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     413.43|      65|      65|     273|     337|       2
---------------------------------------------------------------------------
  203-  269 (70.08/36.99)	.....EEK.TV..EpgplvyesKAEAK..S..................A.FKILL.ESANIGSD.....WTWDQAMRAIIN....D.RRYGAL...KSLGE..........RKQAFNEYL
  273-  337 (107.78/60.97)	KKLEAEER.RV..K........QKKAR..E..................D.FRIMLEECKELAPS.....TRWSKAISIFEH....D.ERFKAV...ERAKD..........REDLFEDYM
  338-  404 (85.01/46.49)	EKLEKKERaRAleE........QKRNR..V..................E.YLEFLKSCDFIKAS.....SQWRKVQDRLED....D.ERCSCL...EKI.D..........RLEIFQEYI
  408-  485 (48.70/23.40)	EREE......................E..EhrklrmdelrkaerknrdE.FRKLMEE..HLAAGilnakTNW.RDYCIKVK....DlPAYQAVssnTSGPT..........AKDLFQDVF
  486-  545 (53.77/26.62)	DELE...K.QY..L........DDKSRirD..................A.VR..MA...EIGLT.....STWTL.............EDFK.V...AIAKDissppisdtnLKFVFEELL
  549-  599 (48.08/23.00)	R..EREEK.EA..K........RRKRL..A..................DeFYELLHTSKEITAS.....SKWEDCKSIFGDrimgE.ESF..............................
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|     125.39|      31|      50|      59|      89|       3
---------------------------------------------------------------------------
   59-   89 (51.74/33.16)	ERDSISHA..SDFGSISGVKTSSPSANGSPVSA
   93-  125 (42.49/25.99)	MSSPIAVApvSNLPTIVASESSSLSGNISSLTI
  130-  151 (31.16/17.22)	MQNSLEPA..S.....PAVATS..EKNGTAV..
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      41.12|      12|      19|     717|     729|       4
---------------------------------------------------------------------------
  717-  729 (18.68/13.20)	HKKSKQvDQHVWS
  738-  749 (22.45/11.74)	HKKQKR.DHRSGS
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP12296 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) ISHASDFGSISGVKTSSPSANGSPVSAQGAMSSP
2) KAKEKERKRQEDKARKEKERKDREKKKEKHRRDKDRGDKSRKERERTKKDGTDSDKADTYSFEEIKRLGSDRDKKHRKRHMSSFDDNENEKDHSRNSYRHDNDHKKSKQVDQHVWSSEVNSEGQHKKQKRDHRSGSHRDGDSEDHKDGEFGEDGEVR
3) QNSLEPASPAVATSEKNGTAVTLENSVATPVTS
4) TEEVKKDATISETGSVTPSEEKTVEPGPLV
63
614
131
184
96
770
163
213

Molecular Recognition Features

MoRF SequenceStartStop
1) GEFGEDGEVR
2) KADTYSFEEIKRL
3) QHKKQKRDH
4) QHVWSSEVN
5) REKKKEKHRRDKDR
6) WREFTSPQGKKYYYNKVTRKSKWKMPDEVKL
761
669
737
725
636
13
770
681
745
733
649
43