<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP00524

Description Pre-mRNA-processing protein 40A isoform 3
SequenceMANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQVPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQSHQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAGQPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYYNKKTRQSSWEKPLELMTPIERADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAAIPVSSNTSQASSPVSVTPVAAVANPSPTLVSGSTVVPVSQSAATNASEVQSPAVAVTPLPAVSSGGSTTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKVPDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERARDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKVQHFQKRIQANSQWRKVQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFRKLMDEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQYQQDKTHIKDAMKSGKISMVSTWTVEDFKAAISEDVGSLPISDINLKLVYEELLKSAKEKEEKEAKKRQRLADDFTKLLHTYKEITASSDWEDSRPLFEESQEYRSIAEESLRREIFEEYIAYLQEKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKEREREREKGKERTKKDETDSENLDISDSHGHKEDKKKEKEKDRKHRKRHQSGGDDGSSDKDDREESKKSRRHGSDRKKSRKHAHSPESDNESRHKKHKRDHRDGSRRNSGYEELEDGEVGEDGEIQ
Length1041
PositionUnknown
OrganismTheobroma cacao (Cacao) (Cocoa)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> rosids> malvids> Malvales> Malvaceae> Byttnerioideae> Theobroma.
Aromaticity0.06
Grand average of hydropathy-1.089
Instability index59.01
Isoelectric point7.15
Molecular weight117717.31
Publications
PubMed=23731509

Function

Annotated function
GO - Cellular Component
cytosol	GO:0005829	IEA:EnsemblPlants
mediator complex	GO:0016592	IEA:EnsemblPlants
U1 snRNP	GO:0005685	IBA:GO_Central
U2-type prespliceosome	GO:0071004	IBA:GO_Central
GO - Biological Function
RNA binding	GO:0003723	IBA:GO_Central
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro
mRNA splicing, via spliceosome	GO:0000398	IBA:GO_Central

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP00524
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             7|     318.87|      38|      38|     216|     253|       1
---------------------------------------------------------------------------
    7-   36 (30.55/14.84)	...........PSSAqPHW.....PPAVGSLGPQSY...GSPLSSQFRP..
   39-   82 (41.43/23.38)	PMqqgqHF...VPAA.SQQfrpvgQVPSSNVGMPAVQNQ...QMQFSQPMQ
  100-  127 (45.32/26.43)	PM....HV...PFGQ.TN......RPLTSGSP.......QSHQTA..PPLN
  136-  156 (36.30/19.35)	P.....GM...PPSS.S.Y.....S...........Y....VPSSFGQPQN
  169-  205 (37.16/20.02)	QV....HAsvaPVAG.QPW........LSSGNQSVSLAIPIQQTGQQPPL.
  216-  253 (69.86/45.71)	PI....HT...PPSA.SDW.....QEHTSADGRRYYYNKKTRQSSWEKPLE
  257-  294 (58.25/36.59)	PI....ER...ADAS.TVW.....KEFTTPEGRKYYYNKVTKQSKWTIPEE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      67.12|      16|      16|     897|     912|       2
---------------------------------------------------------------------------
  893-  908 (26.29/11.87)	EKAKKEKERE.EKEKRK
  909-  925 (21.60/ 8.43)	EKERKEKERErEREKGK
  955-  968 (19.23/ 6.69)	EKDRKHRKRH.QSG..G
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      64.24|      16|      16|     317|     332|       3
---------------------------------------------------------------------------
  317-  332 (26.84/17.23)	ASQAPVAGAVSSAEMP
  333-  348 (21.28/11.83)	AAAIPVSSNTSQASSP
  401-  415 (16.12/ 6.84)	GSTTPVTSVNANTTM.
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             7|     468.18|      64|      65|     471|     534|       4
---------------------------------------------------------------------------
  419-  465 (38.48/17.00)	................L.ES...TA..........SQDsvHFT....NGASAQDIEEAKK.G.M..ATAG.K.VNVTPVE.......EKVPDDE
  471-  534 (98.09/54.10)	NKQEAK..N..AFKSLL.ES...AN..........VQS..DWT....WEQTMREIINDKRYGAL..KTLG.E.RKQAFNEYLG..QRKKLEAEE
  538-  599 (76.36/40.58)	RQKKAR..E..EFTKMLeES...KE..........LTS..SMR....WSKAQSLFENDERFKAV..ERAR.D.REDLFENYIV..E...LERKE
  602-  678 (66.93/34.71)	NAAEEKrrNiaEYRKFL.ES...CDfikvqhfqkrIQA..NSQ....WRKVQDRLEDDERCSRL..EKI..D.RLVMFQDYIH..DLEKEEEEK
  690-  753 (61.64/31.42)	ERKN.R..D..AFRKLM.DE...HV..........VDG..TLTaktyWRDYCLKVKDLPPYLAVasNTSG.StPKDLFEDV........VEELE
  759-  819 (65.15/33.60)	DKTHIK..D..AMKS...GK...IS..........MVS..TWT....VED.FKAAISED.VGSL..PISDiN.LKLVYEELLK..SAKEKEEKE
  822-  887 (61.52/31.34)	KRQRLA..D..DFTKLL.HTykeIT..........ASS..DWE....DSRPLFE..ESQEYRSI..AEES.L.RREIFEEYIAylQEKAKEKE.
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|     102.61|      22|      78|     927|     948|       5
---------------------------------------------------------------------------
  927-  948 (35.28/18.42)	RTKKDETDSENLDISDSHGHKE
  980- 1001 (32.81/16.61)	ESKKSRRHGSDRKKSRKHAHSP
 1008- 1028 (34.52/17.87)	RHKKHKRDHRDGSRRNS.GYEE
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP00524 with Med35 domain of Kingdom Viridiplantae

Unable to open file!