<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP00526

Description Pre-mRNA-processing protein 40A isoform 1
SequenceMANNSQPSSAQPHWPPAVGSLGPQSYGSPLSSQFRPVVPMQQGQHFVPAASQQFRPVGQVPSSNVGMPAVQNQQMQFSQPMQQFPPRPNQPGLSAPSAQPMHVPFGQTNRPLTSGSPQSHQTAPPLNSHMPGLGAPGMPPSSSYSYVPSSFGQPQNNVSASSQFQPTSQVHASVAPVAGQPWLSSGNQSVSLAIPIQQTGQQPPLISSADTAANAPIHTPPSASDWQEHTSADGRRYYYNKKTRQSSWEKPLELMTPIERADASTVWKEFTTPEGRKYYYNKVTKQSKWTIPEELKLAREQAQVVASQGAPSDTGVASQAPVAGAVSSAEMPAAAIPVSSNTSQASSPVSVTPVAAVANPSPTLVSGSTVVPVSQSAATNASEVQSPAVAVTPLPAVSSGGSTTPVTSVNANTTMIRSLESTASQDSVHFTNGASAQDIEEAKKGMATAGKVNVTPVEEKVPDDEPLVYANKQEAKNAFKSLLESANVQSDWTWEQTMREIINDKRYGALKTLGERKQAFNEYLGQRKKLEAEERRMRQKKAREEFTKMLEESKELTSSMRWSKAQSLFENDERFKAVERARDREDLFENYIVELERKERENAAEEKRRNIAEYRKFLESCDFIKANSQWRKVQDRLEDDERCSRLEKIDRLVMFQDYIHDLEKEEEEKKKMQKEQLRRAERKNRDAFRKLMDEHVVDGTLTAKTYWRDYCLKVKDLPPYLAVASNTSGSTPKDLFEDVVEELEKQYQQDKTHIKDAMKSGKISMVSTWTVEDFKAAISEDVGSLPISDINLKLVYEELLKSAKEKEEKEAKKRQRLADDFTKLLHTYKEITASSDWEDSRPLFEESQEYRSIAEESLRREIFEEYIAYLQEKAKEKERKREEEKAKKEKEREEKEKRKEKERKEKEREREREKGKERTKKDETDSENLDISDSHGHKEDKKKEKEKDRKHRKRHQSGGDDGSSDKDDREESKKSRRHGSDRKKSRKHAHSPESDNESRHKKHKRDHRDGSRRNSGYEELEDGEVGEDGEIQ
Length1032
PositionUnknown
OrganismTheobroma cacao (Cacao) (Cocoa)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> rosids> malvids> Malvales> Malvaceae> Byttnerioideae> Theobroma.
Aromaticity0.06
Grand average of hydropathy-1.089
Instability index59.31
Isoelectric point6.82
Molecular weight116551.96
Publications
PubMed=23731509

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP00526
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             7|     314.56|      38|      38|     216|     253|       1
---------------------------------------------------------------------------
    7-   36 (30.56/12.31)	.......PSSAQpHW.....PPAVGSLGPQSY...GSPLSSQFRP..
   45-   82 (36.89/16.50)	..HF...VPAAS.QQfrpvgQVPSSNVGMPAVQNQ...QMQFSQPMQ
  100-  127 (44.71/21.68)	PMHV...PFGQT.N......RPLTSGSP.......QSHQTA..PPLN
  136-  156 (36.19/16.04)	P.GM...PPSSS..Y.....S...........Y....VPSSFGQPQN
  169-  205 (38.02/17.25)	QVHAsvaPVAGQ.PW........LSSGNQSVSLAIPIQQTGQQPPL.
  216-  253 (69.90/38.37)	PIHT...PPSAS.DW.....QEHTSADGRRYYYNKKTRQSSWEKPLE
  257-  294 (58.29/30.68)	PIER...ADAST.VW.....KEFTTPEGRKYYYNKVTKQSKWTIPEE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|      78.33|      14|      15|     888|     901|       2
---------------------------------------------------------------------------
  888-  901 (22.59/ 9.73)	KEKEREEKEKRKEK
  934-  947 (17.97/ 6.26)	SHGHKEDKKKEKEK
  950-  963 (18.58/ 6.72)	KHRKRHQSGGDDGS
  995- 1008 (19.19/ 7.18)	DNESRHKKHKRDHR
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     455.31|      65|      65|     545|     609|       3
---------------------------------------------------------------------------
  479-  541 (79.04/40.36)	.FKSLL.E...SA....NVQSDWTWEQTMREIINDKRYG........AL...KTLGERKQAFNEYLGQRKKLEAEERRMRQ...KK
  545-  609 (103.35/54.84)	EFTKMLEE...SK....ELTSSMRWSKAQSLFENDERFK........AV...ERARDREDLFENYIVELERKERENAAEEK...RR
  613-  671 (87.00/45.10)	EYRKFLES...CD....FIKANSQWRKVQDRLEDDERCS........RL...EKI.DRLVMFQDYIHDLEKEE.....EEK...KK
  687-  760 (56.42/26.89)	AFRKLMDEhvvDG....TLTAKTYW........RDYCLKvkdlppylAVasnTSGSTPKDLFEDVVEELEKQYQQDKTHIKdamKS
  766-  820 (47.56/21.62)	VSTWTVED...FKaaisEDVGSLPISDINLKLVYEELLK........S......AKEKE........EKEAKKRQRLADD......
  821-  881 (81.95/42.09)	.FTKLLHT...YK....EITASSDWEDSRPLFEESQEYR........SI...AEESLRREIFEEYIAYLQEKAKE...KER...KR
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|     120.73|      30|      30|     351|     380|       4
---------------------------------------------------------------------------
  316-  347 (29.35/13.89)	V.ASQAPVAGA....VSSAEMpaaaIPVSSntSQASS
  351-  380 (51.48/29.67)	V.TPVAAVANPSPTLVSGSTV....VPVSQ..SAATN
  384-  412 (39.90/21.42)	VqSPAVAV.TPLPAVSSGGST....TPVT...SVNAN
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP00526 with Med35 domain of Kingdom Viridiplantae

Unable to open file!