<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP20386

Description Pre-mRNA-processing protein 40A
SequenceMDNNNPQNQSLRPPAIGSQGFGTNYSMQFRPGVPAQPGHPYMQPASGTQQYRPMGQPGQFSQPMQQMPARPGQPAPSSQPMPMSYMQQPPQYQNPAAGPGGVPHSSYAFTAPPFGQPTNVSSQFHAVSQMHTPVAPTGGQPWMPSGNQGPGNVTPVSQPGQQPSTTAPTNQASSAGQEASAADWQEFTAADGRRYYYNKRTKQSSWEKPMELMTPLERADASTVWKEFTTAEGKKYYYNKDTKQSKWTIPEELKLAREQAEKEASRGTQSPAILTPSAIDQPSASASSANGVSSSPVAVMPVIPVGSAPSVASESSSLASPLIPAVGGSSMVSANVHSSPLGKTEKSSSHGAPEAVDGVSVQDTEEAKKGMAAAGKVNTTPSEEKVVDGETFIYATKQEAKAAFKSLLEDSNVRADWNWEQAMRVIINDKRYGALKALGERKQAFNEYLMQRKKLEAEERRLKQRKAKEEFTKMLEESKELTSSMRWSKAIALFEDDERYKAVDRSSDREDLFQSYLVDLQKKERAKAQEEQRQYRLEYRQFLETCGLIKVDTQWRKVQDRLEDDERCSRLEKFDRLEIFQEYIRDLEKEEDEQRKIKKERIKRVERKNRDEFRKMLDEHVVSGTLNAKTQWRDYCPKVKESVAYQAVATNTSGSTPKDLFEDVVEELERKYHEDKSRVKDALKLKKVAVLPTWTFEDFKAASDEDIISSLSDINLQLVFEDLIERAKEKEEKELKKRQRLVKDFTDLLYDMKEITASSTWEECHQLFEESSEYRAISDESIAREAFEEYVARLVEKAKEKERKREEEKARKEEKEKRKEKERKEREKEKEKEKEKERSKKDDSDSENIDTVSEHKEDKKREKDKDRKHRKRHQDDLSSDKDERDDSKKSRRHGSDRKKSRKHTYSPESDSESKHKKHKKDSRSKNGDGDVDGQEELEDGELGEDGEIQ
Length949
PositionUnknown
OrganismArtemisia annua (Sweet wormwood)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> asterids> campanulids> Asterales> Asteraceae> Asteroideae> Anthemideae> Artemisiinae> Artemisia.
Aromaticity0.07
Grand average of hydropathy-1.191
Instability index56.23
Isoelectric point6.29
Molecular weight108229.92
Publications
PubMed=29703587

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP20386
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     147.27|      37|      38|     184|     220|       1
---------------------------------------------------------------------------
  184-  220 (75.12/44.72)	WQEFTAADGRRYYYNKRTKQSSWEKPMELMTPLERAD
  225-  261 (72.15/42.67)	WKEFTTAEGKKYYYNKDTKQSKWTIPEELKLAREQAE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      77.91|      16|      17|      49|      64|       3
---------------------------------------------------------------------------
   42-   60 (28.34/12.36)	MQPasgT.QQY..RPMGQPGQF
   61-   77 (26.18/10.91)	SQP...M.QQMpaRP.GQPAPS
   78-   92 (23.39/ 9.04)	SQP...MpMSY....MQQPPQY
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     482.43|      65|      65|     397|     461|       4
---------------------------------------------------------------------------
  397-  461 (94.66/58.55)	KQ.EAKAAFKSLLED.S.N.VRADWN....WEQAMRVIINDKRYGAL...KALGERKQAFNEYLMQ.........RKKLEAEERR
  463-  529 (83.86/50.99)	KQrKAKEEFTKMLEE.SkE.LTSSMR....WSKAIALFEDDERYKAV...DRSSDREDLFQSYLVD.........LQKKERAKAQ
  532-  596 (85.71/52.29)	QR.QYRLEYRQFLET.C.GlIKVDTQ....WRKVQDRLEDDERCSRL...EKF.DRLEIFQEYIRD.........LEKEEDEQRK
  606-  671 (71.00/41.99)	ER.KNRDEFRKMLDE.H.V.VSGTLNaktqWRDYCPKVKESVAYQAVatnTSGSTPKDLFEDV...............VEELERK
  676-  734 (69.41/40.88)	KS.RVKDALK..LKK.V.A.VLPTWT....FED.FKAASDEDIISSL...SDI.NLQLVF.EDLIE.........RAK.EKEEKE
  738-  812 (77.80/46.75)	RQ.RLVKDFTDLLYDmK.E.ITASST....WEECHQLFEESSEYRAI...SDESIAREAFEEYVARlvekakekeRKREEEKARK
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      73.96|      19|      19|     293|     311|       5
---------------------------------------------------------------------------
  133-  154 (27.50/13.47)	PVAPTGGQpwmPSG...NQGPGNVT
  282-  300 (27.24/13.27)	PSASASSA...NGV...SSSPVAVM
  301-  320 (19.22/ 7.24)	PVIPVGSA...PSVaseSSSLAS..
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      68.39|      15|      15|     898|     912|       6
---------------------------------------------------------------------------
  833-  847 (19.76/ 6.72)	EKEKERSKKDDSDSE
  898-  912 (24.66/10.19)	KKSRKHTYSPESDSE
  916-  930 (23.98/ 9.71)	KKHKKDSRSKNGDGD
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      89.23|      29|      29|     330|     358|      10
---------------------------------------------------------------------------
  330-  358 (47.66/31.36)	SMVSANVHSSPLGKTEKSSSHGAPEAV.DG
  360-  389 (41.57/26.29)	SVQDTEEAKKGMAAAGKVNTTPSEEKVvDG
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP20386 with Med35 domain of Kingdom Viridiplantae

Unable to open file!