Detailed information on MDP12559

Description: pre-mRNA-processing protein 40A isoform X3
SequenceMASNPPPSGPQPLWPPSVGSTPPQGFGSSFPMQFRPAVSTQQGQPFASSISASPQYRPVGQTSNAGMPPGHHSQIPQFSQPMQQFPPRPNQPGHGILLSQAIQMPFIQSSMPQPQQVIPPLNSHMPGVSGAGNPFSSSYTVQSSSQMHVPSFPSGGQPWLSSGSQNTPVTAPTLLTNQQLSAIAPSVPAGTASPQNASDWQEYEAADGRRYYYNKITKQSSWEKPLELMTPLERADASTVWKEFTTADGRKYYYNKETKQSKWTIPEELKLARELAEKSAGQVVQTGASTTSGVQVTAAVTSTEQPSAVTPVSSTPSSTVSGVASSPVPVTSAVSDHASPLVVSGSSAIPAVTPAMPSSSGVSSPAVSGSTGSAALANASQTQMSGFENLSPQVSSSLSGASIQDIEEAKKGMAVAGKVNVVPAEEKTTEEEPFLYATKQEAKNAFKALLEFANVESDWTWEQTMRVIINDKRYGALKTLGERKQAFNEYLMQRKKQEAEERRHRQRKAKEEFTKMLEESKELISSTRWSKAVTMFEDDERFKAVEREADREDLFRNYLVDLQKKERAKAQEEYRQNRLEYRHFLETCGFIKVDTQWRKVQDLLEDDERCLRLEKIDRLDIFQEYIRDLEKEDEEQRKLQKEQLRRTERKNRDAFRKMMEEQITAGTLTAKTSWRDYCQMVKESVAYQAVASNTSGSTPKDLFEDVVEELEKQYHEDKIRVKDVVKSEKITISSTWTFEDFKATILEGIGSPSIHDVNLQLIFEDLIERAKEKEEKETKKRQRLAKDFTDKLSTIKEITASSSWEECKELVEDTSEFRAIGEETISRAVFEEYVAWLQEKAKEKERRREEEKAKKEKEKEEKEKRKDKERREKEREKEKERDKEKERGNERATKDEADSESMDVTDNYEPKEERKREKDRERKHRKRHHISNDEVTSDKDEKEEPERDKERKHRKRHHSSNDELASDKDEKEESKRSRRHSSDRKKSKKLAHSPESDGESRHKRHRRDHRDGSRRNGGGYEELEDGELGEDGES
Length: 1034
Position: Unknown
Organism: Nicotiana sylvestris (Wood tobacco) (South American tobacco)
Kingdom: Viridiplantae
Lineage: Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliopsida > eudicotyledons > Gunneridae > Pentapetalae > asterids > lamiids > Solanales > Solanaceae > Nicotianoideae > Nicotianeae > Nicotiana.
Aromaticity: 0.06
Grand average of hydropathy: -1.098
Instability index: 60.24
Isoelectric point: 6.11
Molecular weight: 117534.98
Publications
PubMed=23773524
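
The aromaticity and grand average of hydropathy (GRAVY) values listed for this entry can be illustrated with the standard definitions: aromaticity as the fraction of Phe, Trp and Tyr residues, and GRAVY as the mean Kyte-Doolittle hydropathy per residue. The sketch below is a minimal illustration under the assumption that the entry uses these definitions; the function names are hypothetical, and the example runs on a short fragment of this protein rather than on the full sequence.

```python
# Kyte-Doolittle hydropathy scale, one value per standard amino acid.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq):
    """Fraction of residues that are Phe (F), Trp (W) or Tyr (Y)."""
    seq = seq.upper()
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq):
    """Mean Kyte-Doolittle hydropathy over all residues."""
    seq = seq.upper()
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Short fragment taken from this entry (residues 985-993), for illustration only.
fragment = "KKSKKLAHS"
print(round(aromaticity(fragment), 2))
print(round(gravy(fragment), 3))
```

A strongly negative GRAVY, such as the -1.098 reported for the full protein, indicates a hydrophilic sequence.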

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP12559
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     175.60|      26|      26|     919|     944|       1
---------------------------------------------------------------------------
  512-  539 (23.88/ 8.38)	EFTKMLEESKE.......liSSTRWSKAVTMFEDD
  768-  786 (24.39/ 8.73)	ERAKEKEEK.E.........TKKR.QRLAK.....
  856-  873 (26.28/10.03)	E..KEKEEKEK.........RKDKERR......EK
  874-  898 (34.31/15.59)	EREKEK.ERDK.........EKERGNERATKDEAD
  934-  962 (37.35/17.69)	EVTSDKDEKEE......perDKERKHRKRHHSSND
  963-  997 (29.38/12.18)	ELASDKDEKEEskrsrrhssDRKKSKKLAHSPESD
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     107.30|      22|      23|      24|      45|       2
---------------------------------------------------------------------------
   12-   44 (36.10/18.75)	PlwppsvgstppQGF.GSSF...PMQFRPAVSTQQ.GQ
   45-   67 (19.15/ 6.61)	P..............fASSIsasP.QYRPVGQTSNaGM
   68-   93 (26.37/11.78)	P...pghhsqipQ.F...SQ...PMQQFPPRPNQP.G.
   95-  113 (25.67/11.28)	.............GI.LLSQ...AIQM.PFIQSSM.PQ
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     136.37|      35|      38|     194|     228|       3
---------------------------------------------------------------------------
  194-  228 (72.36/45.81)	PQNASD....WQEYEAADGRRYYYNKITKQSSWEKPLEL
  231-  269 (64.01/39.64)	PLERADastvWKEFTTADGRKYYYNKETKQSKWTIPEEL
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     124.69|      23|      23|     586|     608|       4
---------------------------------------------------------------------------
  586-  607 (33.22/18.13)	......ETCGFIKVDTQWRKVQDLLEDD
  608-  633 (19.39/ 7.60)	ErclrlEKID..RLDIFQEYIRDLEKED
  645-  662 (24.31/11.35)	R.....RTER..KNRDAFRK...MMEEQ
  664-  684 (27.12/13.49)	.......TAGTLTAKTSWRDYCQMVKES
  794-  813 (20.65/ 8.57)	.......TIKEITASSSWEECKELVED.
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     114.43|      15|      23|     124|     138|       5
---------------------------------------------------------------------------
  124-  138 (27.08/14.63)	HMPGVSGAGNPFSS....S
  148-  162 (25.92/13.65)	HVPSFPSGGQPWLS....S
  327-  341 (20.03/ 8.74)	PVPVTSAVSDHASP....L
  355-  373 (20.89/ 9.45)	AMPSSSGVSSPAVSgstgS
  383-  397 (20.51/ 9.13)	QMSGFENLSPQVSS....S
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     137.79|      39|      66|     472|     510|       6
---------------------------------------------------------------------------
  425-  473 (26.98/12.75)	........EE...KTTEEEPF..LYATKQEAKnafkallefanvesdwtwEQTMRVIIND.............................KR
  474-  541 (50.70/31.43)	YGALKTLGER...KQAFNEYL..MQRKKQEAE..................ERRHRQRKAKeeftkmleeskelisstrwskavtmfeddER
  542-  565 (23.35/ 9.89)	FKAVEREADR...EDLFRNYL..VDLQKK..............................................................
  817-  854 (36.76/20.46)	FRAI...GEEtisRAVFEEYVawLQEKAKE.K..................ERRREEEKAK...............................
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      42.56|      12|      32|     283|     294|       7
---------------------------------------------------------------------------
  283-  294 (21.18/12.47)	VVQTGASTTSGV
  312-  323 (21.38/12.67)	VSSTPSSTVSGV
---------------------------------------------------------------------------
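
Each repeat row above has the layout "from- to (value/value)" followed by the aligned repeat unit, where "." marks an alignment gap and lower-case letters mark inserted residues. The sketch below parses one such row; it assumes the two bracketed numbers are a score and a z-score, and the function and field names are hypothetical.

```python
import re

# from- to (score/z-score) <whitespace> aligned repeat unit
ROW_PATTERN = re.compile(
    r"^\s*(\d+)-\s*(\d+)\s*\(\s*([\d.]+)/\s*([\d.]+)\)\s+(\S+)\s*$"
)


def parse_repeat_row(line):
    """Parse one repeat row into a dict, or return None if it does not match."""
    match = ROW_PATTERN.match(line)
    if match is None:
        return None
    start, stop, score, z_score, aligned = match.groups()
    return {
        "start": int(start),
        "stop": int(stop),
        "score": float(score),      # assumed meaning of the first number
        "z_score": float(z_score),  # assumed meaning of the second number
        "aligned": aligned,
        # Drop gap characters and upper-case inserted residues.
        "residues": aligned.replace(".", "").upper(),
    }


row = "  512-  539 (23.88/ 8.38)\tEFTKMLEESKE.......liSSTRWSKAVTMFEDD"
parsed = parse_repeat_row(row)
print(parsed["start"], parsed["stop"], parsed["residues"])
```

For this row the ungapped unit has as many residues as the span from 512 to 539, which is a quick consistency check on the parse.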




Explanation of the Stockholm format: The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines may include any characters except whitespace; an underscore ("_") is used instead of a space.

#=GR (seqname) PP is generic per-sequence and per-column markup with exactly one character per column, where PP is the posterior probability encoded as [0-9*] (0 = 0.00-0.05; 1 = 0.05-0.15; * = 0.95-1.00).
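
The PP encoding can be sketched as a small lookup from character to probability bin. Only the bins for 0, 1 and * are stated above; the bins for 2 to 9 are inferred here from that pattern, and treating "." as a gap with no probability is an assumption. The function name is hypothetical.

```python
def pp_range(symbol):
    """Return the (low, high) posterior-probability bin for one PP character.

    Returns None for "." (assumed to mark a gap).
    """
    if len(symbol) != 1:
        raise ValueError("expected a single character")
    if symbol == ".":
        return None
    if symbol == "*":
        return (0.95, 1.00)
    if symbol == "0":
        return (0.00, 0.05)
    if symbol in "123456789":
        center = int(symbol) / 10
        # Bins for 2-9 are inferred: each digit d covers d/10 +/- 0.05.
        return (round(center - 0.05, 2), round(center + 0.05, 2))
    raise ValueError(f"unexpected PP character: {symbol!r}")


for ch in "01589*":
    print(ch, pp_range(ch))
```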

#=GC PP_cons line is the Stockholm-format consensus posterior probability annotation for an entire column. It is calculated as the arithmetic mean of the per-residue posterior probabilities in that column. This is useful in phylogenetic inference applications, for example, where it is common to mask columns of a multiple alignment that are not confidently aligned. The PP_cons line provides an objective measure of the confidence assigned to each column.
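
The column mean and the masking step can be sketched as follows. The toy alignment, the probability values and the 0.5 threshold are hypothetical, and excluding gap positions (None) from the mean is an assumption rather than a documented rule.

```python
def column_means(pp_rows):
    """Arithmetic mean of per-residue posterior probabilities for each column.

    pp_rows holds one list per sequence; None marks a gap and is skipped.
    """
    means = []
    for column in zip(*pp_rows):
        values = [p for p in column if p is not None]
        means.append(sum(values) / len(values) if values else None)
    return means


def mask_columns(aligned_seqs, means, threshold=0.5):
    """Keep only the columns whose mean probability reaches the threshold."""
    keep = [i for i, m in enumerate(means) if m is not None and m >= threshold]
    return ["".join(seq[i] for i in keep) for seq in aligned_seqs]


# Toy example: two aligned sequences with three columns each.
seqs = ["ACD", "AC-"]
pps = [[0.9, 0.2, 1.0], [0.7, 0.4, None]]
means = column_means(pps)
print(means)
print(mask_columns(seqs, means))
```

Here the middle column falls below the threshold and is removed from both sequences.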

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.
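
As a minimal sketch of how the RF line can be used, the snippet below keeps only the alignment columns marked with an x. The toy RF string and aligned sequence are hypothetical, as are the function names.

```python
def consensus_columns(rf_line):
    """Indices of the columns that the RF line marks as consensus ("x")."""
    return [i for i, ch in enumerate(rf_line) if ch == "x"]


def project_to_consensus(aligned_seq, rf_line):
    """Return the residues of one aligned sequence at the consensus columns."""
    return "".join(aligned_seq[i] for i in consensus_columns(rf_line))


rf = "xx..xx"
aligned = "ACdeFG"
print(consensus_columns(rf))
print(project_to_consensus(aligned, rf))
```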

Alignment of MDP12559 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR Sequence	Start	Stop
1) ELAEKSAGQVVQTGASTTSGVQVTAAVTSTEQPSAVTPVSSTPSSTVSGVASSPVPVTS	274	332
2) KERRREEEKAKKEKEKEEKEKRKDKERREKEREKEKERDKEKERGNERATKDEADSESMDVTDNYEPKEERKREKDRERKHRKRHHISNDEVTSDKDEKEEPERDKERKHRKRHHSSNDELASDKDEKEESKRSRRHSSDRKKSKKLAHSPESDGESRHKRHRRDHRDGSRRNGGGYEELEDGELGEDGES	844	1034
3) MASNPPPSGPQPLWPPSVGSTPPQGFGSSFPMQFRPAVSTQQGQPFASSISASPQYRPVGQTSNAGMPPGHHSQIPQFSQPMQQFPPRPNQPGHGILLSQAIQMPFIQSSMPQPQQVIPPLNSHMPGVSGAGNPFSSSYTVQSSSQMHVPSFPSGGQPWLSSGSQNTPVTAPTLLTNQQLSAIAPSVPAGTASPQNASDWQ	1	201

Molecular Recognition Features

MoRF Sequence	Start	Stop
1) GYEELEDGEL	1019	1028
2) ILLSQAIQ	96	103
3) KKSKKLAHS	985	993
4) SRHKRHRRDHRD	1000	1011
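
The start and stop positions in the IDR and MoRF listings read as 1-based, inclusive coordinates on the protein sequence. The sketch below checks that reading against the MoRF entries, assuming the positions pair with the sequences in the order listed; the helper names are hypothetical.

```python
# (sequence, start, stop) for the four MoRFs of this entry.
MORFS = [
    ("GYEELEDGEL", 1019, 1028),
    ("ILLSQAIQ", 96, 103),
    ("KKSKKLAHS", 985, 993),
    ("SRHKRHRRDHRD", 1000, 1011),
]


def span_length(start, stop):
    """Number of residues covered by 1-based, inclusive coordinates."""
    return stop - start + 1


def is_consistent(seq, start, stop):
    """True if the sequence length matches its coordinate span."""
    return len(seq) == span_length(start, stop)


def extract(full_seq, start, stop):
    """Slice a region out of a sequence using 1-based, inclusive coordinates."""
    return full_seq[start - 1:stop]


for seq, start, stop in MORFS:
    print(seq, start, stop, is_consistent(seq, start, stop))
```

With the full protein sequence loaded into a string, `extract(sequence, 96, 103)` would be expected to return the second MoRF under the same coordinate convention.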