Detailed information on MDP12962

Description pre-mRNA-processing protein 40C-like isoform X2
Sequence MQPPLPVPQGALSSSASFSFTPNPQLVQNAQIQPSKSDMLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPVTSRMPTTPPFPMSSGSSGTSGTLGHPVSVPSIQMITASAAVDSPSSAVPGPGAPVSLNPAVQQQVYPPYTSLPSMVSSPQGYWMQHPPMGGFPRPPFVPYPTVYPGPFPSTSSGMPLPAPSSDSQPPGFRPLGMSPFAPSAAALANQSLAILTGFPPQGIDNRKLVHDVTTKVESAGNEQSDVWTAHKTDTGVVYYYNALTGESTYEKPAGFKGEPDQVTVQPTPVSVEQLAGTDWALVTTNDGKKYYYNSKTKISSWQIPNEVTELRKKQDSEVSKENAVSVPNIDVVAEKGSTPISLSAPAVNTGGRDAMPLRTSVVPGSSSALDLIKKKLQDPGVPSSSPVPVMPVTATHELNGLRAVDVKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSHSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKQLLDEASEDIGHDTNYQTFKRKWGSDPRFEALDRKDRELLLNERVLLLKRAAEEKARAIRAAAASSFKSMLKEKGDINVNSRWSRVKDSLRDDPRYKCVKHEDREVLFNEYISELKAIEEKAERKDKVKKEEEKLKERERELRKRKEREEQEMERVRLKVRRKEAVASFQALLVETIKDSQASWTESKPKLEKDPQGRAANPDLDSSDMEKLFREHIKMLFERCVNDFRALLAKVITQDAAAQETEGGKTALNSWSTAKRLLKPDPRYNKMPRKEREALWRRYAEDMLRKQKLALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR
Length 885
Position Unknown
Organism Gossypium hirsutum (Upland cotton) (Gossypium mexicanum)
Kingdom Viridiplantae
Lineage Eukaryota > Viridiplantae > Streptophyta > Embryophyta > Tracheophyta > Spermatophyta > Magnoliopsida > eudicotyledons > Gunneridae > Pentapetalae > rosids > malvids > Malvales > Malvaceae > Malvoideae > Gossypium
Aromaticity 0.06
Grand average of hydropathy -0.740
Instability index 55.57
Isoelectric point 8.96
Molecular weight 97742.94
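
The page does not say how the sequence-derived values above were computed. As a rough illustration only, the sketch below assumes two common definitions: aromaticity as the fraction of Phe, Trp and Tyr residues, and the grand average of hydropathy (GRAVY) as the mean Kyte-Doolittle hydropathy per residue. The function names, and the choice of a short fragment of the sequence (residues 266-291) as input, are made up for this example.

```python
# Kyte-Doolittle hydropathy scale.
# Assumption: this is the scale behind the GRAVY value reported on the page.
KYTE_DOOLITTLE = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def aromaticity(seq: str) -> float:
    """Fraction of aromatic residues (F, W, Y) in the sequence."""
    return sum(seq.count(aa) for aa in "FWY") / len(seq)


def gravy(seq: str) -> float:
    """Mean Kyte-Doolittle hydropathy over all residues."""
    return sum(KYTE_DOOLITTLE[aa] for aa in seq) / len(seq)


# Residues 266-291 of MDP12962, as listed in the repeat section of this page.
fragment = "WTAHKTDTGVVYYYNALTGESTYEKP"
print(f"aromaticity = {aromaticity(fragment):.2f}")
print(f"GRAVY       = {gravy(fragment):.3f}")
```

Running the same two functions on the full 885-residue sequence would be the way to compare against the reported 0.06 and -0.740, provided the page used these definitions.
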
Publications
PubMed=25893780

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP12962
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     271.13|      50|      58|       4|      53|       1
---------------------------------------------------------------------------
    4-   42 (53.51/18.48)	.................PLPVPQ.GA...LSSSASF....SFTPNP..QLVQNAQIQPSKSDMLAT
   43-  101 (68.01/25.22)	GTQAMAASSPStvsqsgPLPVHN.SSeftMNASTTP....SFAPVT..SRMPTTPPFPMSSGSSGT
  103-  146 (35.88/10.28)	GTLGHPVSVPS............iQM...ITASAAVdspsSAVPGPgaPVSLNPAVQQQ.......
  147-  195 (81.09/31.30)	.VYPPYTSLPS......MVSSPQ.GY...WMQHPPM....GGFPRP..PFVPYPTVYPGPFPSTSS
  196-  231 (32.65/ 8.78)	G...MPLPAPS.sdsqpPGFRPL.GM...SPFAPSA....AALANQ..SL................
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     449.13|      65|      65|     549|     613|       2
---------------------------------------------------------------------------
  486-  546 (71.78/35.61)	....IMQFKEMLKE..RGVAPFSKWEKELPKIVFDPRFKA....IpSHSARRSL........FEH.............YVK..TRAEEerkEKR
  549-  613 (107.16/56.45)	QKAAIEGFKQLLDEASEDIGHDTNYQTFKRKWGSDPRFEA....L.DRKDRELL........LNE.............RVLLLKRAAE...EKA
  617-  679 (97.98/51.05)	RAAAASSFKSMLKE.KGDINVNSRWSRVKDSLRDDPRYKC....V.KHEDREVL........FNE.............YISELK.AIE...EKA
  685-  724 (49.44/22.45)	VKKEEEKLK........................ERER.EL....R.KRKEREEQ........EME.............RVRLKVRRKE...AVA
  726-  798 (57.20/27.02)	.......FQALLVETIKD..SQASWTESKPKLEKDPQGRAanpdL.DSSDMEKL........FREhikmlfercvndfRALLAKVITQ...DAA
  800-  858 (65.58/31.96)	QET..EGGKTALN..........SWSTAKRLLKPDPRYNK....M.PRKEREALwrryaedmLRK.............QKLALDQ.EE...EK.
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      99.43|      26|      50|     266|     291|       3
---------------------------------------------------------------------------
  266-  291 (49.49/39.52)	WTAHKTDTGVVYYYNALTGESTYEKP
  318-  343 (49.93/39.95)	WALVTTNDGKKYYYNSKTKISSWQIP
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      40.28|      13|      14|     447|     460|       6
---------------------------------------------------------------------------
  447-  460 (18.08/14.58)	GLQSESNKDKlKDA
  464-  476 (22.20/12.93)	GSISDSSSDS.EDA
---------------------------------------------------------------------------




Explanation of the Stockholm format

The "Stockholm" format is a system for marking up features in a multiple alignment. Each mark-up annotation is preceded by a 'magic' label, of which there are four types (#=GF, #=GS, #=GR and #=GC). The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines may include any characters except whitespace; an underscore ("_") is used instead of a space.

#=GR (seqname) PP is generic per-sequence and per-column markup, with exactly one character per column. PP is the posterior probability, encoded with the characters [0-9*] (0 = 0.00-0.05; 1 = 0.05-0.15; * = 0.95-1.00).

The #=GC PP_cons line is the Stockholm-format consensus posterior probability annotation for an entire column. It is calculated as the arithmetic mean of the per-residue posterior probabilities in that column. This is useful in phylogenetic inference applications, for example, where it is common to mask columns of a multiple alignment that are not confidently aligned. The PP_cons line provides an objective measure of the confidence assigned to each column.

The #=GC RF line is the Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.
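
The relationship between the per-residue PP characters and the PP_cons line can be sketched in a few lines of Python. This is a minimal illustration, not how any particular tool implements it: a real aligner averages the underlying probabilities, whereas this sketch only has the one-character codes, so it assumes each code stands for the midpoint of its bin and that digits 2-9 continue the 0.1-wide bins implied by the examples above. The function names are hypothetical.

```python
PP_CHARS = "0123456789*"


def decode_pp(ch: str) -> float:
    """Map one PP character to a representative probability (assumed bin midpoint)."""
    if ch == "*":
        return 0.975          # bin 0.95-1.00
    if ch == "0":
        return 0.025          # bin 0.00-0.05
    return int(ch) / 10.0     # '1'..'9': bins centred on 0.1..0.9


def encode_pp(p: float) -> str:
    """Map a probability back to a single PP character."""
    if p >= 0.95:
        return "*"
    return str(int(p * 10 + 0.5))


def pp_cons(pp_rows: list[str]) -> str:
    """Column-wise arithmetic mean of the PP rows, ignoring gap characters."""
    cons = []
    for column in zip(*pp_rows):
        probs = [decode_pp(ch) for ch in column if ch in PP_CHARS]
        cons.append(encode_pp(sum(probs) / len(probs)) if probs else ".")
    return "".join(cons)


# Three aligned sequences, three columns; '.' marks a gap with no PP value.
rows = ["9*5", "7*.", "8*3"]
print(pp_cons(rows))
```

Columns whose consensus character falls below a chosen threshold could then be masked before phylogenetic inference.
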

Alignment of MDP12962 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR Sequence | Start | Stop
1) EKGSTPISLSAPAVNTGGRDAMPLRTSVVP | 373 | 402
2) GLRAVDVKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQF | 439 | 489
3) MQPPLPVPQGALSSSASFSFTPNPQLVQNAQIQPSKSDMLATGTQAMAASSPSTVSQSGPLPVHNSSEFTMNASTTPSFAPVTSRMPTTPPFPMSSGSSGTSGTLGHPVSVPSIQMIT | 1 | 118
4) QKLALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR | 848 | 885

Molecular Recognition Features

MoRF Sequence | Start | Stop
1) ALWRRYA | 835 | 841
2) FGRYSS | 871 | 876
3) LDLIKKKLQ | 408 | 416
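
The Start and Stop values in the disordered-region and MoRF listings appear to be 1-based and inclusive, so a region covers Stop - Start + 1 residues. The sketch below checks that reading against several of the listed regions and shows how a sub-region can be sliced out of a parent region under the same convention. The helper names are hypothetical, and IDR 3 is left out only to keep the example short.

```python
# (sequence, start, stop) rows copied from the listings on this page.
IDRS = [
    ("EKGSTPISLSAPAVNTGGRDAMPLRTSVVP", 373, 402),
    ("GLRAVDVKGLQSESNKDKLKDANGDGSISDSSSDSEDADSGPSKEECIMQF", 439, 489),
    ("QKLALDQEEEKHTDVKGRSSGGDFGRYSSGTRRTHERR", 848, 885),
]
MORFS = [
    ("ALWRRYA", 835, 841),
    ("FGRYSS", 871, 876),
    ("LDLIKKKLQ", 408, 416),
]


def span_length(start: int, stop: int) -> int:
    """Number of residues in a 1-based, inclusive [start, stop] range."""
    return stop - start + 1


def is_consistent(rows) -> bool:
    """True if every listed sequence is as long as its coordinates imply."""
    return all(len(seq) == span_length(start, stop) for seq, start, stop in rows)


def subregion(parent_seq: str, parent_start: int, start: int, stop: int) -> str:
    """Slice residues start..stop (1-based, inclusive) out of a parent region."""
    return parent_seq[start - parent_start : stop - parent_start + 1]


print(is_consistent(IDRS), is_consistent(MORFS))

# MoRF 2 (871-876) lies inside the C-terminal disordered region (848-885).
idr_seq, idr_start, _ = IDRS[2]
print(subregion(idr_seq, idr_start, 871, 876))
```

With Python's 0-based, end-exclusive slices, the same convention applied to the full sequence would be `sequence[start - 1:stop]`.
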