<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP23685

Description Pre-mRNA-processing protein 40C
SequenceMCKPFLTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKLQDSGTPVASSSIPAPSVQTGPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAALKAAIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALDRKEQEHLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDDPRYKCVRHEDREVLFNEYISELKAAEHAAERETKAKREEQDKLRERERELRKRKEREEQEMERVRLKIRRKDAVTLFQALLVETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKLFREHVKMLQERCAHEFRVLLAEVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRGQKASHDSREEKHTDAEGRNYLESSKPPFESGRSYERR
Length558
PositionUnknown
OrganismGlycine soja (Wild soybean)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> rosids> fabids> Fabales> Fabaceae> Papilionoideae> 50 kb inversion clade> NPAAA clade> indigoferoid/millettioid clade> Phaseoleae> Glycine> Glycine subgen. Soja.
Aromaticity0.06
Grand average of hydropathy-1.063
Instability index43.03
Isoelectric point8.88
Molecular weight64156.29
Publications

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP23685
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     445.47|      64|      66|     224|     287|       1
---------------------------------------------------------------------------
  159-  216 (63.89/32.40)	..IIQFKEMLKE..RG..VA.............PF.SKW.......EKelpkivfDPRFKAIPSYSARRS...LFEHYV..KTRAEEERK
  224-  287 (107.81/59.66)	AAIEGFKRLLDEASED..INY.......NTDYQTFRKKW.......RN.......DPRFEALDRKEQEHL...LNERVLPLKKAAEEKAQ
  292-  353 (95.94/52.29)	AAAASFKSMLKERG.D..ISF.......NSRWSRVKENL.......RD.......DPRYKCVRHEDREVL...FNEYISEL.KAAEHAAE
  358-  398 (57.69/28.55)	A.....KR..EE..QD.....................KL.......RE.......RER.ELRKRKEREEQ...EMERV.RLKIRRKDAVT
  402-  460 (60.65/30.39)	A.......LLVETIKDplVSWteskpklEKDAQ..RRA.........T.......NPDLDPLD...TEKL...FREHVKMLQERCAHEFR
  465-  527 (59.49/29.66)	EVLTS.....DAASQE..TD..........DGKTVLNSWstakrllKS.......DPRYNKVPRKEREALwrrYAEDMLRGQKASHD...
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      62.92|      19|      43|      41|      59|       4
---------------------------------------------------------------------------
   41-   59 (33.61/24.93)	LSDRGSGMV..TLNAPAINTG
   85-  105 (29.31/20.84)	LQDSGTPVAssSIPAPSVQTG
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP23685 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) AELKKKQDGDVTKDHLMSVSNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKL
2) DSGTPVASSSIPAPSVQTGPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECII
3) EHAAERETKAKREEQDKLRERERELRKRKER
4) LRGQKASHDSREEKHTDAEGRNYLESSKPPFESGRSYERR
17
87
349
519
85
160
379
558

Molecular Recognition Features

MoRF SequenceStartStop
1) GRSYE
2) LDLIKKKL
3) WRRYA
552
78
511
556
85
515