<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP23684

Description Pre-mRNA-processing protein 40C isoform E
SequenceMPDSVAQDVGKLSSASSIPHSVPAHTSTSIMPPPSDPNYRPATSWMPTAMSFPVLPVMPTQGNPGPPGLASSAIISSNPAAPSTGTDSSPAALLRPNMPTSAIASDPTAPQKGLPYPSVPAMAAPPQGLWLQPPQMSGVLRPPYLQYPAPFPGPFPFPARGVALPAVPIPDSQPPGVTPVGAAGGTSTPSSSHQLRGTTALQTEVISGPADDKKKLNSVDTVNEDAANNDQLDAWTAHKTEAGIIYYYNAVTGESTYDKPAGFKGESHQVSAQPIPVSMMDLPGTDWRLVSTSDGKKYYYNNRTKTSCWQIPNEVAELKKKQDGDVTKDHLMSVSNTNVLSDRGSGMVTLNAPAINTGGRDAAALKPSSLQNSPSALDLIKKKLQDSGTPVASSSIPAPSVQTGPESNGSKTVDSTAKGLQVDNNKDKAKDTNGDANVSDTSSDSEDEDNGPSKEECIIQFKEMLKERGVVPFSKWEKELPKIVFDPRFKAIPSYSARRSLFEHYVKTRAEEERKEKRAAQKAAIEGFKRLLDEASEDINYNTDYQTFRKKWRNDPRFEALDRKEQEHLLNERVLPLKKAAEEKAQAMRAAAAASFKSMLKERGDISFNSRWSRVKENLRDDPRYKCVRHEDREVLFNEYISELKAAEHAAERETKAKMEEQDKLRERERELRKRKEREEQEMERVRLKIRRKDAVTLFQALLVETIKDPLVSWTESKPKLEKDAQRRATNPDLDPLDTEKLFREHVKMLQERCAHEFRVLLAEVLTSDAASQETDDGKTVLNSWSTAKRLLKSDPRYNKVPRKEREALWRRYAEDMLRRQKASHDSREEKHTDAEGRNYLESSKHPFESGRSYERR
Length857
PositionUnknown
OrganismGlycine soja (Wild soybean)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> rosids> fabids> Fabales> Fabaceae> Papilionoideae> 50 kb inversion clade> NPAAA clade> indigoferoid/millettioid clade> Phaseoleae> Glycine> Glycine subgen. Soja.
Aromaticity0.06
Grand average of hydropathy-0.843
Instability index51.65
Isoelectric point8.42
Molecular weight95349.82
Publications

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP23684
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     443.91|      66|      66|     521|     586|       1
---------------------------------------------------------------------------
  458-  517 (71.98/38.57)	....IIQFKEMLKE..RG..VVP.......FSKWEKELPKIVFDPRFKAIPSYSARRS...LFEHYVKTRAEEERKEK
  521-  586 (102.58/58.03)	QKAAIEGFKRLLDEASED..INY.......NTDYQTFRKKWRNDPRFEALDRKEQEHL...LNERVLPLKKAAEEKAQ
  589-  649 (86.61/47.87)	RAAAAASFKSMLKERG.D..ISF.......NSRWSRVKENLRDDPRYKCVRHEDREVL...FNEYISEL.KAAEH...
  650-  689 (41.65/19.27)	..AAERETKAKMEE..QD.....................KLRERER.ELRKRKEREEQ...EMERV.RLK........
  692-  759 (66.14/34.85)	RKDAVTLFQALLVETIKDplVSWteskpklEKDAQ..RRA..TNPDLDPLD...TEKL...FREHVKMLQERCAHEFR
  764-  829 (74.95/40.46)	EVLTSDAASQETDDGKTV..LN..........SWSTAKRLLKSDPRYNKVPRKEREALwrrYAEDMLRRQKASHDSRE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     200.63|      38|      40|      80|     117|       2
---------------------------------------------------------------------------
   21-   53 (27.10/ 7.55)	........SVPAHTST..SIMPP....PSDP....NyRPATSwMPtamsFP
   54-   79 (26.55/ 7.26)	VLPVMPTQGNPGP.....PGLASSAiISSNP....................
   81-  117 (64.45/27.00)	.APSTGTDSSPAALLR..PNMPTSA.IASDP....T.APQKG.LP....YP
  123-  158 (48.56/18.73)	AAPPQGLWLQP.......PQM..SG.VLRPPylqyP.APFPGpFP....FP
  182-  216 (33.97/11.13)	AAGGTSTPSSSHQ.LRgtTALQTEV.I.SGP....A.DDKKK.L.......
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     101.53|      26|      50|     235|     260|       4
---------------------------------------------------------------------------
  235-  260 (49.49/40.52)	WTAHKTEAGIIYYYNAVTGESTYDKP
  287-  312 (52.04/43.04)	WRLVSTSDGKKYYYNNRTKTSCWQIP
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      47.54|      13|     376|       7|      19|       8
---------------------------------------------------------------------------
    7-   19 (23.21/13.97)	QDVGKLSSASSIP
  385-  397 (24.34/15.01)	QDSGTPVASSSIP
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP23684 with Med35 domain of Kingdom Viridiplantae

Unable to open file!