<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP20733

Description Uncharacterized protein
SequenceMQQQQDDEPMGDGPPPAAAADGAAAAGSGGGSGGGGGGGGRSDRRAEVYTIEFDSLVYSLAWSVRPDKPFRIAVGTFREDTRNTVEILQLDESAGSFRRDNALSFTHPYPPTKLMFCPDPDTSRPDLLATTGDHLRVWRVGEGGVALDRLLSNNKNSEFCAPLTSFDWNDHDPKRLGTSSIDTTVTIWDIERGHVDTQLIAHDREVYDIAWGGLAIFATVAADGSVRVFDLRDKEHSTIIYEAPRPDTPLLRLSWNKQDPRYMATLPGESTEVVVLDIRYPTVPIAELSRHQGYVNALSWAPHSSCHICTVGDDAQALIWDLSSVGQQQQAQQQQQQQQQQQQPSGAAPANGPRGAAAAAAGGGASNGGGGSGIGLDPILSYTAGSDINALQWSVAHPDWIAINCGATTEVLRV
Length414
PositionHead
OrganismRaphidocelis subcapitata
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Chlorophyta> core chlorophytes> Chlorophyceae> CS clade> Sphaeropleales> Selenastraceae> Raphidocelis.
Aromaticity0.07
Grand average of hydropathy-0.409
Instability index45.98
Isoelectric point4.78
Molecular weight44295.34
Publications
PubMed=29795299

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP20733
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     260.91|      73|     330|       2|      79|       1
---------------------------------------------------------------------------
    2-   79 (125.61/56.36)	QQQQDDEPmgDGPPPAAAADGAAAAGSGGGSGGGGGGGGRSDRRAEVYTIEFDslVYSLAWSV.RPDkPFRIAVGTFRE
  337-  410 (135.30/50.73)	QQQQQQQP..SGAAPANGPRGAAAAAAGGGASNGGGGSGIGLDPILSYTAGSD..INALQWSVaHPD.WIAINCGATTE
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|      69.11|      16|      17|     178|     193|       2
---------------------------------------------------------------------------
  178-  193 (27.28/18.22)	TSSIDTTVTIWDIERG
  197-  212 (27.66/18.57)	TQLIAHDREVYDIAWG
  219-  231 (14.17/ 6.06)	TVAADGSVRVFDL...
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     109.88|      30|      42|     240|     269|       3
---------------------------------------------------------------------------
  240-  269 (56.11/31.61)	IYEAPRPDTPLLRLSWNKQDPRYMATLPGE
  285-  314 (53.77/30.02)	IAELSRHQGYVNALSWAPHSSCHICTVGDD
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP20733 with Med30 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) GQQQQAQQQQQQQQQQQQPSGAAPANGPRGAAAAAAGGG
2) MQQQQDDEPMGDGPPPAAAADGAAAAGSGGGSGGGGGGGGRSDRR
326
1
364
45

Molecular Recognition Features

MoRF SequenceStartStop
1) PPPAAAADGAAA
14
25