<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP20441

Description Transcription factor IIS
SequenceMRIKEIVNNSPDESVSDLCDSLRKLQLMTISVETLKGTEIGKSVNVLRKHGSKDVRQIARSLIEMWKDMVDEWVNATQKMAGSEATPESMNPSVLDEEEGLPSPPLDDLAFLNPHSMSLELSEKFFDGMDDYGNPRKSSELNRNRSNGEKPPVEKQNITKWKQQKDTTVKPCKPLVADHTPKRVMKPNMDRKLQNHEKPVVPKRQVAQQQRKPTSSDGETVQDKLEATKRKLQERYQQAENAKRQRTIQVMELHDLPKQSIIAPKNQQMRPGNNHNRNWANGRF
Length284
PositionUnknown
OrganismArtemisia annua (Sweet wormwood)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> asterids> campanulids> Asterales> Asteraceae> Asteroideae> Anthemideae> Artemisiinae> Artemisia.
Aromaticity0.04
Grand average of hydropathy-1.075
Instability index48.73
Isoelectric point9.34
Molecular weight32580.63
Publications
PubMed=29703587

Function

Annotated function
GO - Cellular Component
nucleus	GO:0005634	IEA:UniProtKB-SubCell
GO - Biological Function
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP20441
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|     110.01|      31|      45|     135|     165|       1
---------------------------------------------------------------------------
  149-  196 (39.07/18.14)	EKPPVEKQNITKWKQQK.......dttvkpckplvadhtP....KRvmKPNMD..RKLQNH
  197-  249 (28.47/11.49)	EKPVVPKRQVA..QQQRkptssdgetvqdkleatkrklqE....RY..QQAENakRQRTIQ
  252-  282 (42.47/20.26)	ELHDLPKQSIIAPK..........................nqqmRP..GNNHN..RNWANG
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      46.39|      15|      19|     106|     121|       2
---------------------------------------------------------------------------
  106-  121 (22.48/16.05)	LDDL.AFLNPHSMSlEL
  126-  141 (23.91/12.86)	FDGMdDYGNPRKSS.EL
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      55.94|      17|     204|      10|      26|       3
---------------------------------------------------------------------------
   10-   26 (30.94/21.53)	SPDESVSD.LCDSLRKLQ
  216-  233 (25.00/16.16)	SDGETVQDkLEATKRKLQ
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP20441 with Med26 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) ATQKMAGSEATPESMNPSVLDEEEGLPSPPLDDLAFLNP
2) SEKFFDGMDDYGNPRKSSELNRNRSNGEKPPVEKQNITKWKQQKDTTVKPCKPLVADHTPKRVMKPNMDRKLQNHEKPVVPKRQVAQQQRKPTSSDGETVQDKLEATKRKLQERYQQAENAKRQRTIQVMELHDLPKQSIIAPKNQQMRPGNNHNRNWANGRF
76
122
114
284

Molecular Recognition Features

MoRF SequenceStartStop
1) LSEKFF
121
126