<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP27143

Description AGAP013434-PA
SequenceMGAEGASPDINYYQHNADEDEDEDEDGYRDPTSLLEGCRYEDVSGMQLNGEDGTGYHYAQQGQQQQQQQQQTLSQAQIEFLKSMVGFRKKWTPYELRTCNDIRRIIGSREYESLRRKDGMPLPSVKTLRKYRQAAQLPDTVDVMGRKRKHNELEDSPTNQSRAVSRHSAGGESGSADHGENGSGVLFPFGNDIDKEQLKLEQDPDGGHFDVSLLTIPQINFLQTLTMNTKGFSDYELQRSLELKNELGSGNYETLRKQEFLNIPTLAMLTRYEQDNGLEDLSRLVNGRRTAGGSGGSDRKRRKEAEAALELARDQEEQRNREIEEQRAREQQQQQEQEQEQQRQLELYNRLTQADPSRHPLFDSIAGDASPTDGADPAAAAAAIAAAMAAAAAAGLHPSQLEHLNGGVGGGGGTVGGEDSSLSMFEHHSIFDKDTPMFDMTKLSPAHVEFLNTLPENPKRWTQEMINSALDIKNEIGTRDYEILRRQGIPLPAARTLRRHKDIRNKADRAMSETGEILPASGPVTTPAGVVGGSG
Length535
PositionTail
OrganismAnopheles gambiae (African malaria mosquito)
KingdomMetazoa
LineageEukaryota> Metazoa> Ecdysozoa> Arthropoda> Hexapoda> Insecta> Pterygota> Neoptera> Endopterygota> Diptera> Nematocera> Culicoidea> Culicidae> Anophelinae> Anopheles.
Aromaticity0.05
Grand average of hydropathy-0.911
Instability index57.19
Isoelectric point5.08
Molecular weight59366.64
Publications
PubMed=12364791

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP27143
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     300.40|      67|     266|      12|      80|       1
---------------------------------------------------------------------------
   12-   80 (115.58/62.29)	YYQHNADED..............EDEDEDGYRDPTSLLEGCR................YEDVSGMQLNGEDGTG..YHYAQQGQQQQQQQQQTLSqaQIEF
  149-  199 (56.89/24.42)	..KHN..EL..............EDSPTNQSRAVSRHSAGGE....................SGSADHGENGSGvlFPFGNDIDKEQLK............
  201-  262 (45.25/17.90)	........................EQDPDGGHFDVSLLTIPQinflqtltmntkgfsdYELQRSLELKNELGSG...NY......ETLRKQEFLN......
  272-  347 (82.69/39.32)	YEQDNGLEDlsrlvngrrtaggsGGSDRKRRKEAEAALELAR................DQE...EQRNREIEEQ..RAREQQQQQEQEQEQQR....QLEL
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|     160.64|      44|     367|      89|     133|       2
---------------------------------------------------------------------------
   89-  133 (79.28/62.45)	KKWTPYELRTCNDIRRIIGSREYESLRRKdGMPLPSVKTLRKYRQ
  459-  502 (81.36/58.91)	KRWTQEMINSALDIKNEIGTRDYEILRRQ.GIPLPAARTLRRHKD
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP27143 with Med29 domain of Kingdom Metazoa

Intrinsically Disordered Regions

IDR SequenceStartStop
1) AAAAGLHPSQLEHLNGGVGGGGGTVGGEDSSLSMFEHH
2) ARTLRRHKDIRNKADRAMSETGEILPASGPVTTPAGVVGGSG
3) GLEDLSRLVNGRRTAGGSGGSDRKRRKEAEAALELARDQEEQRNREIEEQRAREQQQQQEQEQEQQRQLELYNRLTQADPSRHPLFDSIAGDASPTDGADPAAAAAAI
4) MGAEGASPDINYYQHNADEDEDEDEDGYRDPTSLLE
5) QLPDTVDVMGRKRKHNELEDSPTNQSRAVSRHSAGGESGSADHGENGSGVLF
391
494
277
1
136
428
535
384
36
187

Molecular Recognition Features

MoRF SequenceStartStop
1) AIAAAMA
2) MGAEGASPDINYYQHNADE
3) TGYHYAQQ
383
1
54
389
19
61