<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP02246

Description Pre-mRNA 3' end processing protein WDR33
SequenceMTAIAQPIRFSNTGAVGMAASTSGTSSNASNAQTGATTHLQVLSQQQFQPYYLRPAYRFKVNAQPSMSDIMADGPGRRLRKNVANVRRHIDYIANVLNHIEARVWQYGVRDRVALQPDVLYQNCALPPSSTLDKPVDCVLTKFVRAAMNKVKCPIYSICWTPEGKRLITGASTGEFTLWNGTAFNFETILQAHDSAIRALKWSHNDQWLVSADHDGFVKYWQPNMNNVHMYQAHKDEPIRSLSFAPTDVKLVTGSDDGTARVWDFARCAEERIFRGHGSDVRSVDWHPQKGLICTGSRDSQQPVKLWDPKSGQCLATLHDHKNSVMAVQWNKNGNWLLTGSRDHLIKVYDIRMMREMHSYRGHKKEVTALAWHPVHESLFVSGGGDGSMAYWLVNSEKELSFLEHAHDQAIWTLEWHPLGHILASGSNLGFPSAHDSAIRALKWSHNDQWLVSADHDGFVKYWQPNMNNVHMYQAHKDEPIRSLSFAPTDVKLVTGSDDGTARVWDFARCAEERIFRGHGSDVRSVDWHPQKGLICTGSRDSQQPVKLWDPKSGQCLATLHDHKNSVMAVQWNKNGNWLLTGSRDHLIKVYDIRMMREMHSYRGHKKEVTALAWHPVHESLFVSGGGDGSMAYWLVNSEKELSFLEHAHDQAIWTLEWHPLGHILASGSNDNNTKFWARNRPGDTQDDIFGLSSSSGIPMHMMAKESRTIDADTEASAVAIIPGMGLDDDIYQGMQKRESHTSGASTNSGPLAGPLLFPDDPNARQQAANIGAKKTLIKQPPPKKAQRQFERMWNVAKPAGGGGGEELEEEMPDEGSFRRPKTSLLGPPPPSMLGASKPSEPGRSYTPPQPATSPVAKRPESPPAHGSQSGVGAPPHGSQAPLIAQVGQQQLPLLNQSIVAMSGAGAALPLPPQIAAQWATPGGPNVAPKTTLIQQLNPPASGTQIRPSPIIAQELMPPSLSQQMQIGVPQMQQIASLPFPQPPATGPLLWPPPIVAVPVPQNLDMDYRMGSAPPLMMPPPMAPQTTAGTSADVDLRSEQQPSSSATPLLSQSWRTPPSTQMSPRSPDSALSPGGTRMHDPRMRRDGSAQGQGEARPQRNDMDRDQHGLAMSAPMRDPRRRPQTVSQLPAKQDSDYDGRYEGQGSKTTKRAWMPDMGAGEKSSYGGAGGSGLRSGQSPPTERSPHIFQRETNGAGSADYSSYEDRPHTHAPYQQYTGGGPMRRSGPLLDRGPSLSQQMQIGVPQMQQIASLPFPQPPATGPLLWPPPIVAVPVPQNLDMDYRMGSAPPLMMPPPMAPQTTAGTSADVDLRSEQQPSSSATPLLSQSWRTPPSTQMSPRSPDSALSPGGTRMHDPRMRRDGSAQGQGEARPQRNDMDRDQHGLAMSAPMRDPRRRPQTVSQLPAKQDSDYDGRYEGQGSKTTKRAWMPDMGAGEKSSYGGAGGSGLRSGQSPPTERSPHIFQRETNGAGSADYSSYEDRPHTHAPYQQYTGGGPMRRSGPLLDRGRGGRGRVRRGGY
Length1516
PositionTail
OrganismToxocara canis (Canine roundworm)
KingdomMetazoa
LineageEukaryota> Metazoa> Ecdysozoa> Nematoda> Chromadorea> Rhabditida> Spirurina> Ascaridomorpha> Ascaridoidea> Toxocaridae> Toxocara.
Aromaticity0.07
Grand average of hydropathy-0.703
Instability index58.97
Isoelectric point8.99
Molecular weight166164.59
Publications

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP02246
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
            14|     674.91|      46|      47|    1018|    1063|       1
---------------------------------------------------------------------------
  781-  827 (34.26/ 7.56)	.PPPKK....AQrqfermwnvaKP....A......GGG.GGEELE.E..EMPDEG........SFR..R...PKT..SLLG
  828-  871 (46.94/13.08)	PPPPSM....LG..........AS....K......PSE.PG...RSY..TPPQPATS.P.VAKRPE..S...PPAhgSQSG
  874-  903 (35.77/ 8.21)	APPH...............................GSQ.APLIAQVG..QQ....QL.PLLNQSIV..A..........MS
  911-  947 (39.18/ 9.70)	LPPQIA....AQ..........WA....T......PGG.PNV........APK...T.TLIQQ..L..N...PPAsgTQIR
  957-  999 (53.68/16.02)	MPPSLS....QQ..........MQ....I......GVP.QMQQIASL..PFPQPPATgPLL...W...P...PPI..VAVP
 1018- 1063 (88.09/31.00)	MPPPMA....PQ..........TT....A......GTS.ADVDLRSE..QQPSSSAT.PLLSQSWR..T...PPS..TQMS
 1064- 1112 (50.37/14.57)	PRSPDS....AL..........SP....G......GTRmHDPRMRRDgsAQGQGEAR.PQRNDMDR..D...QHG..LAMS
 1116- 1152 (37.45/ 8.95)	RDPRRR....PQ..........TVsqlpA......KQD.SDYDGRYE..GQGSKTT.....KRAW................
 1154- 1200 (38.92/ 9.59)	..........PD..........MG....AgekssyGGA.GGSGLRSG..QSPPTERS.PHIFQ..R..EtngAGS..ADYS
 1203- 1249 (37.01/ 8.76)	EDRPHThapyQQ..........YT....G......GG.....PMR.R..SGPLLDRG.PSLSQQMQigV...PQM..QQIA
 1253- 1272 (37.25/ 8.86)	FPQP...................P....A.......TG....................PLL...W...P...PPI..VAVP
 1291- 1336 (88.09/31.00)	MPPPMA....PQ..........TT....A......GTS.ADVDLRSE..QQPSSSAT.PLLSQSWR..T...PPS..TQMS
 1352- 1385 (50.45/14.61)	HDPRMR....RD..........GS....A.....................QGQGEAR.PQRNDMDR..D...QHG..LAMS
 1389- 1425 (37.45/ 8.95)	RDPRRR....PQ..........TVsqlpA......KQD.SDYDGRYE..GQGSKTT.....KRAW................
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
            13|     692.65|      40|      41|     193|     232|       2
---------------------------------------------------------------------------
  146-  180 (39.77/19.41)	AAMNKVK...........cPIYSICWTP..EGKRLI...TGASTGEFT.LWN
  181-  222 (57.16/31.44)	GTAFNFEtILQ..aH.DS.AIRALKWSH..NDQWLV...SADHDGFVK.YWQ
  223-  264 (58.90/32.64)	PNMNNVH.MYQ..aHkDE.PIRSLSFAP..TDVKLV...TGSDDGTAR.VWD
  266-  308 (53.13/28.65)	ARCAEER.IFR..gH.GS.DVRSVDW.H..PQKGLIctgSRDSQQPVK.LWD
  309-  351 (53.07/28.61)	PKSGQCL.ATL.hdH.KN.SVMAVQWNK..NGNWLL...TGSRDHLIKvYDI
  352-  393 (56.13/30.72)	RMMREMH.SYR..gH.KK.EVTALAW.HpvHESLFV...SGGGDGSMA.YWL
  394-  431 (45.30/23.24)	VNSEKEL.SFLehaH.DQ.AIWTLEWHP..LGHILA...SGSNLGF......
  432-  464 (49.20/25.93)	.........PS..aH.DS.AIRALKWSH..NDQWLV...SADHDGFVK.YWQ
  465-  506 (58.90/32.64)	PNMNNVH.MYQ..aHkDE.PIRSLSFAP..TDVKLV...TGSDDGTAR.VWD
  508-  550 (53.13/28.65)	ARCAEER.IFR..gH.GS.DVRSVDW.H..PQKGLIctgSRDSQQPVK.LWD
  551-  593 (53.07/28.61)	PKSGQCL.ATL.hdH.KN.SVMAVQWNK..NGNWLL...TGSRDHLIKvYDI
  594-  635 (56.13/30.72)	RMMREMH.SYR..gH.KK.EVTALAW.HpvHESLFV...SGGGDGSMA.YWL
  636-  678 (58.77/32.55)	VNSEKEL.SFLehaH.DQ.AIWTLEWHP..LGHILA...SGSNDNNTK.FWA
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      84.67|      18|     271|    1000|    1017|       3
---------------------------------------------------------------------------
 1000- 1017 (42.34/26.37)	VPQNLDMDYRMGSAPPLM
 1273- 1290 (42.34/26.37)	VPQNLDMDYRMGSAPPLM
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP02246 with Med16 domain of Kingdom Metazoa

Unable to open file!