<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP12714

Description pre-mRNA-processing protein 40C
SequenceMSSSQELQSSASGITAQASGLGQATGPSNPTVASPAPVSGPSNPKGPSGTTNEPAQESIRAKFITGPGYVVPAPSFSYSVIPKQNTASGSSLENSSSPALVSNQPASATAFQPSIPGQSLSSGPTFSYNIIPPAKIGSSAQQKLQSSTDVGSGPLGHSQVGNSTPSTTAASLQPPVPGQPGHPNTFGPGTGAQFMASQGPSPVSVPKGAPSIATSFSFNRIPQLAQKDLSSNSSASVAVAREAGTVSPASSSSVPVSMPFHVSPSSLAAATSPNLCPATLWMPVAPSFVPPPGMPITPGTPGPPGIAPSTPLSSTVTVNSEAMDSSSSTSLRPVVPSTVQQQMHSPYPALPSMPPPPQGLWLPPQIGGLQRPPFLPYPGVLPGSYPLPMRGMPLPSVPVPDSQPPGISPLGPPGGTPSSSVGSVHLPSNTTGKQPDLPPPGTDQHKHIDDLADKVGATVNAKVDAWTAHKTETGVVYYYNALTGESTYERPSEFHGEPDKVTVQPTPVSCEKLVGTDWALVTTNDGKKYYYNSKTKISSWQVPMEVTELRRKYDDDALKGNMTLVQNSVAFSEKLSAPISVTAPAINTGGREATSLRPSGVAGSSSALDLIKKKLQDSIAPATSSPLPTSSGPTTADLNGSRPVEAAVKGLQSENKDKVKDINGDGNISDSSSDSEDEDSGPSKEECIIQFKEMLKERGVAPFSKWEKELPKIVFDPRFKAVPGYSARRALFEHYVRTRAEEERKEKRAAQKAAIEGFKQLLEEASEDIDQRTDYQTFKMKWGSDPRFEALDRKERELLLNERVLPLKKAAEEKAQAIRAAAASGFKSLLREKGDINTSSRWSRVKDSLRSDPRYKSVKHEDRELLFNEYISELKAADEEAEREAKVKREEEDKLKEREREMRKRKEREEQEMERVRLKVQRKEAVACYQALLVETIKDPQVSWTESRPRLEKDPQGRATNSVLDSGDAEKLFREHVKILYERCAREFRTLLCEVITTEAASQMTNDGKTVLTSWSTAKRLLKTDPRYSKMPRKEREALWRRHAEEILWKKKLVSDPKEEKLNIETKARSSLDSGRSPTGLRRSHSRR
Length1088
PositionUnknown
OrganismNelumbo nucifera (Sacred lotus)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> Proteales> Nelumbonaceae> Nelumbo.
Aromaticity0.05
Grand average of hydropathy-0.644
Instability index61.66
Isoelectric point8.91
Molecular weight117451.51
Publications
PubMed=23663246

Function

Annotated function
GO - Cellular Component
nucleus	GO:0005634	IBA:GO_Central
GO - Biological Function
RNA polymerase binding	GO:0070063	IBA:GO_Central
transcription coregulator activity	GO:0003712	IBA:GO_Central
GO - Biological Process

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP12714
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             4|     484.14|     105|     106|      95|     200|       1
---------------------------------------------------------------------------
    2-   67 (50.91/ 9.71)	.......................................................................SSSQELQSSA...SGI.......TAQaSGLG..QATgPSN....PTVAS.........PA..PVSG...PSNPK..GPsGT.TnepaqesirAKFIT..GP
   96-  199 (187.42/58.27)	SS...................PALVS...NQPASATAF.Q....PSIPGQSLSSGPTFSYN.IIPPAKIGSSAQQKLQSSTDVGSGP.......LGH.SQVG..NST.PST....TAASL.........QP..PVPG..QPGHPNTFGP.GT.G.........AQFMASQG.
  200-  311 (122.79/33.42)	.....................PSPVSvpkGAPSIATSF.SfnriPQLAQKDLSSNSSASVA.VAREAGTVSPAS...SSSVPV.SMPfhvspssLAA.ATSP..NLC.PATlwmpVAPSF.........VP..P......PGMPIT..P.GTpG.........PPGIAPSTP
  327-  461 (123.01/36.01)	SStslrpvvpstvqqqmhspyPALPS...MPPPPQGLWlP....PQIGG..LQRPPFLPYPgVLPGSYPLPMRGMPLPSVPVPDSQP.......PGI.SPLGppGGT.PSS....SVGSVhlpsnttgkQPdlPPPGtdQHKHIDDLAD.KV.G.........ATVNA....
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             6|     428.75|      66|      66|     751|     816|       2
---------------------------------------------------------------------------
  688-  746 (75.55/38.96)	....IIQFKEMLKE....RGVAPFS.KWEKELPKIVF.......DPRFKA....VpGYSARRALF...EHYV.RT...RAEEERKE
  751-  816 (107.79/58.61)	QKAAIEGFKQLLEE..ASEDIDQRT.DYQTFKMKWGS.......DPRFEA....L.DRKERELLL...NERV.LPL.KKAAEEKAQ
  819-  882 (96.59/51.78)	RAAAASGFKSLLRE...KGDINTSS.RWSRVKDSLRS.......DPRYKS....V.KHEDRELLF...NEYI.SEL..KAADEEAE
  887-  925 (34.22/13.76)	VKREEE...DKLKE..REREMRK..................................RKEREEQE...MERVrLKVqRKEA.....
  926-  980 (51.47/24.27)	....VACYQALLVE..TIK..DPQV.SWTESRPRLEK.......DPQGRAtnsvL.DSGDAEKLF...REHV.KIL..........
  982- 1052 (63.13/31.38)	ERCARE.FRTLLCEviTTEAASQMTnDGKTVLTSWSTakrllktDPRYSK....M.PRKEREALWrrhAEEI.LWK.KK.......
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      68.38|      24|     262|     312|     337|       3
---------------------------------------------------------------------------
  312-  337 (36.74/20.21)	LSSTVTVNSEAMDSS..ssTSLRP..VVPS
  575-  604 (31.64/16.34)	LSAPISVTAPAINTGgreaTSLRPsgVAGS
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|     125.87|      28|      49|     466|     493|       4
---------------------------------------------------------------------------
  466-  493 (51.85/31.00)	WTAHKTETGVVYYYNALTGESTYERPSE
  518-  545 (50.03/29.68)	WALVTTNDGKKYYYNSKTKISSWQVPME
  547-  568 (23.99/10.79)	.....TELRRKYDDDALKGNMTLVQNS.
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP12714 with Med35 domain of Kingdom Viridiplantae

Unable to open file!