<p>This section provides information about the protein and gene name(s) and synonym(s) and about the organism that is the source of the protein sequence.<p><a href='/help/names_and_taxonomy_section' target='_top'>More...</a></p>Detailed information on MDP23718

Description Pre-mRNA-processing protein 40B isoform A
SequenceMANNPQYPGLQPLRPPIAGSLDPPRNFVPPMPVQFRPVVPTQQSQQFISMPSQHYQHQPVGPGGVPLIGVGMPPQNQRSQFSQPIQQLPPRPSPQLPPPSQAIPMPVARPNMHIPSESMMHQPDSQVHSQAPNGYTPGLGGPAMPLSASYTFAPSAYGQVQTNFSSTGQFQPVPQIHALTGSSSQSITTGATLQSNGGQPSVTTVMPSATIAQPQLAKNGPTDWIEHTSATGRTFYYNKKTKVSSWEKPFELMTPIERVDATTNWKEYTSPDGRKYYYNKITNESKWSVPEELKLARELVEKAIVSGARPEALLNSHPQPSPTPSAIEATPNADNSSLPSQGEPSSPVSVSPVVTTSISNLQSEMPSGPSPSPADAITGTKVDELEAPLNTVTPSDTSVGSDKAIVTDINTAVTPMNDVDNDSAQATLGSADGVSAEDKEDGKNDSIGEKSNDEAAETKAVEPEPPVYANKMEAKDAFKALLESVNVGSDWTWDRSMRLIINDKRYGALKTLGERKQAFNEYLNQRKKQEAEEKRMKQKKAREDFKKMLEESTDLTSSARWSKAVSIFENDERFKAVERDRDRRDMFESFLEELLNKERAKVQEERKRNIMEYKKFLESCDFIKASTQWRKVQDRLEADERCSRLEKIDRLEIFQDYLHDLEKEEEEQKKIQKEELRKTERKNREEFRKLMEEHIASGILTAKTHWRDYYTKVKDLHAYVAVASNTSGSTPKDLFEDVAEELEKQYHEEKSRIKDTVKLAKITLSSTWAFEDFKSALSKAISTPPISDFNLKLVFDELLERAKEKEEKEAKKRKRLSDDFFHLLHSTKDITVSLKWEDCRPHVEDSQEFRSIGDESLCKEVFEEYIAQLKEEAKESERKRKEERAKKEKDREERERRKGKQRKEKEGGRERGKDEAHKKDKADSDSMELTEIQTSKENKRSEDDNRKQRKKLQSPEHEMDKGKTKKSHGHGSDRKKSRRHSSGHESDEGRHKRHKRDHCREGDLEDGEFGDDHVDRW
Length1017
PositionUnknown
OrganismGlycine soja (Wild soybean)
KingdomViridiplantae
LineageEukaryota> Viridiplantae> Streptophyta> Embryophyta> Tracheophyta> Spermatophyta> Magnoliopsida> eudicotyledons> Gunneridae> Pentapetalae> rosids> fabids> Fabales> Fabaceae> Papilionoideae> 50 kb inversion clade> NPAAA clade> indigoferoid/millettioid clade> Phaseoleae> Glycine> Glycine subgen. Soja.
Aromaticity0.06
Grand average of hydropathy-1.071
Instability index58.19
Isoelectric point6.30
Molecular weight115465.38
Publications

Function

Annotated function
GO - Cellular Component
GO - Biological Function
GO - Biological Process
mRNA cis splicing, via spliceosome	GO:0045292	IEA:InterPro

Interaction

Binary Interactions

Repeat regions

Repeats

>MDP23718
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             7|     508.36|      66|      66|     538|     603|       1
---------------------------------------------------------------------------
  473-  535 (71.23/45.57)	.........EAK.DAFKALL.E...SVNVGSDWTW.........DRSMRLII...NDKRYGALKTLGE...RKQAFNEYLNQRKKQE...AEEKR
  538-  603 (102.30/69.30)	Q.......KKAR.EDFKKMLEE...STDLTSSARW.........SKAVSIFE...NDERFKAVERDRD...RRDMFESFLEELLNKE...RAKVQ
  606-  672 (81.48/53.40)	R.......KRNI.MEYKKFLES...CDFIKASTQW.........RKVQDRLE...ADERCSRLEK.ID...RLEIFQDYLHD.LEKEeeeQKKIQ
  673-  747 (65.64/41.30)	KeelrkteRKNR.EEFRKLMEEhiaSGILTAKTHW.........RDYYTKVK...DLHAYVAVASNTSgstPKDLFEDVAEEL...E...K.QYH
  748-  805 (45.05/25.58)	E.......EKSRiKDTVK.L......AKITLSSTWafedfksalSKAISTPP..iSDFNLKLV............FD....ELL..E...RAKEK
  813-  874 (79.33/51.76)	R.......KRLS.DDFFHLLHS...TKDITVSLKW.........EDCRPHVE...DSQEFRSIGDESL...CKEVFEEYIAQL..KE...EAK..
  901-  965 (63.33/39.54)	Q.......RKEK.EGGRERGKD...EAHKKDKAD..........SDSMELTEiqtSKENKRSEDDNRK...QRKKLQS.PEHEMDKG...KTK..
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             5|     264.39|      44|      48|       6|      53|       2
---------------------------------------------------------------------------
    6-   53 (81.39/53.15)	QYPGLQPLrppiAGSLDPPRNFVPPMPVQ....F.RPVVPTQQSQ...QFISMP.SQ
   60-   94 (59.28/30.07)	VGPGGVPL....IGVGMPPQNQRS....Q....FsQPI......Q...QLPPRP.SP
   95-  126 (53.76/26.58)	Q...L...........PPPSQAI.PMPVA......RPNMHIP.SE...SMMHQPdSQ
  135-  164 (32.78/13.30)	YTPGLGG................PAMPLSasytF.APSAYGQVQT...NF.......
  339-  371 (37.19/15.08)	..................PSQGEPSSPVS....V.SPVVTTSISNlqsEMPSGP.SP
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             3|     183.98|      39|      39|     214|     252|       3
---------------------------------------------------------------------------
  174-  211 (43.13/25.94)	PQIhALTGSS...SQSITTGATLQSNgGQPSVTTVMPSATI
  214-  252 (73.03/49.53)	PQL.AKNGPTDWIEHTSATGRTFYYN.KKTKVSSWEKPFEL
  255-  293 (67.82/45.41)	PIE.RVDATTNWKEYTSPDGRKYYYN.KITNESKWSVPEEL
---------------------------------------------------------------------------
---------------------------------------------------------------------------
No. of Repeats|Total Score|Length  |Diagonal| BW-From|   BW-To|   Level
             2|      49.57|      17|      18|     390|     407|       4
---------------------------------------------------------------------------
  390-  407 (24.85/19.55)	NT.VTPSDtSVGSDKAIVT
  410-  427 (24.73/14.35)	NTaVTPMN.DVDNDSAQAT
---------------------------------------------------------------------------




Explaination for Stockholm format The "Stockholm" format is a system for marking up features in a multiple alignment. These mark-up annotations are preceded by a 'magic' label, of which there are four types. The Stockholm format is used by HMMER, Pfam, and Belvu. Mark-up lines include any characters except whitespace. Underscore ("_") is used instead of space.

#=GR (seqname) PP (Generic per-Sequence AND per-Column markup, exactly 1 char per column) where PP is Posterior Probability [0-9*], (0=0.00-0.05; 1=0.05-0.15; *=0.95-1.00)

#=GC PP_cons line is Stockholm-format consensus posterior probability annotation for the entire column. It’s calculated simply as the arithmetic mean of the per-residue posterior probabilities in that column. This should prove useful in phylogenetic inference applications, for example, where it’s common to mask away non confidently aligned columns of a multiple alignment. The PP_cons line provides an objective measure of the confidence assigned to each column.

#=GC RF line is Stockholm-format reference coordinate annotation, with an x marking each column that the profile considered to be consensus.

Alignment of MDP23718 with Med35 domain of Kingdom Viridiplantae

Intrinsically Disordered Regions

IDR SequenceStartStop
1) EAKESERKRKEERAKKEKDREERERRKGKQRKEKEGGRERGKDEAHKKDKADSDSMELTEIQTSKENKRSEDDNRKQRKKLQSPEHEMDKGKTKKSHGHGSDRKKSRRHSSGHESDEGRHKRHKRDHCREGDLEDGEFGDDHVDRW
2) MANNPQYPGLQPLRPPIAGSLDPPRNFVPPMPVQFRPVVPTQQSQQFISMPSQHYQHQPVGPGGVPLIGVGMPPQNQRSQFSQPIQQLPPRPSPQLPPPSQAIPMPVARPNMHIPSESMMHQPDSQVHSQAPNGYTPGLGGPAMPLS
3) QTNFSSTGQFQPVPQIHALTGSSSQSITTGATLQSNGGQPSVTTVMPSATIAQPQLAKNGPTDW
4) SGARPEALLNSHPQPSPTPSAIEATPNADNSSLPSQGEPSSPVSVSPVVTTSISNLQSEMPSGPSPSPADAITGTKVDELEAPLNTVTPSDTSVGSDKAIVTDINTAVTPMNDVDNDSAQATLGSADGVSAEDKEDGKNDSIGEKSNDEAAETKAVEPEPPVYANKME
872
1
161
306
1017
147
224
473

Molecular Recognition Features

MoRF SequenceStartStop
NANANA