Reference
Cytochrome P450 HomepageNelson, DR (2009) The Cytochrome P450 Homepage. Human Genomics 4, 59-65.AlphafoldJohn Jumper, Richard Evans, Alexander Pritzel, et al. - Highly accurate protein structure prediction with AlphaFold. Nature, 1–11, doi: 10.1038/s41586-021-03819-2 (2021)ColabfoldMirdita M, Ovchinnikov S and Steinegger M. ColabFold - Making protein folding accessible to all. bioRxiv, doi: 10.1101/2021.08.15.456425 (2021)Localcolabfoldhttps://github.com/YoshitakaMo/localcolabfoldAutoDock VinaO. Trott, A. J. Olson, AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization and multithreading, Journal of Computational Chemistry 31 (2010) 455-461Reference
D. R. Nelson's
D. R. Nelson's
https://twitter.com/P450nelson/status/1501396401317883904
Method
Localcolabfold (model1_ptm, recycle=6 or CaRMSD < 0.4, centerd and rotated around proximal Cys)runner_P450align.py -f CYPs/CYPXXXA1.fasta -nm 5 -mr 6 -alirunner_P450align.py -f CYPs/CYPXXXA1.fasta -nm 5 -mr 6 -ali -upMaestro (Protein preparation Wizard)proPKa pH 7.0, save as pdbHeme was extract from proximal-ligand-aligned crystal structure of CYP101A1-CYP150A1Heme was added hydrogen using maestro builder, save as mol2ES_scoreMaestro output Method
CYP101
CYP101
A1 Pseudomonas putidaB1 Novosphingobium aromaticivoransC1 Novosphingobium aromaticivoransD1 Novosphingobium aromaticivoransD2 Novosphingobium aromaticivoransD3 Sphingomonas sp. SKA58
PDB
Best ResolutionCYP101A1 P450cam 2ZWU 1.3ÅCYP101As P450tcu 6WPL 2.1ÅCYP101C1 3OFT 1.9ÅCYP101D1 4C9L 1.8ÅCYP101D2 3NV6 2.2ÅCYP101J2 5KYO 1.8ÅPDB
CYP101_Consensus5
>Consensus/1-597 Percentage Identity Consensus MNRLKCALAIPMAQFCKPAPGDRDEAALGIAHDAGAHESMPGFMASVESIKSNRYIQMVESGRWMRMADMKMMMTTATMMAAAELAPVPAHVPPADLVVDFDIYDPPGAAPGDEEQDVHAAWAALQPVVYSPAPDIVWTPRNLGFNGGHWIATRGEDIREVLRDPERFSSRPINLPPELGEVGDRTHQPKLIPLELDPPEHTPYRALLNPLFSPKAVAALEQPDIRELANELIDGFAPRGGDECEFVSEFAEPLPIRVFLALVGLPLEDRPRLLEWADQMTRPDGGEEREMAMTEEEAAEARAEAAQAIFDYLGPIIAERRANPGGDDLLSRLDSLQDDRDDLISAIVNAEVDGEDGTSELKEQGAGRPLTDEEALGMCFLLLIGGLDTVASSLGFIARHLARNPEHRRQLIDDPRLDDAEDPELIPAAVEELLRRFGLVNTARGRVVTRDVEYGGGVQLKAGDMVLLPTCLAGLDPREFDPDPLEVDFDRKPSADESGPNRHLTFGAGPHRCLGSHLARMELRVALEEWLRRIPDFRLKPGAPPPVHSGIVVRGVERLPLVWDVATTKAVASSKQAASASAAGAPALAHPHGMAERCYP101_Consensus5
CYP101_Consensus5_trim (A1_55.04%)
>Consensus/1-597 Percentage Identity Consensus MAAAELAPVPAHVPPADLVVDFDIYDPPGDEEQDVHAAWAALQPVVAPDIVWTPRNGGHWIATRGEDIREVLRDPERFSSRPINLPPELGEVGDLIPLELDPPEHTPYRALLNPLFSPKAVAALEPDIRELANELIDGFAPRGECEFVSEFAEPLPIRVFLALVGLPLEDRPRLLEWADQMTRPDGGEMTEEEAAQAIFDYLGPIIAERRANPGGDLISAIVNAEVDGRPLTDEEALGMCFLLLIGGLDTVASSLGFIARHLARNPEHRRQLIEDPELIPAAVEELLRRFGLVNTARVVTRDVEYGGVQLKAGDMVLLPTCLAGLDPREFDDPLEVDFDRKPNRHLTFGAGPHRCLGSHLARMELRVALEEWLRRIPDFRLKPGAPPVHSGIVVGVERLPLVWDVATCYP101_Consensus5_trim (A1_55.04%)
CYP101α_Consensus
>Consensus1_100%-55%/1-471 Percentage Identity Consensus MVREAAHLHRHDLGKHRTPYRAVRCVPVIAVNQMTRMVPTAHAGRFEERRQLMSVAAHMATPELAPVPAHVPPELVVDFDIYNPPGAEDDYHLALKTLHAGGIPDIVWTPRNGGHWIATRGDDIYAIQKDYENFSSRQLTVPLVEETHQPLPPIFFDPPEHTAYRALIAPAFLPQAIAKLEEKARELAIELIEGFYPRGECEFVSEFAQHLPIGIFMRMVALPAEDREYLLELADKMVRPNDAERKLSALGAIFGYLGVKIAERRANPGDDLISLIVNAKIDGRPLTDEEMFGLCGLVLIGGLDTVASAMGFIARFLAESPAHRRQLIDRLDNPELTQKAVDELLRRFPVVNQGRRITHDFEYKGVQMKAGDMIQLPTTLHGLDERKFDDPLAVDFTRPTPVIHSTFGNGPHRCPGSFLARVEIKVFLQEWLKRIPDFRIKPGEKAGVHGGVVGTLYHLPLVWDVARQKAVCYP101α_Consensus
>Consensus1_trim/1-471 Percentage Identity Consensus MSVAAHMATPELAPVPAHVPPELVVDFDIYNPPGAEDDYHLALKTLHAGGIPDIVWTPRNGGHWIATRGDDIYAIQKDYENFSSRQLTVPLVEHQPLPPIFFDPPEHTAYRALIAPAFLPQAIAKLEEKARELAIELIEGFYPRGECEFVSEFAQHLPIGIFMRMVALPAEDREYLLELADKMVRPNDAERKLSALGAIFGYLGVKIAERRANPGDDLISLIVNAKIDGRPLTDEEMFGLCGLVLIGGLDTVASAMGFIARFLAESPAHRRQLIDNPELTQKAVDELLRRFPVVNQGRRITHDFEYKGVQMKAGDMIQLPTTLHGLDERKFDDPLAVDFTRPTPIHSTFGNGPHRCPGSFLARVEIKVFLQEWLKRIPDFRIKPGEKAGVHGGVVGTLYHLPLVWDVAR
>Consensus2_trim/1-468 Percentage Identity Consensus MSVAAHMATPELWPVPAHVPPELVVDFDIYNPPGAEDDVHLALKTLHAGGIPDIVWTPRNGGHWIATRGDDIEAIQKDYENFSSRQVTVPLVEHTPLVPLELDPPEHTAYRALIAPAFLPQAIASLEEDARELAIELIEGFYPRGECEFVSEFAKHLPIGIFLRMVDLPLEDREYLLELADKMVRPVDAEERLESQGALFGYLGVWIAERRANPGDDLISLIVNAKIDGRPLTDEEMFGMCILVLFGGLDTVASAMGFIARFLAESPAHRRQLIDNPELTQKAVDELLRRFGVVNTARRITHDFEYKGVQMKKGDQIQLPTTLHGLDERKFDDPLAVDFTRKPVIHSTFGNGPHRCPGSFLARTEIKVFLQEWLKRIPDFRIKPGEKPRVHSGVVNGLLHLPLVWDVAR
>Consensus3_trim/1-476 Percentage Identity Consensus MSVAAHMATPELAPVPAHVPPELVVDFDIYNPPGAEDDVHLAWKRLHAGGIPDIVWTPRNGGHWIATRGEDIEAIQKDHERFSHRQVTVPLVEHTPLVPLELDPPEHTAYRALIAPAFLPQAIASLEEDARELAIELIEGFYPRGECEFVSEFAKHLPIVIFLRLVDLPLEDREYLLELAEKAVRPVDAEERLEAQGALFGYLGVWIAERRANPGDDLISLIVNAKIDGRPITDEEMFGMCILVLFGGLDTVASAMGFIARFLAENPAHRRQLIDNPELMQKAVDELLRRFGVVNTARRITHDFEYKGVQMKAGDQIQLPNTLHGLDERKFDDPLAVDFTRKPVIHAAFGNGPHRCPGSFLARTEIKVFLQEWLKRIPDFRIKPGEKPRVHSGVVNGMLHLPLVWDVAR
>Consensus4_trim/1-477 Percentage Identity Consensus MSVAAHMATPELAPVPAHVPPELVVDFDYYNPPGAEDDVHLAWKRLHAGGIPDIVWTPRNGGHWIATRGEDIEAIQKDHERFSHRQVTIPLVEHTPLVPLELDPPEHTAYRALIAPAFLPQAIASLEEDVRELAIELIEGFYPRGECEFVSEFAKHLPIVIFLRLVDLPLEDREYLLELAEKAVRPVDAEERLEAQGALFGYLGVWIAERRANPGDDLISLIVNAKIDGRPITDEEMFGLCILVLFGGLDTVASAMGFIARFLAENPAHRRQLIDNPELMQKAVDELLRRFGVVNTARLITHDFEYKGVQFKAGDQIQLPNTLHGLDERKFDDPLAVDFTRKPVIHAAFGNGPHRCPGSFLARTEIKVFLQEWLKRIPDFRIKPGEKPRVHSGVVNGMLHLPLVWDVA
>Consensus5_trim/1-477 Percentage Identity Consensus MSVAAHMATPELAPVPAHVPPELVVDFDYYNPPGAEDDVHLAWKRLHAGGIPDIVWTPRNGGHWIATRGEDIEAIQKDHERFSHRQVTIPLVEHTPLVPLELDPPEHTAYRALIAPAFLPQAIASLEEDVRELAIELIEGFYPRGECEFVSEFAKHLPIVIFLRLVDLPLEDREYLLELAEKAVRPVDAEERLEAQGALFGYLGVWIAERRANPGDDLISLIVNAKIDGRPITDEEMFGLCILVLFGGLDTVASAMGFIARFLAENPAHRRQLIDNPELMQKAVDELLRRFGVVNTARLITHDFEYKGVQFKAGDQIQLPNTLHGLDERKFDDPLAVDFTRKPVIHAAFGNGPHRCPGSFLARTEIKVFLQEWLKRIPDFRIKPGEKPRVHSGVVNGMLHLPLVWDVA
>cytochrome P450 [Paraburkholderia youngii]Sequence ID: NVH71867.1 Length: 409 Range 1: 1 to 408
Score:633 bits(1633), Expect:0.0, Method:Compositional matrix adjust., Identities:312/408(76%), Positives:344/408(84%), Gaps:0/408(0%)
Query 1 MSVAAHMATPELAPVPAHVPPELVVDFDYYNPPGAEDDVHLAWKRLHAGGIPDIVWTPRN 60 MSVAAH A PELAP+PAHV PE V D D YNPPGAEDD HL+ K+LHA GIPDI+WTPRNSbjct 1 MSVAAHGAAPELAPIPAHVSPERVFDIDIYNPPGAEDDYHLSLKQLHAEGIPDILWTPRN 60
Query 61 GGHWIATRGEDIEAIQKDHERFSHRQVTIPLVEHTPLVPLELDPPEHTAYRALIAPAFLP 120 GGHWIATRG+DI I KD+E FS +Q+T+PLVEH PL P+ DPP+HTAYRALIAPAF+PSbjct 61 GGHWIATRGDDIYHILKDYENFSSKQLTVPLVEHQPLPPIFFDPPQHTAYRALIAPAFMP 120
Query 121 QAIASLEEDVRELAIELIEGFYPRGECEFVSEFAKHLPIVIFLRLVDLPLEDREYLLELA 180 QAI LEE RE+AIELIEGFYP+GE EFVS FA+HLPI IF+ +V LP EDREYLL LASbjct 121 QAIGKLEEKAREMAIELIEGFYPKGEVEFVSGFAQHLPIGIFMNMVALPAEDREYLLGLA 180
Query 181 EKAVRPVDAEERLEAQGALFGYLGVWIAERRANPGDDLISLIVNAKIDGRPITDEEMFGL 240 +K VRPVDA E+L+A GA+F YLGV IAERRANPGDDLIS IV++KIDGR +TDEE+ GLSbjct 181 DKMVRPVDANEKLKALGAIFQYLGVKIAERRANPGDDLISKIVHSKIDGRALTDEEVRGL 240
Query 241 CILVLFGGLDTVASAMGFIARFLAENPAHRRQLIDNPELMQKAVDELLRRFGVVNTARLI 300 C LVL GGLDTVASAMGFIARFLA +PAHR+QLIDNPEL QKAVDELLRRF VVN R ISbjct 241 CGLVLIGGLDTVASAMGFIARFLAGSPAHRKQLIDNPELTQKAVDELLRRFPVVNQGRRI 300
Query 301 THDFEYKGVQFKAGDQIQLPNTLHGLDERKFDDPLAVDFTRKPVIHAAFGNGPHRCPGSF 360 THDFEYKGVQ KAGD I +P TLHGLDERKFDDPL VDFTR IH+ FGNG HRCPGSFSbjct 301 THDFEYKGVQMKAGDMIVMPTTLHGLDERKFDDPLTVDFTRPTPIHSTFGNGAHRCPGSF 360
Query 361 LARTEIKVFLQEWLKRIPDFRIKPGEKPRVHSGVVNGMLHLPLVWDVA 408 LAR E+KVFLQEWLKRIPDF+IKPGEK VH GV + HLPLVWDVASbjct 361 LARVELKVFLQEWLKRIPDFQIKPGEKAGVHGGVNGTLYHLPLVWDVA 408
Score:633 bits(1633), Expect:0.0, Method:Compositional matrix adjust., Identities:312/408(76%), Positives:344/408(84%), Gaps:0/408(0%)
Query 1 MSVAAHMATPELAPVPAHVPPELVVDFDYYNPPGAEDDVHLAWKRLHAGGIPDIVWTPRN 60 MSVAAH A PELAP+PAHV PE V D D YNPPGAEDD HL+ K+LHA GIPDI+WTPRNSbjct 1 MSVAAHGAAPELAPIPAHVSPERVFDIDIYNPPGAEDDYHLSLKQLHAEGIPDILWTPRN 60
Query 61 GGHWIATRGEDIEAIQKDHERFSHRQVTIPLVEHTPLVPLELDPPEHTAYRALIAPAFLP 120 GGHWIATRG+DI I KD+E FS +Q+T+PLVEH PL P+ DPP+HTAYRALIAPAF+PSbjct 61 GGHWIATRGDDIYHILKDYENFSSKQLTVPLVEHQPLPPIFFDPPQHTAYRALIAPAFMP 120
Query 121 QAIASLEEDVRELAIELIEGFYPRGECEFVSEFAKHLPIVIFLRLVDLPLEDREYLLELA 180 QAI LEE RE+AIELIEGFYP+GE EFVS FA+HLPI IF+ +V LP EDREYLL LASbjct 121 QAIGKLEEKAREMAIELIEGFYPKGEVEFVSGFAQHLPIGIFMNMVALPAEDREYLLGLA 180
Query 181 EKAVRPVDAEERLEAQGALFGYLGVWIAERRANPGDDLISLIVNAKIDGRPITDEEMFGL 240 +K VRPVDA E+L+A GA+F YLGV IAERRANPGDDLIS IV++KIDGR +TDEE+ GLSbjct 181 DKMVRPVDANEKLKALGAIFQYLGVKIAERRANPGDDLISKIVHSKIDGRALTDEEVRGL 240
Query 241 CILVLFGGLDTVASAMGFIARFLAENPAHRRQLIDNPELMQKAVDELLRRFGVVNTARLI 300 C LVL GGLDTVASAMGFIARFLA +PAHR+QLIDNPEL QKAVDELLRRF VVN R ISbjct 241 CGLVLIGGLDTVASAMGFIARFLAGSPAHRKQLIDNPELTQKAVDELLRRFPVVNQGRRI 300
Query 301 THDFEYKGVQFKAGDQIQLPNTLHGLDERKFDDPLAVDFTRKPVIHAAFGNGPHRCPGSF 360 THDFEYKGVQ KAGD I +P TLHGLDERKFDDPL VDFTR IH+ FGNG HRCPGSFSbjct 301 THDFEYKGVQMKAGDMIVMPTTLHGLDERKFDDPLTVDFTRPTPIHSTFGNGAHRCPGSF 360
Query 361 LARTEIKVFLQEWLKRIPDFRIKPGEKPRVHSGVVNGMLHLPLVWDVA 408 LAR E+KVFLQEWLKRIPDF+IKPGEK VH GV + HLPLVWDVASbjct 361 LARVELKVFLQEWLKRIPDFQIKPGEKAGVHGGVNGTLYHLPLVWDVA 408
Alphafold2
Alphafold2
CYP101A_Consensus
>Consensus1_A1_55%-100%/1-427 Percentage Identity Consensus MADMMTAPETISNAPLAPLPAHVPEHRVVDFDMYNPPNISQGGLQEAWKRLQDPGVPDLVWTPHNGGHWIATRGKLIRQMFEDHEHFSSECPFIPREAGEAYDFIPTSMDPPEQRPYRAILNAVVGPPVVDRLEDDIRELARELIEDIRLQGSCNFTEDYAEPFPIRIFLMLVDLPLSDAPHLKYIGDQMTRPDGSMSFAEARDALYEYLSPIIDARMEKPGDDAISTVVNGQVNGRPITKDEALKICGLLLLAGLDTVVNFLSFSMHFLATSPEHRKELVDDPERIPAATEELLRRFSLVADGRILKSDIEFDGVLLKAGDMVLLPQLLSGLDERENACPMHVDFGREKISHTTFGHGVHHCAGAHLARREIQTTLEEWLARIPDFEIAPGAEVGHQSGIVSSVTSLPLVWDTATTKAVPRRDYAACYP101A_Consensus
>Consensus1_trim/1-427 Percentage Identity Consensus MTAPETISNAPLAPLPAHVPEHRVVDFDMYNPPNISQGLQEAWKRLQDPGVPDLVWTPHNGGHWIATRGKLIRQMFEDHEHFSSECPFIPREAGEAYDFIPTSMDPPEQRPYRAILNAVVGPPVVDRLEDDIRELARELIEDIRLQGSCNFTEDYAEPFPIRIFLMLVDLPLSDAPHLKYIGDQMTRPDGSMSFAEARDALYEYLSPIIDARMEKPGDDAISTVVNGQVNGRPITKDEALKICGLLLLAGLDTVVNFLSFSMHFLATSPEHRKELVDDPERIPAATEELLRRFSLVADGRILKSDIEFDGVLLKAGDMVLLPQLLSGLDERENACPMHVDFGREKISHTTFGHGVHHCAGAHLARREIQTTLEEWLARIPDFEIAPGAEVGHQSGIVSSVTSLPLVWDTATTKAV
>Consensus2_trim/1-459 Percentage Identity Consensus MTAPETISSAPLAPLPANVPPERVVDFDMFNPPHISQGLQEAWKHLQDPGGPDLVWTPHNGGHWIATRGKLIKEMFEDHEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRAILNAAVGPPVVDRIEDEIRELARELIEDIRAQGSCDFTKEYAEPFPIRIFLMLVDLPSSDAPRLKYIADQMTRPDGSMTMAEATDLFFDYLSPFIDERRENPGDDAISAVVQSKVNGRPITKDEALRICGLLLLAGLDTVVNFLSFMMQFLAESPEHRKALVDDPELIPAATEELLRRFGLVADARIIKGDIEFDGVDLKDGDMVALPTMLFGLDERENACPMDVDFGREKISHTTFGHGVHHCAGAHLARREIQTTLEEWLARIPDFEIAPGAKVGHQSGIVGAVTSLPLVWDTA
>Consensus3_trim/1-459 Percentage Identity Consensus MTAPETITSAPLAPLPANVPPERVVDFDMFNPPHISQGLQEAWKHLQDPGGPDLVWTPHNGGHWIATRGKLIKEMFEDHEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRAILNAAVGPPVVDRIEDEIRELARELIEDIRAQGSCDFTKDYAEPFPIRIFLMLVDLPSSDAPRLKYIADQMTRPDGSMTMAEATDLFFDYLSPIIDERRENPGDDAISAVVQSKVNGRPITKDEALRICGLLLLAGLDTVVNFLSFMMQFLAESPEHRRALVDDPELIPAATEELLRRFGLVADARIIKGDIEFDGVDLKDGDMVALPTMLFGLDERENACPMDVDFGREKISHTTFGHGVHHCAGAHLARREIQTTLEEWLARIPDFEIAPGAKVGHQSGIVGAVTSLPLVWDTA
>Consensus4_trim/1-459 Percentage Identity Consensus MTAPETITSAPLAPLPANVPPERVVDFDMFNPPHISQGLQEAWKHLQDPGGPDLVWTPHNGGHWIATRGKLIKEMFEDHEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNAAVGPPVVDRIEDEIRELARELIEDIRAQGSCDFTKEYAEIFPIRIFLMLVDLPSSDAPRLKYIADQMTRPDGSMTMAEATDLFFDYLSPFIDERMENPGDDAISAVVQSKVNGRPITKDEALRICGLLLLAGLDTVVNFLSFMMQFLAENPEHRRALVDDPELIPAATEELLRRFGLVADARIIKGDIEFDGVDLKDGDMVALPTMLFGLDERENACPMDVDFGREKIEHTTFGHGVHHCAGAHLARVEIQTTLEEWLARIPDFEIAPGAKVGHQSGIVGAVTSLPLVWDTA
>Consensus5_trim=3_trim/1-459 Percentage Identity Consensus MTAPETITSAPLAPLPANVPPERVVDFDMFNPPHISQGLQEAWKHLQDPGGPDLVWTPHNGGHWIATRGKLIKEMFEDHEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRAILNAAVGPPVVDRIEDEIRELARELIEDIRAQGSCDFTKDYAEPFPIRIFLMLVDLPSSDAPRLKYIADQMTRPDGSMTMAEATDLFFDYLSPIIDERRENPGDDAISAVVQSKVNGRPITKDEALRICGLLLLAGLDTVVNFLSFMMQFLAESPEHRRALVDDPELIPAATEELLRRFGLVADARIIKGDIEFDGVDLKDGDMVALPTMLFGLDERENACPMDVDFGREKISHTTFGHGVHHCAGAHLARREIQTTLEEWLARIPDFEIAPGAKVGHQSGIVGAVTSLPLVWDTA
>cytochrome P450 [Rhodococcus opacus]Sequence ID: PQP22084.1 Length: 411 Range 1: 3 to 411
Score:674 bits(1738), Expect:0.0, Method:Compositional matrix adjust., Identities:331/409(81%), Positives:364/409(88%), Gaps:0/409(0%)
Query 2 TAPETITSaplaplpaNVPPERVVDFDMFNPPHISQGLQEAWKHLQDPGGPDLVWTPHNG 61 TAPETI+ + LP NVPPERVVDFDMFNPPHI QG QEAWKHLQDPG DLVWTPHNGSbjct 3 TAPETISPTTASDLPTNVPPERVVDFDMFNPPHIDQGAQEAWKHLQDPGMRDLVWTPHNG 62
Query 62 GHWIATRGKLIKEMFEDHEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNAAVG 121 GHWIATRG LI++MF D EHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNA+VGSbjct 63 GHWIATRGSLIQQMFTDFEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNASVG 122
Query 122 PPVVDRIEDEIRELARELIEDIRAQGSCDFTKEYAEIFPIRIFLMLVDLPSSDAPRLKYI 181 P +DRI D+IRE ARELI DIRA GSCDFTKEYAE+FPI+IFLM+VDLP SDAP+LKYISbjct 123 PHTIDRIGDDIRETARELIRDIRATGSCDFTKEYAEVFPIKIFLMIVDLPLSDAPKLKYI 182
Query 182 ADQMTRPDGSMTMAEATDLFFDYLSPFIDERMENPGDDAISAVVQSKVNGRPITKDEALR 241 DQMTRPDGSM+M EA +LFF+YLSPFID RME PGDDAIS VVQS+VNGRPITKDEAL+Sbjct 183 GDQMTRPDGSMSMGEAIELFFEYLSPFIDARMEKPGDDAISTVVQSQVNGRPITKDEALK 242
Query 242 ICgllllaglDTVVNFLSFMMQFLAENPEHRRALVDDPELIPAATEELLRRFGLVADARI 301 ICGLLLLAGLDTVVNFLSF M FLA +PE RRALV DP+LIP ATEELLRRFGLV+DAR+Sbjct 243 ICGLLLLAGLDTVVNFLSFTMHFLATHPEERRALVADPQLIPTATEELLRRFGLVSDARL 302
Query 302 IKGDIEFDGVDLKDGDMVALPTMLFGLDERENACPMDVDFGREKIEHTTFGHGVHHCAGA 361 IK DI DGVDLK GDMVALPTML+GLD+R+N CPMDVDFGRE+I+H TFGHGVHHCAGASbjct 303 IKKDITIDGVDLKSGDMVALPTMLYGLDDRKNKCPMDVDFGREEIDHITFGHGVHHCAGA 362
Query 362 HLARVEIQTTLEEWLARIPDFEIAPGAKVGHQSGIVGAVTSLPLVWDTA 410 HLARVEIQTTLEEWLA+IPDFE++ ++VG QSGIV ++TS+PL W T Sbjct 363 HLARVEIQTTLEEWLAQIPDFEVSDPSRVGFQSGIVSSMTSVPLRWSTG 411
Score:674 bits(1738), Expect:0.0, Method:Compositional matrix adjust., Identities:331/409(81%), Positives:364/409(88%), Gaps:0/409(0%)
Query 2 TAPETITSaplaplpaNVPPERVVDFDMFNPPHISQGLQEAWKHLQDPGGPDLVWTPHNG 61 TAPETI+ + LP NVPPERVVDFDMFNPPHI QG QEAWKHLQDPG DLVWTPHNGSbjct 3 TAPETISPTTASDLPTNVPPERVVDFDMFNPPHIDQGAQEAWKHLQDPGMRDLVWTPHNG 62
Query 62 GHWIATRGKLIKEMFEDHEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNAAVG 121 GHWIATRG LI++MF D EHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNA+VGSbjct 63 GHWIATRGSLIQQMFTDFEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNASVG 122
Query 122 PPVVDRIEDEIRELARELIEDIRAQGSCDFTKEYAEIFPIRIFLMLVDLPSSDAPRLKYI 181 P +DRI D+IRE ARELI DIRA GSCDFTKEYAE+FPI+IFLM+VDLP SDAP+LKYISbjct 123 PHTIDRIGDDIRETARELIRDIRATGSCDFTKEYAEVFPIKIFLMIVDLPLSDAPKLKYI 182
Query 182 ADQMTRPDGSMTMAEATDLFFDYLSPFIDERMENPGDDAISAVVQSKVNGRPITKDEALR 241 DQMTRPDGSM+M EA +LFF+YLSPFID RME PGDDAIS VVQS+VNGRPITKDEAL+Sbjct 183 GDQMTRPDGSMSMGEAIELFFEYLSPFIDARMEKPGDDAISTVVQSQVNGRPITKDEALK 242
Query 242 ICgllllaglDTVVNFLSFMMQFLAENPEHRRALVDDPELIPAATEELLRRFGLVADARI 301 ICGLLLLAGLDTVVNFLSF M FLA +PE RRALV DP+LIP ATEELLRRFGLV+DAR+Sbjct 243 ICGLLLLAGLDTVVNFLSFTMHFLATHPEERRALVADPQLIPTATEELLRRFGLVSDARL 302
Query 302 IKGDIEFDGVDLKDGDMVALPTMLFGLDERENACPMDVDFGREKIEHTTFGHGVHHCAGA 361 IK DI DGVDLK GDMVALPTML+GLD+R+N CPMDVDFGRE+I+H TFGHGVHHCAGASbjct 303 IKKDITIDGVDLKSGDMVALPTMLYGLDDRKNKCPMDVDFGREEIDHITFGHGVHHCAGA 362
Query 362 HLARVEIQTTLEEWLARIPDFEIAPGAKVGHQSGIVGAVTSLPLVWDTA 410 HLARVEIQTTLEEWLA+IPDFE++ ++VG QSGIV ++TS+PL W T Sbjct 363 HLARVEIQTTLEEWLAQIPDFEVSDPSRVGFQSGIVSSMTSVPLRWSTG 411
Alphafold2
Alphafold2
>CYP101A1---- 1234567890 1234567890 1234567890 1234567890 1234567890 ----0001 MTTETIQSNA NLAPLPPHVP EHLVFDFDMY NPSNLSAGVQ EAWAVLQESN 00500051 VPDLVWTRCN GGHWIATRGQ LIREAYEDYR HFSSECPFIP REAGEAYDFI 01000101 PTSMDPPEQR QFRALANQVV GMPVVDKLEN RIQELACSLI ESLRPQGQCN 01500151 FTEDYAEPFP IRIFMLLAGL PEEDIPHLKY LTDQMTRPDG SMTFAEAKEA 02000201 LYDYLIPIIE QRRQKPGTDA ISIVANGQVN GRPITSDEAK RMCGLLLVGG 02500251 LDTVVNFLSF SMEFLAKSPE HRQELIERPE RIPAACEELL RRFSLVADGR 03000301 ILTSDYEFHG VQLKKGDQIL LPQMLSGLDE RENACPMHVD FSRQKVSHTT 03500351 FGHGSHLCLG QHLARREIIV TLKEWLTRIP DFSIAPGAQI QHKSGIVSGV 04000401 QALPLVWDPA TTKAV
>RecName: Full=Camphor 5-monooxygenase; AltName: Full=Cytochrome P450-cam; Short=Cytochrome P450cam [Pseudomonas putida] = CYP101A1Sequence ID: P00183.2 Length: 415Range 1: 18 to 410
Score:537 bits(1384), Expect:0.0, Method:Compositional matrix adjust., Identities:259/393(66%), Positives:306/393(77%), Gaps:0/393(0%)
Query 18 NVPPERVVDFDMFNPPHISQGLQEAWKHLQDPGGPDLVWTPHNGGHWIATRGKLIKEMFE 77 +VP V DFDM+NP ++S G+QEAW LQ+ PDLVWT NGGHWIATRG+LI+E +ESbjct 18 HVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYE 77
Query 78 DHEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNAAVGPPVVDRIEDEIRELAR 137 D+ HFSSECPF+PREAGE Y FIPTSMDPPEQR +R + N VG PVVD++E+ I+ELA Sbjct 78 DYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELAC 137
Query 138 ELIEDIRAQGSCDFTKEYAEIFPIRIFLMLVDLPSSDAPRLKYIADQMTRPDGSMTMAEA 197 LIE +R QG C+FT++YAE FPIRIF++L LP D P LKY+ DQMTRPDGSMT AEASbjct 138 SLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEA 197
Query 198 TDLFFDYLSPFIDERMENPGDDAISAVVQSKVNGRPITKDEALRICgllllaglDTVVNF 257 + +DYL P I++R + PG DAIS V +VNGRPIT DEA R+CGLLL+ GLDTVVNFSbjct 198 KEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNF 257
Query 258 LSFMMQFLAENPEHRRALVDDPELIPAATEELLRRFGLVADARIIKGDIEFDGVDLKDGD 317 LSF M+FLA++PEHR+ L++ PE IPAA EELLRRF LVAD RI+ D EF GV LK GDSbjct 258 LSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGD 317
Query 318 MVALPTMLFGLDERENACPMDVDFGREKIEHTTFGHGVHHCAGAHLARVEIQTTLEEWLA 377 + LP ML GLDERENACPM VDF R+K+ HTTFGHG H C G HLAR EI TL+EWL Sbjct 318 QILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLT 377
Query 378 RIPDFEIAPGAKVGHQSGIVGAVTSLPLVWDTA 410 RIPDF IAPGA++ H+SGIV V +LPLVWD ASbjct 378 RIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPA 410
Score:537 bits(1384), Expect:0.0, Method:Compositional matrix adjust., Identities:259/393(66%), Positives:306/393(77%), Gaps:0/393(0%)
Query 18 NVPPERVVDFDMFNPPHISQGLQEAWKHLQDPGGPDLVWTPHNGGHWIATRGKLIKEMFE 77 +VP V DFDM+NP ++S G+QEAW LQ+ PDLVWT NGGHWIATRG+LI+E +ESbjct 18 HVPEHLVFDFDMYNPSNLSAGVQEAWAVLQESNVPDLVWTRCNGGHWIATRGQLIREAYE 77
Query 78 DHEHFSSECPFLPREAGEQYSFIPTSMDPPEQRPYRRILNAAVGPPVVDRIEDEIRELAR 137 D+ HFSSECPF+PREAGE Y FIPTSMDPPEQR +R + N VG PVVD++E+ I+ELA Sbjct 78 DYRHFSSECPFIPREAGEAYDFIPTSMDPPEQRQFRALANQVVGMPVVDKLENRIQELAC 137
Query 138 ELIEDIRAQGSCDFTKEYAEIFPIRIFLMLVDLPSSDAPRLKYIADQMTRPDGSMTMAEA 197 LIE +R QG C+FT++YAE FPIRIF++L LP D P LKY+ DQMTRPDGSMT AEASbjct 138 SLIESLRPQGQCNFTEDYAEPFPIRIFMLLAGLPEEDIPHLKYLTDQMTRPDGSMTFAEA 197
Query 198 TDLFFDYLSPFIDERMENPGDDAISAVVQSKVNGRPITKDEALRICgllllaglDTVVNF 257 + +DYL P I++R + PG DAIS V +VNGRPIT DEA R+CGLLL+ GLDTVVNFSbjct 198 KEALYDYLIPIIEQRRQKPGTDAISIVANGQVNGRPITSDEAKRMCGLLLVGGLDTVVNF 257
Query 258 LSFMMQFLAENPEHRRALVDDPELIPAATEELLRRFGLVADARIIKGDIEFDGVDLKDGD 317 LSF M+FLA++PEHR+ L++ PE IPAA EELLRRF LVAD RI+ D EF GV LK GDSbjct 258 LSFSMEFLAKSPEHRQELIERPERIPAACEELLRRFSLVADGRILTSDYEFHGVQLKKGD 317
Query 318 MVALPTMLFGLDERENACPMDVDFGREKIEHTTFGHGVHHCAGAHLARVEIQTTLEEWLA 377 + LP ML GLDERENACPM VDF R+K+ HTTFGHG H C G HLAR EI TL+EWL Sbjct 318 QILLPQMLSGLDERENACPMHVDFSRQKVSHTTFGHGSHLCLGQHLARREIIVTLKEWLT 377
Query 378 RIPDFEIAPGAKVGHQSGIVGAVTSLPLVWDTA 410 RIPDF IAPGA++ H+SGIV V +LPLVWD ASbjct 378 RIPDFSIAPGAQIQHKSGIVSGVQALPLVWDPA 410
Alphafold2
Alphafold2
PDBj 1.3Å
PDBj 1.3Å
CYP101A1 (P450cam) WT
Alphafold2
Alphafold2
Alphafold2_ptm
Alphafold2_ptm
PDB
PDB
100%1GEK 1.7Å_415aa1IWI 2.0Å2L8M NMR2Z97 1.8Å2ZAW 1.6Å2ZAX 1.6Å2ZWT 1.4Å2ZWU 1.3Å
+2aa at C-terminal1LWL 2.2Å_417aa1RF9 1.8Å
Histag at C-terminal2FE6 1.5Å_421aa3WRH 1.6Å3WRI 2.9Å3WRJ 1.9Å3WRK 2.6Å3WRL 1.7Å3WRM 2.0Å
-1Met1CP4 1.9Å_144aa3CP4 2.3Å4CP4 2.1Å5CP4 1.8Å2CPP 1.6Å3CPP 1.9Å4CPP 2.1Å5CPP 2.1Å6CPP 1.9Å7CPP 2.0Å8CPP 2.1Å1NOO 2.2Å1PHA 1.6Å1PHB 1.6Å1PHC 1.6Å1PHD 1.6Å1PHE 1.6Å1PHF 1.6Å1PHG 1.6Å1RE9 1.5Å1YRC 1.4Å1YRD 1.7Å2H7Q 1.5Å
-1Met +Histag at C1P2Y 2.3Å_420aa1P7R 2.9Å
-2aa2LQD NMR_413aa
-N terminus +Histag2FER 1.7Å_2FEU 1.7Å
Variant C334A2A1M 2.1Å1GJM 2.2Å1K2O 1.7Å1QMQ 1.6Å1T87 1.8Å1T88 1.9Å3L61 1.5Å3L62 1.7Å3L63 1.5Å3OIA 1.7Å3OL5 1.8Å3P6M 2.0Å3P6N 1.7Å3P6O 2.0Å3P6P 1.9Å3P6Q 2.0Å3P6R 2.1Å3P6S 2.0Å3P6T 1.9Å3P6U 1.7Å3P6V 2.0Å3P6W 2.1Å3P6X 1.7Å5IKI 1.5Å
Mutation1aa2CP4 2.1Å_T252A2AIO 1.6Å1GEB 2.0Å_T252I
1IWJ 2.0Å_R109K1IWK 2.0Å_R112K
2QBL 1.8Å_G248T2QBN 1.8Å_G248V2QBO 1.9Å
6WFL 1.6Å_D251E6CP4 1.9Å_D251N
1AKD 1.8Å_E276Q1DZ4 1.6Å1DZ6 1.9Å1DZ8 1.9Å1DZ9 1.9Å1O76 1.8Å1UYU 2.0Å2M56 NMR
2ZUH 1.6Å_D297A2ZUI 1.5Å_D297N2ZUJ 1.6A_D297L
C334A_&5WK7 2.0Å_R186A2H7R 2.1Å_L244A2A1N 1.9Å_D251N6WE6 2.2Å_D251E2A1O 1.6Å_T252A4L49 2.1Å_L358A4L4D 2.1Å1T85 1.8Å_L358P1T86 1.9Å
2aaF87W/Y96F1C8J 2.1Å
C334AR365L/E366Q3FWG 2.6Å
3aaC357U/R365L/E366Q3FWI 2.4Å3FWJ 1.9Å
C334A_L358A/K178G/D182N4L4B 2.1Å4L4F 1.3Å
C334A_F87W/Y96F/V247L1J51 2.2Å1MPW 2.3Å2FRZ 2.1Å
4aaE276Q/C357U/R365L/E366Q3FWF 1.8Å
C334A_F87W/Y96F/L244A/V247L2GQX 2.1Å2GR6 2.3Å
5aaC334A_C58S/C85S/C136S/C285S/K344C/4JWS 2.2Å2JWU 2.2Å4JX1 2.1Å6NBL 2.2Å
6aaC334A_C48S/C58S/C85SC136S/S190C/C285S/4EK1 2.0Å
C334A_C48S/C58S/C85SC136S/S190C/C285S/4EK1 2.0Å
C334A_T1A/C48S/C58S/C85SC136S/C285S/4EK1 2.0Å
Alphafold result (localcolabfold using AF2 parameters)spectrum b, rainbow_rev, maximum=100, minimum=502ZWU rotated & Center using F351, C358, G360White
Alphafold result (localcolabfold using AF2 parameters)spectrum b, rainbow_rev, maximum=100, minimum=502ZWU rotated & Center using F351, C358, G360White
Local-Global Alignment#CA N1 N2 DIST N RMSD SUMMARY(GDT) 405 405 4.0 405 0.70
GDT_TS LGA_S3 LGA_Q 97.593 99.081 50.714
GDT_TS LGA_S3 LGA_Q 97.593 99.081 50.714
TM-Score (model5 vs 2ZWU)Number of residues in common= 405RMSD of the common residues= 3.831
TM-score = 0.7840 (d0= 7.26)MaxSub-score= 0.1987 (d0= 3.50)
GDT-TS-score= 0.4636 %(d<1)=0.0469 %(d<2)=0.0815 %(d<4)=0.7259 %(d<8)=1.0000 %
GDT-HA-score= 0.2222 %(d<0.5)=0.0346 %(d<1)=0.0469 %(d<2)=0.0815 %(d<4)=0.7259 %
TM-score = 0.7840 (d0= 7.26)MaxSub-score= 0.1987 (d0= 3.50)
GDT-TS-score= 0.4636 %(d<1)=0.0469 %(d<2)=0.0815 %(d<4)=0.7259 %(d<8)=1.0000 %
GDT-HA-score= 0.2222 %(d<0.5)=0.0346 %(d<1)=0.0469 %(d<2)=0.0815 %(d<4)=0.7259 %
TM-Align (model5 vs 2ZWU)Length of Chain_1: 415 residuesLength of Chain_2: 405 residues
Aligned length= 405, RMSD= 0.70, Seq_ID=n_identical/n_aligned= 1.000
TM-score= 0.96739 (if normalized by length of Chain_1)TM-score= 0.99110 (if normalized by length of Chain_2)
Aligned length= 405, RMSD= 0.70, Seq_ID=n_identical/n_aligned= 1.000
TM-score= 0.96739 (if normalized by length of Chain_1)TM-score= 0.99110 (if normalized by length of Chain_2)
CYP101B_Consensus
>Consensus1_B1_55%-100%/1-422 Percentage Identity Consensus MAARMAATLAPMATSAAMQRPAHVPPDRVVDIDIYAPPGLEADDFHKAWSDLQASNPAVVWTPRNEGHWIALGGEALAEVQSDPERFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRKLLNAHLNPAAIRGLSESIRATAIDLIEGFAARGHCNFTAEYAEQFPIRVFMALVGIPASDAPKIRHWAECMTRPGMDMTFEEAKAAFFDYLGPLVDARRANPGEDMISAMLTADMGGRDGVGRRLTRDEALSLVTQVLIAGVDTVVNFLGFIMLELARNPALRADLRARGADGILPAVNELFRRFGLVTIAREVRHDIEFHGGVALKAGDMIAIPTQVHGLDPRVNPDPLAIDPARKRARHSTFGSGPHMCPGQELARKEVAITLEEWLKRIPDFALAPDSDLSPVPGIVGALRRVELEWDTRCYP101B_Consensus
>Consensus1_trim/1-422 Percentage Identity Consensus MATSAAMQRPAHVPPDRVVDIDIYAPPGLEADDFHKAWSDLQASNPAVVWTPRNEGHWIALGGEALAEVQSDPERFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRKLLNAHLNPAAIRGLSESIRATAIDLIEGFAARGHCNFTAEYAEQFPIRVFMALVGIPASDAPKIRHWAECMTRPGMDMTFEEAKAAFFDYLGPLVDARRANPGEDMISAMLTADMGGRRLTRDEALSLVTQVLIAGVDTVVNFLGFIMLELARNPALRADLRARGADILPAVNELFRRFGLVTIAREVRHDIEFHGVALKAGDMIAIPTQVHGLDPRVNPDPLAIDPARKRARHSTFGSGPHMCPGQELARKEVAITLEEWLKRIPDFALAPDSDLSPVPGIVGALRRVELEWDT
>Consensus2_trim/1-423 Percentage Identity Consensus MATSAAMQRPAHVPPDRVVDIDIYAPPGLEADDFHKAWSDLQASNPAVVWTPRNEGHWIALGGEALAEVQSDPERFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRKLLNAHLNPAAIRGLSESIRATAIDLIEGFAARGHCNFTAEYAEQFPIRIFMALVGIPASDAPKIRHWAECMTRPGMDMTFEEAKAAFFDYLGPLVDARRANPGEDMISAMLTADMGGRRLTRDEALSLVTQVLIAGVDTVVNFLGFIMLELARNPALRADLRARGADILPAVNELFRRFGLVTIAREVRHDIEFHGVALKAGDMIAIPTQVHGLDPRVNPDPLAIDPARKRARHSTFGSGPHMCPGQELARKEVAITLEEWLKRIPDFAVAPDSDLSPVPGIVGALRRVELEWDT
>Consensus3/1-423 Percentage Identity Consensus MATSAAMQRPAHVPPDRVVDIDIYAPPGLEADDFHKAWSDLQASNPAVVWTPRNEGHWIALGGEALAEVQSDPERFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRKLLNAHLNPAAIRGLSESIRATAIDLIEGFAARGHCNFTAEYAEQFPIRIFMALVGIPASDAPKIRHWAECMTRPGMDMTFEEAKAAFFDYLGPLVDARRANPGEDMISAMLTADMGGRRLTRDEALSLVTQVLIAGVDTVVNFLGFIMLELARNPALRADLRARGADILPAVNELFRRFGLVTIAREVRHDIEFHGVALKAGDMIAIPTQVHGLDPRVNPDPLAIDPARKRARHSTFGSGPHMCPGQELARKEVAITLEEWLKRIPDFAVAPDSDLSPVPGIVGALRRVELEWDT
>cytochrome P450 [Novosphingobium sp. ERN07]Sequence ID: NLR72008.1 Length: 398 Range 1: 1 to 398
Score:696 bits(1795), Expect:0.0, Method:Compositional matrix adjust., Identities:349/398(88%), Positives:375/398(94%), Gaps:1/398(0%)
Query 7 MQRPAHVPPDRVVDIDIYAPPGLEADDFHKAWSDLQASNPAVVWTPRNEGHWIALGGEAL 66 MQ P+HVP DR+VDI+IY PPGLE+D FHKAWSDL ASNPAVVWTPRNEGHWIALGGEALSbjct 1 MQTPSHVPADRIVDINIYMPPGLESDGFHKAWSDLSASNPAVVWTPRNEGHWIALGGEAL 60
Query 67 AEVQSDPERFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRKLLNAHLNPAAIRGLSESI 126 +VQSDP RFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYR+LLNAHLNP+AIRGLS+SISbjct 61 QDVQSDPGRFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRQLLNAHLNPSAIRGLSDSI 120
Query 127 RATAIDLIEGFAARGHCNFTAEYAEQFPIRIFMALVGIPASDAPKIRHWAECMTRPGMDM 186 RATA+DLIEGFA+RGHCNF+AEYAEQFPIR+FMALVGI A++APKIRHWAECMTRPGMDMSbjct 121 RATAVDLIEGFASRGHCNFSAEYAEQFPIRVFMALVGIDAAEAPKIRHWAECMTRPGMDM 180
Query 187 TFEEAKAAFFDYLGPLVDARRANPGEDMISAMLTADMG-GRRLTRDEALSLVTQVLIAGV 245 TF+EAKA FFDY+ PLVDARR PGEDMISA+L AD+G GRRLTRDEALS+VTQVLIAG+Sbjct 181 TFDEAKAVFFDYVAPLVDARRQTPGEDMISALLHADLGEGRRLTRDEALSVVTQVLIAGL 240
Query 246 DTVVNFLGFIMLELARNPalradlrargadILPAVNELFRRFGLVTIAREVRHDIEFHGV 305 DTVVN LGFIM ELA N ALRADLR RGADI+P V+ELFRRFGLV+IAREVRHDIEFHGVSbjct 241 DTVVNVLGFIMRELAENAALRADLRQRGADIVPVVHELFRRFGLVSIAREVRHDIEFHGV 300
Query 306 ALKAGDMIAIPTQVHGLDPRVNPDPLAIDPARKRARHSTFGSGPHMCPGQELARKEVAIT 365 LKAGDMIAIPTQVHGLDPRVNPDPL IDPARKRARHSTFGSGPHMCPGQELARKEVAITSbjct 301 HLKAGDMIAIPTQVHGLDPRVNPDPLTIDPARKRARHSTFGSGPHMCPGQELARKEVAIT 360
Query 366 LEEWLKRIPDFAVAPDSDLSPVPGIVGALRRVELEWDT 403 LEEWLKRIPDFA++PDSDLSPVPGIVGALRRVEL W+TSbjct 361 LEEWLKRIPDFALSPDSDLSPVPGIVGALRRVELVWNT 398
Score:696 bits(1795), Expect:0.0, Method:Compositional matrix adjust., Identities:349/398(88%), Positives:375/398(94%), Gaps:1/398(0%)
Query 7 MQRPAHVPPDRVVDIDIYAPPGLEADDFHKAWSDLQASNPAVVWTPRNEGHWIALGGEAL 66 MQ P+HVP DR+VDI+IY PPGLE+D FHKAWSDL ASNPAVVWTPRNEGHWIALGGEALSbjct 1 MQTPSHVPADRIVDINIYMPPGLESDGFHKAWSDLSASNPAVVWTPRNEGHWIALGGEAL 60
Query 67 AEVQSDPERFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRKLLNAHLNPAAIRGLSESI 126 +VQSDP RFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYR+LLNAHLNP+AIRGLS+SISbjct 61 QDVQSDPGRFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRQLLNAHLNPSAIRGLSDSI 120
Query 127 RATAIDLIEGFAARGHCNFTAEYAEQFPIRIFMALVGIPASDAPKIRHWAECMTRPGMDM 186 RATA+DLIEGFA+RGHCNF+AEYAEQFPIR+FMALVGI A++APKIRHWAECMTRPGMDMSbjct 121 RATAVDLIEGFASRGHCNFSAEYAEQFPIRVFMALVGIDAAEAPKIRHWAECMTRPGMDM 180
Query 187 TFEEAKAAFFDYLGPLVDARRANPGEDMISAMLTADMG-GRRLTRDEALSLVTQVLIAGV 245 TF+EAKA FFDY+ PLVDARR PGEDMISA+L AD+G GRRLTRDEALS+VTQVLIAG+Sbjct 181 TFDEAKAVFFDYVAPLVDARRQTPGEDMISALLHADLGEGRRLTRDEALSVVTQVLIAGL 240
Query 246 DTVVNFLGFIMLELARNPalradlrargadILPAVNELFRRFGLVTIAREVRHDIEFHGV 305 DTVVN LGFIM ELA N ALRADLR RGADI+P V+ELFRRFGLV+IAREVRHDIEFHGVSbjct 241 DTVVNVLGFIMRELAENAALRADLRQRGADIVPVVHELFRRFGLVSIAREVRHDIEFHGV 300
Query 306 ALKAGDMIAIPTQVHGLDPRVNPDPLAIDPARKRARHSTFGSGPHMCPGQELARKEVAIT 365 LKAGDMIAIPTQVHGLDPRVNPDPL IDPARKRARHSTFGSGPHMCPGQELARKEVAITSbjct 301 HLKAGDMIAIPTQVHGLDPRVNPDPLTIDPARKRARHSTFGSGPHMCPGQELARKEVAIT 360
Query 366 LEEWLKRIPDFAVAPDSDLSPVPGIVGALRRVELEWDT 403 LEEWLKRIPDFA++PDSDLSPVPGIVGALRRVEL W+TSbjct 361 LEEWLKRIPDFALSPDSDLSPVPGIVGALRRVELVWNT 398
Alphafold2
Alphafold2
CYP101B
>CYP101B1C_1 1234567890 1234567890 1234567890 1234567890 1234567890 ----0001 MLPHDRGQNS TRRITAMEAP AHVPADRVVD IDIYMPPGLA EHGFHKAWSD 00500051 LSAGNPAVVW TPRNEGHWIA LGGEALQEVQ SDPERFSSRI IVLPKSVGEM 01000101 HGLIPTTIDP PEHRPYRQLL NAHLNPGAIR GLSESIRQTA VDLIEGFAAQ 01500151 GHCNFTAQYA EQFPIRVFMA LVGIEASEAP RIRHWAECMT RPGMDMTFDE 02000201 AKAVFFDYVG PLVDARRETP GEDMISAMIN ADLGDGRRLT RDEALSVVTQ 02500251 VLIAGLDTVV NVLGFIMREL AGNPALRADL RQRGADILPV VHELFRRFGL 03000301 VSIAREVRRD IEFHGVHLKA GDMIAIPTQV HGLDPRVNPD PLAIDPSRKR 03500351 ARHSTFGSGP HMCPGQELAR KEVAITLEEW LRRIPDFALG PNSDLSPVPG 04000401 IVGALRRVEL VWNTCYP101B
>Cytochrome P450 [Novosphingobium aromaticivorans] =CYP101B1Sequence ID: SCY90561.1 Length: 398 Range 1: 1 to 398
Score:691 bits(1783), Expect:0.0, Method:Compositional matrix adjust., Identities:349/398(88%), Positives:371/398(93%), Gaps:1/398(0%)
Query 7 MQRPAHVPPDRVVDIDIYAPPGLEADDFHKAWSDLQASNPAVVWTPRNEGHWIALGGEAL 66 M+ PAHVP DRVVDIDIY PPGL FHKAWSDL A NPAVVWTPRNEGHWIALGGEALSbjct 1 MEAPAHVPADRVVDIDIYMPPGLAEHGFHKAWSDLSAGNPAVVWTPRNEGHWIALGGEAL 60
Query 67 AEVQSDPERFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRKLLNAHLNPAAIRGLSESI 126 EVQSDPERFSSR+IVLPKSVGEMHGLIPTTIDPPEHRPYR+LLNAHLNP AIRGLSESISbjct 61 QEVQSDPERFSSRIIVLPKSVGEMHGLIPTTIDPPEHRPYRQLLNAHLNPGAIRGLSESI 120
Query 127 RATAIDLIEGFAARGHCNFTAEYAEQFPIRIFMALVGIPASDAPKIRHWAECMTRPGMDM 186 R TA+DLIEGFAA+GHCNFTA+YAEQFPIR+FMALVGI AS+AP+IRHWAECMTRPGMDMSbjct 121 RQTAVDLIEGFAAQGHCNFTAQYAEQFPIRVFMALVGIEASEAPRIRHWAECMTRPGMDM 180
Query 187 TFEEAKAAFFDYLGPLVDARRANPGEDMISAMLTADMG-GRRLTRDEALSLVTQVLIAGV 245 TF+EAKA FFDY+GPLVDARR PGEDMISAM+ AD+G GRRLTRDEALS+VTQVLIAG+Sbjct 181 TFDEAKAVFFDYVGPLVDARRETPGEDMISAMINADLGDGRRLTRDEALSVVTQVLIAGL 240
Query 246 DTVVNFLGFIMLELARNPalradlrargadILPAVNELFRRFGLVTIAREVRHDIEFHGV 305 DTVVN LGFIM ELA NPALRADLR RGADILP V+ELFRRFGLV+IAREVR DIEFHGVSbjct 241 DTVVNVLGFIMRELAGNPALRADLRQRGADILPVVHELFRRFGLVSIAREVRRDIEFHGV 300
Query 306 ALKAGDMIAIPTQVHGLDPRVNPDPLAIDPARKRARHSTFGSGPHMCPGQELARKEVAIT 365 LKAGDMIAIPTQVHGLDPRVNPDPLAIDP+RKRARHSTFGSGPHMCPGQELARKEVAITSbjct 301 HLKAGDMIAIPTQVHGLDPRVNPDPLAIDPSRKRARHSTFGSGPHMCPGQELARKEVAIT 360
Query 366 LEEWLKRIPDFAVAPDSDLSPVPGIVGALRRVELEWDT 403 LEEWL+RIPDFA+ P+SDLSPVPGIVGALRRVEL W+TSbjct 361 LEEWLRRIPDFALGPNSDLSPVPGIVGALRRVELVWNT 398
Score:691 bits(1783), Expect:0.0, Method:Compositional matrix adjust., Identities:349/398(88%), Positives:371/398(93%), Gaps:1/398(0%)
Query 7 MQRPAHVPPDRVVDIDIYAPPGLEADDFHKAWSDLQASNPAVVWTPRNEGHWIALGGEAL 66 M+ PAHVP DRVVDIDIY PPGL FHKAWSDL A NPAVVWTPRNEGHWIALGGEALSbjct 1 MEAPAHVPADRVVDIDIYMPPGLAEHGFHKAWSDLSAGNPAVVWTPRNEGHWIALGGEAL 60
Query 67 AEVQSDPERFSSRVIVLPKSVGEMHGLIPTTIDPPEHRPYRKLLNAHLNPAAIRGLSESI 126 EVQSDPERFSSR+IVLPKSVGEMHGLIPTTIDPPEHRPYR+LLNAHLNP AIRGLSESISbjct 61 QEVQSDPERFSSRIIVLPKSVGEMHGLIPTTIDPPEHRPYRQLLNAHLNPGAIRGLSESI 120
Query 127 RATAIDLIEGFAARGHCNFTAEYAEQFPIRIFMALVGIPASDAPKIRHWAECMTRPGMDM 186 R TA+DLIEGFAA+GHCNFTA+YAEQFPIR+FMALVGI AS+AP+IRHWAECMTRPGMDMSbjct 121 RQTAVDLIEGFAAQGHCNFTAQYAEQFPIRVFMALVGIEASEAPRIRHWAECMTRPGMDM 180
Query 187 TFEEAKAAFFDYLGPLVDARRANPGEDMISAMLTADMG-GRRLTRDEALSLVTQVLIAGV 245 TF+EAKA FFDY+GPLVDARR PGEDMISAM+ AD+G GRRLTRDEALS+VTQVLIAG+Sbjct 181 TFDEAKAVFFDYVGPLVDARRETPGEDMISAMINADLGDGRRLTRDEALSVVTQVLIAGL 240
Query 246 DTVVNFLGFIMLELARNPalradlrargadILPAVNELFRRFGLVTIAREVRHDIEFHGV 305 DTVVN LGFIM ELA NPALRADLR RGADILP V+ELFRRFGLV+IAREVR DIEFHGVSbjct 241 DTVVNVLGFIMRELAGNPALRADLRQRGADILPVVHELFRRFGLVSIAREVRRDIEFHGV 300
Query 306 ALKAGDMIAIPTQVHGLDPRVNPDPLAIDPARKRARHSTFGSGPHMCPGQELARKEVAIT 365 LKAGDMIAIPTQVHGLDPRVNPDPLAIDP+RKRARHSTFGSGPHMCPGQELARKEVAITSbjct 301 HLKAGDMIAIPTQVHGLDPRVNPDPLAIDPSRKRARHSTFGSGPHMCPGQELARKEVAIT 360
Query 366 LEEWLKRIPDFAVAPDSDLSPVPGIVGALRRVELEWDT 403 LEEWL+RIPDFA+ P+SDLSPVPGIVGALRRVEL W+TSbjct 361 LEEWLRRIPDFALGPNSDLSPVPGIVGALRRVELVWNT 398
Alphafold result (localcolabfold using AF2 parameters)spectrum b, rainbow_rev, maximum=100, minimum=502ZWU rotated & Center using F351, C358, G360WhiteCYP101A1_model5magenta
TM-Score (CYP101B1_model5 vs CYP101A1_model5)Number of residues in common= 414RMSD of the common residues= 10.041
TM-score = 0.4363 (d0= 7.34)MaxSub-score= 0.0891 (d0= 3.50)
GDT-TS-score= 0.1723 %(d<1)=0.0361 %(d<2)=0.0578 %(d<4)=0.1566 %(d<8)=0.4386 %
GDT-HA-score= 0.0699 %(d<0.5)=0.0289 %(d<1)=0.0361 %(d<2)=0.0578 %(d<4)=0.1566 %
TM-score = 0.4363 (d0= 7.34)MaxSub-score= 0.0891 (d0= 3.50)
GDT-TS-score= 0.1723 %(d<1)=0.0361 %(d<2)=0.0578 %(d<4)=0.1566 %(d<8)=0.4386 %
GDT-HA-score= 0.0699 %(d<0.5)=0.0289 %(d<1)=0.0361 %(d<2)=0.0578 %(d<4)=0.1566 %
CYP101C_Consensus
>Consensus1_C1_55%-100%/1-425 Percentage Identity Consensus MTDPMAAPMMMMSAAMIPAHVPADRVVDFDIFAPPGVEQDYFAAWKTLLGGDRPDGPGLVWTTANGGHWIAARGDVVRSLWGDAERLSSECLAVTPGLGEVMQFIPLQQDGPEHKPFRMAVMKGFASRHVVALEPKVREVARELIAGLRPRGGCEFVADFAEILPLHIFLTLIDVPLEDRPRLRPLGVQLTRPDGSMTVEQLRDAADDYLWPYIEERLANPGDDLFSRILSEPVEGRAWTFDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQALLRERPDLIPAAADELMRRYPTVSVSRNAVADVEVDGVTIRAGDLVYLPSVLHNLDPACFDAPEEVRFDRGLNPIRHTTMGVGAHRCVGAGLARMEVIVFLEEWLAGMPEFRLDPDRPVTMKGGNVGACTALPLVWDPAGAPGFSPARDYAACYP101C_Consensus
>Consensus1_trim/1-425 Percentage Identity Consensus MIPAHVPADRVVDFDIFAPPGVEQDYFAAWKTLLGGPGLVWTTANGGHWIAARGDVVRSLWGDAERLSSECLAVTPGLGEVMQFIPLQQDGPEHKPFRMAVMKGFASRHVVALEPKVREVARELIAGLRPRGGCEFVADFAEILPLHIFLTLIDVPLEDRPRLRPLGVQLTRPDGSMTVEQLRDAADDYLWPYIEERLANPGDDLFSRILSEPVEGRAWTFDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQALLRERPDLIPAAADELMRRYPTVSVSRNAVADVEVDGVTIRAGDLVYLPSVLHNLDPACFDAPEEVRFDRGLNPIRHTTMGVGAHRCVGAGLARMEVIVFLEEWLAGMPEFRLDPDRPVTMKGGNVGACTALPLVWD
>Consensus2_trim/1-425 Percentage Identity Consensus MIPAHVPADRVVDFDIFAPPGVEQDYFAAWKTLLGGPGLVWTTANGGHWIAARGDVVRSLWGDAERLSSECLAVTPGLGEVMQFIPLQQDGPEHKPFRMAVMKGFASRHVVALEPKVREVARELIAGLRPRGGCEFVADFAEILPLHIFLTLIDVPLEDRPRLRPLGVQLTRPDGSMTVEQLRDAADDYLWPYIEERLANPGDDLFSRILSEPVEGRAWTFDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQALLRERPDLIPAAADELMRRYPTVSVSRNAVADVEVDGVTIRAGDLVYLPSVLHNLDPACFDAPEEVRFDRGLNPIRHTTMGVGAHRCVGAGLARMEVIVFLEEWLAGMPEFRLDPDRPVTMKGGNVGACTALPLVWD
>Consensus3_trim/1-425 Percentage Identity Consensus MIPAHVPADRVVDFDIFAPPGVEQDYFAAWKTLLGGPGLVWTTANGGHWIAARGDVVRSLWGDAERLSSECLAVTPGLGEVMQFIPLQQDGPEHKPFRMAVMKGFASRHVVALEPKVREVARELIAGLRPRGGCEFVADFAEILPLHIFLTLIDVPLEDRPRLRPLGVQLTRPDGSMTVEQLRDAADDYLWPYIEERLANPGDDLFSRILSEPVEGRAWTFDEARRMCRNLLFGGLDTVAAMIGMVALHLARHPEDQALLRERPDLIPAAADELMRRYPTVSVSRNAVADVEVDGVTIRAGDLVYLPSVLHNLDPACFDAPEEVRFDRGLNPIRHTTMGVGAHRCVGAGLARMEVIVFLEEWLAGMPEFRLDPDRPVTMKGGNVGACTALPLVWD
CYP101C
>CYP101C1---- 1234567890 1234567890 1234567890 1234567890 1234567890 ----0001 MIPAHVPADR VVDFDIFNPP GVEQDYFAAW KTLLDGPGLV WSTANGGHWI 00500051 AARGDVVREL WGDAERLSSQ CLAVTPGLGK VMQFIPLQQD GAEHKAFRTP 01000101 VMKGLASRFV VALEPKVQAV ARKLMESLRP RGSCDFVSDF AEILPLNIFL 01500151 TLIDVPLEDR PRLRQLGVQL TRPDGSMTVE QLKQAADDYL WPFIEKRMAQ 02000201 PGDDLFSRIL SEPVGGRPWT VDEARRMCRN LLFGGLDTVA AMIGMVALHL 02500251 ARHPEDQRLL RERPDLIPAA ADELMRRYPT VAVSRNAVAD VDADGVTIRK 03000301 GDLVYLPSVL HNLDPASFEA PEEVRFDRGL APIRHTTMGV GAHRCVGAGL 03500351 ARMEVIVFLR EWLGGMPEFA LAPDKAVTMK GGNVGACTAL PLVWRACYP101C
>Cytochrome P450 [Novosphingobium aromaticivorans] = CYP101C1Sequence ID: SCY39293.1 Length: 396 Range 1: 1 to 394
Score:705 bits(1820), Expect:0.0, Method:Compositional matrix adjust., Identities:348/394(88%), Positives:365/394(92%), Gaps:0/394(0%)
Query 1 MIPAHVPADRVVDFDIFAPPGVEQDYFAAWKTLLGGPGLVWTTANGGHWIAARGDVVRSL 60 MIPAHVPADRVVDFDIF PPGVEQDYFAAWKTLL GPGLVW+TANGGHWIAARGDVVR LSbjct 1 MIPAHVPADRVVDFDIFNPPGVEQDYFAAWKTLLDGPGLVWSTANGGHWIAARGDVVREL 60
Query 61 WGDAERLSSECLAVTPGLGEVMQFIPLQQDGPEHKPFRMAVMKGFASRHVVALEPKVREV 120 WGDAERLSS+CLAVTPGLG+VMQFIPLQQDG EHK FR VMKG ASR VVALEPKV+ VSbjct 61 WGDAERLSSQCLAVTPGLGKVMQFIPLQQDGAEHKAFRTPVMKGLASRFVVALEPKVQAV 120
Query 121 ARELIAGLRPRGGCEFVADFAEILPLHIFLTLIDVPLEDRPRLRPLGVQLTRPDGSMTVE 180 AR+L+ LRPRG C+FV+DFAEILPL+IFLTLIDVPLEDRPRLR LGVQLTRPDGSMTVESbjct 121 ARKLMESLRPRGSCDFVSDFAEILPLNIFLTLIDVPLEDRPRLRQLGVQLTRPDGSMTVE 180
Query 181 QLRDAADDYLWPYIEERLANPGDDLFSRILSEPVEGRAWTFDEARRMCRNLLFGGLDTVA 240 QL+ AADDYLWP+IE+R+A PGDDLFSRILSEPV GR WT DEARRMCRNLLFGGLDTVASbjct 181 QLKQAADDYLWPFIEKRMAQPGDDLFSRILSEPVGGRPWTVDEARRMCRNLLFGGLDTVA 240
Query 241 AMIGMVALHLARHPEDQALLRERPDLIPAAADELMRRYPTVSVSRNAVADVEVDGVTIRA 300 AMIGMVALHLARHPEDQ LLRERPDLIPAAADELMRRYPTV+VSRNAVADV+ DGVTIR Sbjct 241 AMIGMVALHLARHPEDQRLLRERPDLIPAAADELMRRYPTVAVSRNAVADVDADGVTIRK 300
Query 301 GDLVYLPSVLHNLDPACFDAPEEVRFDRGLNPIRHTTMGVGAHRCVGAGLARMEVIVFLE 360 GDLVYLPSVLHNLDPA F+APEEVRFDRGL PIRHTTMGVGAHRCVGAGLARMEVIVFL Sbjct 301 GDLVYLPSVLHNLDPASFEAPEEVRFDRGLAPIRHTTMGVGAHRCVGAGLARMEVIVFLR 360
Query 361 EWLAGMPEFRLDPDRPVTMKGGNVGACTALPLVW 394 EWL GMPEF L PD+ VTMKGGNVGACTALPLVWSbjct 361 EWLGGMPEFALAPDKAVTMKGGNVGACTALPLVW 394
Score:705 bits(1820), Expect:0.0, Method:Compositional matrix adjust., Identities:348/394(88%), Positives:365/394(92%), Gaps:0/394(0%)
Query 1 MIPAHVPADRVVDFDIFAPPGVEQDYFAAWKTLLGGPGLVWTTANGGHWIAARGDVVRSL 60 MIPAHVPADRVVDFDIF PPGVEQDYFAAWKTLL GPGLVW+TANGGHWIAARGDVVR LSbjct 1 MIPAHVPADRVVDFDIFNPPGVEQDYFAAWKTLLDGPGLVWSTANGGHWIAARGDVVREL 60
Query 61 WGDAERLSSECLAVTPGLGEVMQFIPLQQDGPEHKPFRMAVMKGFASRHVVALEPKVREV 120 WGDAERLSS+CLAVTPGLG+VMQFIPLQQDG EHK FR VMKG ASR VVALEPKV+ VSbjct 61 WGDAERLSSQCLAVTPGLGKVMQFIPLQQDGAEHKAFRTPVMKGLASRFVVALEPKVQAV 120
Query 121 ARELIAGLRPRGGCEFVADFAEILPLHIFLTLIDVPLEDRPRLRPLGVQLTRPDGSMTVE 180 AR+L+ LRPRG C+FV+DFAEILPL+IFLTLIDVPLEDRPRLR LGVQLTRPDGSMTVESbjct 121 ARKLMESLRPRGSCDFVSDFAEILPLNIFLTLIDVPLEDRPRLRQLGVQLTRPDGSMTVE 180
Query 181 QLRDAADDYLWPYIEERLANPGDDLFSRILSEPVEGRAWTFDEARRMCRNLLFGGLDTVA 240 QL+ AADDYLWP+IE+R+A PGDDLFSRILSEPV GR WT DEARRMCRNLLFGGLDTVASbjct 181 QLKQAADDYLWPFIEKRMAQPGDDLFSRILSEPVGGRPWTVDEARRMCRNLLFGGLDTVA 240
Query 241 AMIGMVALHLARHPEDQALLRERPDLIPAAADELMRRYPTVSVSRNAVADVEVDGVTIRA 300 AMIGMVALHLARHPEDQ LLRERPDLIPAAADELMRRYPTV+VSRNAVADV+ DGVTIR Sbjct 241 AMIGMVALHLARHPEDQRLLRERPDLIPAAADELMRRYPTVAVSRNAVADVDADGVTIRK 300
Query 301 GDLVYLPSVLHNLDPACFDAPEEVRFDRGLNPIRHTTMGVGAHRCVGAGLARMEVIVFLE 360 GDLVYLPSVLHNLDPA F+APEEVRFDRGL PIRHTTMGVGAHRCVGAGLARMEVIVFL Sbjct 301 GDLVYLPSVLHNLDPASFEAPEEVRFDRGLAPIRHTTMGVGAHRCVGAGLARMEVIVFLR 360
Query 361 EWLAGMPEFRLDPDRPVTMKGGNVGACTALPLVW 394 EWL GMPEF L PD+ VTMKGGNVGACTALPLVWSbjct 361 EWLGGMPEFALAPDKAVTMKGGNVGACTALPLVW 394
Alphafold2
Alphafold2
CYP101D_Consensus
>Consensus1_D1_55%-100%/1-453 Percentage Identity Consensus MPHHFCRPGQRSRGCARQSMMNHGKDAAMNTDTLAPPSAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDDTPPLVWTPFTGGHWIATRGAVIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDKGLNLAQIRKVESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFLALADLPMEDAPKLNALAKEMTRPSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGEPMPHDKALGLISLLLLGGLDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRRFAVVSDARMVAKDMEYHGTQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRNGIAHSTFAQGPHRCAGMHLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWKVAKAAAGCYP101D_Consensus
>Consensus1_trim/1-453 Percentage Identity Consensus MNTDTLAPPSAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTPFTGGHWIATRGAVIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDKGLNLAQIRKVESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFLALADLPMEDAPKLNALAKEMTRPSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGEPMPHDKALGLISLLLLGGLDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRRFAVVSDARMVAKDMEYHGTQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFAQGPHRCAGMHLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWKV
>Consensus2_trim/1-454 Percentage Identity Consensus MNTDTLAPPSAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTPFTGGHWIATRGALIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDKGLNLAQIRKVESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFLALADLPMEDAPKLNALAKEMTRPSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGEPMPHDKALGLISLLLLGGLDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRRFPVVSDARMVAKDMEYHGTQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFAQGPHRCAGMHLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWKV
>Consensus3/1-454 Percentage Identity Consensus MNTDTLAPPSAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTPFTGGHWIATRGALIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDKGLNLAQIRKVESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFMALADLPMEDAPKLNALAKEMTRPSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGEPMPHDKALGLISLLLLGGLDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRRFPVVSDARMVAKDMEYHGVQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFAQGPHRCAGMHLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWKV
>cytochrome P450 [Sphingobium baderi]Sequence ID: WP_062067716.1 Length: 417Range 1: 1 to 417
Score:723 bits(1866), Expect:0.0, Method:Compositional matrix adjust., Identities:354/417(85%), Positives:382/417(91%), Gaps:0/417(0%)
Query 1 MNTDTLAPPSAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTP 60 MNTD+L+PPSAKHR P PDHVPA LVRE+D YALDGIEEG+HEAWKRVQQPDTPPL+WTPSbjct 1 MNTDSLSPPSAKHRVPVPDHVPAALVREVDAYALDGIEEGFHEAWKRVQQPDTPPLIWTP 60
Query 61 FTGGHWIATRGALIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDK 120 FTGGHWIATRGALIDEIYR PDRFSS VI++P+EAGE YEMVPTKMDPPEHTPYRKA+DKSbjct 61 FTGGHWIATRGALIDEIYRSPDRFSSRVIWVPREAGEAYEMVPTKMDPPEHTPYRKAIDK 120
Query 121 GLNLAQIRKVESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFMALADLPMEDAPKL 180 GLNLAQIRKVESD+RAVA+ELIEGFA +GHCDFA D+A VFPVKVF+ALA LPMEDAPKLSbjct 121 GLNLAQIRKVESDIRAVAVELIEGFAEKGHCDFAKDFAGVFPVKVFLALAGLPMEDAPKL 180
Query 181 NALAKEMTRPSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGE 240 N LA EMTRPSGNTPEEQ SLEAANKGFFAYVAPII ARRG G DLIT MVNSEING+Sbjct 181 NLLANEMTRPSGNTPEEQGKSLEAANKGFFAYVAPIIEARRGSAGADLITQMVNSEINGQ 240
Query 241 PMPHDKAlglisllllgglDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRR 300 PM DKALGL+SLLLLGGLDTVVNFLSFMMIYLARHPETV ELRSDP+KLQRGVEEMFRRSbjct 241 PMAQDKALGLVSLLLLGGLDTVVNFLSFMMIYLARHPETVEELRSDPMKLQRGVEEMFRR 300
Query 301 FPVVSDARMVAKDMEYHGVQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFA 360 F VVSDAR VA+DME+ G +LKAGD++LLPT LHGLDDTQ+ED M VDLSRR +AHSTFASbjct 301 FGVVSDARYVARDMEFQGTKLKAGDLVLLPTVLHGLDDTQHEDAMTVDLSRRNVAHSTFA 360
Query 361 QGPHRCAGMHLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWKV 417 QGPHRCAGMHLARMEV+VTLQEWL RIP F L EGA+PIYH+GIVAAVENVPLVW+VSbjct 361 QGPHRCAGMHLARMEVLVTLQEWLARIPRFTLAEGASPIYHAGIVAAVENVPLVWEV 417
Score:723 bits(1866), Expect:0.0, Method:Compositional matrix adjust., Identities:354/417(85%), Positives:382/417(91%), Gaps:0/417(0%)
Query 1 MNTDTLAPPSAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTP 60 MNTD+L+PPSAKHR P PDHVPA LVRE+D YALDGIEEG+HEAWKRVQQPDTPPL+WTPSbjct 1 MNTDSLSPPSAKHRVPVPDHVPAALVREVDAYALDGIEEGFHEAWKRVQQPDTPPLIWTP 60
Query 61 FTGGHWIATRGALIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDK 120 FTGGHWIATRGALIDEIYR PDRFSS VI++P+EAGE YEMVPTKMDPPEHTPYRKA+DKSbjct 61 FTGGHWIATRGALIDEIYRSPDRFSSRVIWVPREAGEAYEMVPTKMDPPEHTPYRKAIDK 120
Query 121 GLNLAQIRKVESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFMALADLPMEDAPKL 180 GLNLAQIRKVESD+RAVA+ELIEGFA +GHCDFA D+A VFPVKVF+ALA LPMEDAPKLSbjct 121 GLNLAQIRKVESDIRAVAVELIEGFAEKGHCDFAKDFAGVFPVKVFLALAGLPMEDAPKL 180
Query 181 NALAKEMTRPSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGE 240 N LA EMTRPSGNTPEEQ SLEAANKGFFAYVAPII ARRG G DLIT MVNSEING+Sbjct 181 NLLANEMTRPSGNTPEEQGKSLEAANKGFFAYVAPIIEARRGSAGADLITQMVNSEINGQ 240
Query 241 PMPHDKAlglisllllgglDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRR 300 PM DKALGL+SLLLLGGLDTVVNFLSFMMIYLARHPETV ELRSDP+KLQRGVEEMFRRSbjct 241 PMAQDKALGLVSLLLLGGLDTVVNFLSFMMIYLARHPETVEELRSDPMKLQRGVEEMFRR 300
Query 301 FPVVSDARMVAKDMEYHGVQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFA 360 F VVSDAR VA+DME+ G +LKAGD++LLPT LHGLDDTQ+ED M VDLSRR +AHSTFASbjct 301 FGVVSDARYVARDMEFQGTKLKAGDLVLLPTVLHGLDDTQHEDAMTVDLSRRNVAHSTFA 360
Query 361 QGPHRCAGMHLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWKV 417 QGPHRCAGMHLARMEV+VTLQEWL RIP F L EGA+PIYH+GIVAAVENVPLVW+VSbjct 361 QGPHRCAGMHLARMEVLVTLQEWLARIPRFTLAEGASPIYHAGIVAAVENVPLVWEV 417
Alphafold2
Alphafold2
CYP101D
>CYP101D1---- 1234567890 1234567890 1234567890 1234567890 1234567890 ----0001 MNAQTSTATQ KHRVAPPPHV PGHLIREIDA YDLDGLEQGF HEAWKRVQQP 00500051 DTPPLVWTPF TGGHWIATRG TLIDEIYRSP ERFSSRVIWV PREAGEAYDM 01000101 VPTKLDPPEH TPYRKAIDKG LNLAEIRKLE DQIRTIAVEI IEGFADRGHC 01500151 EFGSEFSTVF PVRVFLALAG LPVEDATKLG LLANEMTRPS GNTPEEQGRS 02000201 LEAANKGFFE YVAPIIAARR GGSGTDLITR ILNVEIDGKP MPDDRALGLV 02500251 SLLLLGGLDT VVNFLGFMMI YLSRHPETVA EMRREPLKLQ RGVEELFRRF 03000301 AVVSDARYVV SDMEFHGTML KEGDLILLPT ALHGLDDRHH DDPMTVDLSR 03500351 RDVTHSTFAQ GPHRCAGMHL ARLEVTVMLQ EWLARIPEFR LKDRAVPIYH 04000401 SGIVAAVENI PLEWEPQRVS ACYP101D
>cytochrome P450 [Novosphingobium aromaticivorans]Sequence ID: WP_011444175.1 Length: 421Range 1: 1 to 415
Score:642 bits(1656), Expect:0.0, Method:Compositional matrix adjust., Identities:313/416(75%), Positives:360/416(86%), Gaps:1/416(0%)
Query 1 MNTDTLAPPSAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTP 60 MN T + + KHR P HVP L+REID Y LDG+E+G+HEAWKRVQQPDTPPLVWTPSbjct 1 MNAQT-STATQKHRVAPPPHVPGHLIREIDAYDLDGLEQGFHEAWKRVQQPDTPPLVWTP 59
Query 61 FTGGHWIATRGALIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDK 120 FTGGHWIATRG LIDEIYR P+RFSS VI++P+EAGE Y+MVPTK+DPPEHTPYRKA+DKSbjct 60 FTGGHWIATRGTLIDEIYRSPERFSSRVIWVPREAGEAYDMVPTKLDPPEHTPYRKAIDK 119
Query 121 GLNLAQIRKVESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFMALADLPMEDAPKL 180 GLNLA+IRK+E +R +A+E+IEGFA RGHC+F ++++ VFPV+VF+ALA LP+EDA KLSbjct 120 GLNLAEIRKLEDQIRTIAVEIIEGFADRGHCEFGSEFSTVFPVRVFLALAGLPVEDATKL 179
Query 181 NALAKEMTRPSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGE 240 LA EMTRPSGNTPEEQ SLEAANKGFF YVAPIIAARRGG+GTDLIT ++N EI+G+Sbjct 180 GLLANEMTRPSGNTPEEQGRSLEAANKGFFEYVAPIIAARRGGSGTDLITRILNVEIDGK 239
Query 241 PMPHDKAlglisllllgglDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRR 300 PMP D+ALGL+SLLLLGGLDTVVNFL FMMIYL+RHPETVAE+R +PLKLQRGVEE+FRRSbjct 240 PMPDDRALGLVSLLLLGGLDTVVNFLGFMMIYLSRHPETVAEMRREPLKLQRGVEELFRR 299
Query 301 FPVVSDARMVAKDMEYHGVQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFA 360 F VVSDAR V DME+HG LK GD+ILLPTALHGLDD ++DPM VDLSRR + HSTFASbjct 300 FAVVSDARYVVSDMEFHGTMLKEGDLILLPTALHGLDDRHHDDPMTVDLSRRDVTHSTFA 359
Query 361 QGPHRCAGMHLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWK 416 QGPHRCAGMHLAR+EV V LQEWL RIPEF LK+ A PIYHSGIVAAVEN+PL W+Sbjct 360 QGPHRCAGMHLARLEVTVMLQEWLARIPEFRLKDRAVPIYHSGIVAAVENIPLEWE 415
Score:642 bits(1656), Expect:0.0, Method:Compositional matrix adjust., Identities:313/416(75%), Positives:360/416(86%), Gaps:1/416(0%)
Query 1 MNTDTLAPPSAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTP 60 MN T + + KHR P HVP L+REID Y LDG+E+G+HEAWKRVQQPDTPPLVWTPSbjct 1 MNAQT-STATQKHRVAPPPHVPGHLIREIDAYDLDGLEQGFHEAWKRVQQPDTPPLVWTP 59
Query 61 FTGGHWIATRGALIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDK 120 FTGGHWIATRG LIDEIYR P+RFSS VI++P+EAGE Y+MVPTK+DPPEHTPYRKA+DKSbjct 60 FTGGHWIATRGTLIDEIYRSPERFSSRVIWVPREAGEAYDMVPTKLDPPEHTPYRKAIDK 119
Query 121 GLNLAQIRKVESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFMALADLPMEDAPKL 180 GLNLA+IRK+E +R +A+E+IEGFA RGHC+F ++++ VFPV+VF+ALA LP+EDA KLSbjct 120 GLNLAEIRKLEDQIRTIAVEIIEGFADRGHCEFGSEFSTVFPVRVFLALAGLPVEDATKL 179
Query 181 NALAKEMTRPSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGE 240 LA EMTRPSGNTPEEQ SLEAANKGFF YVAPIIAARRGG+GTDLIT ++N EI+G+Sbjct 180 GLLANEMTRPSGNTPEEQGRSLEAANKGFFEYVAPIIAARRGGSGTDLITRILNVEIDGK 239
Query 241 PMPHDKAlglisllllgglDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRR 300 PMP D+ALGL+SLLLLGGLDTVVNFL FMMIYL+RHPETVAE+R +PLKLQRGVEE+FRRSbjct 240 PMPDDRALGLVSLLLLGGLDTVVNFLGFMMIYLSRHPETVAEMRREPLKLQRGVEELFRR 299
Query 301 FPVVSDARMVAKDMEYHGVQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFA 360 F VVSDAR V DME+HG LK GD+ILLPTALHGLDD ++DPM VDLSRR + HSTFASbjct 300 FAVVSDARYVVSDMEFHGTMLKEGDLILLPTALHGLDDRHHDDPMTVDLSRRDVTHSTFA 359
Query 361 QGPHRCAGMHLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWK 416 QGPHRCAGMHLAR+EV V LQEWL RIPEF LK+ A PIYHSGIVAAVEN+PL W+Sbjct 360 QGPHRCAGMHLARLEVTVMLQEWLARIPEFRLKDRAVPIYHSGIVAAVENIPLEWE 415
Alphafold2
Alphafold2
Alphafold result (localcolabfold using AF2 parameters)spectrum b, rainbow_rev, maximum=100, minimum=504C9M rotated & Center using F358, C365, G367White
Local-Global Alignment#CA N1 N2 DIST N RMSD SUMMARY(GDT) 408 408 4.0 408 0.61
GDT_TS LGA_S3 LGA_Q 98.836 99.596 57.647
GDT_TS LGA_S3 LGA_Q 98.836 99.596 57.647
TM-Score (model5 vs 4c9m)Number of residues in common= 408RMSD of the common residues= 0.608
TM-score = 0.9931 (d0= 7.28)MaxSub-score= 0.9717 (d0= 3.50)
GDT-TS-score= 0.9884 %(d<1)=0.9534 %(d<2)=1.0000 %(d<4)=1.0000 %(d<8)=1.0000 %
GDT-HA-score= 0.8725 %(d<0.5)=0.5368 %(d<1)=0.9534 %(d<2)=1.0000 %(d<4)=1.0000 %
TM-score = 0.9931 (d0= 7.28)MaxSub-score= 0.9717 (d0= 3.50)
GDT-TS-score= 0.9884 %(d<1)=0.9534 %(d<2)=1.0000 %(d<4)=1.0000 %(d<8)=1.0000 %
GDT-HA-score= 0.8725 %(d<0.5)=0.5368 %(d<1)=0.9534 %(d<2)=1.0000 %(d<4)=1.0000 %
TM-Align (model5 vs 4c9m)Length of Chain_1: 421 residuesLength of Chain_2: 408 residues
Aligned length= 408, RMSD= 0.61, Seq_ID=n_identical/n_aligned= 1.000
TM-score= 0.96265 (if normalized by length of Chain_1)TM-score= 0.99314 (if normalized by length of Chain_2)
Aligned length= 408, RMSD= 0.61, Seq_ID=n_identical/n_aligned= 1.000
TM-score= 0.96265 (if normalized by length of Chain_1)TM-score= 0.99314 (if normalized by length of Chain_2)
>CYP101D2---- 1234567890 1234567890 1234567890 1234567890 1234567890 ----0001 MNAQTSTATQ KHRVAPPPHV PGHLIREIDA YDLDGLEQGF HEAWKRVQQP 00500051 DTPPLVWTPF TGGHWIATRG TLIDEIYRSP ERFSSRVIWV PREAGEAYDM 01000101 VPTKLDPPEH TPYRKAIDKG LNLAEIRKLE DQIRTIAVEI IEGFADRGHC 01500151 EFGSEFSTVF PVRVFLALAG LPVEDATKLG LLANEMTRPS GNTPEEQGRS 02000201 LEAANKGFFE YVAPIIAARR GGSGTDLITR ILNVEIDGKP MPDDRALGLV 02500251 SLLLLGGLDT VVNFLGFMMI YLSRHPETVA EMRREPLKLQ RGVEELFRRF 03000301 AVVSDARYVV SDMEFHGTML KEGDLILLPT ALHGLDDRHH DDPMTVDLSR 03500351 RDVTHSTFAQ GPHRCAGMHL ARLEVTVMLQ EWLARIPEFR LKDRAVPIYH 04000401 SGIVAAVENI PLEWEPQRVS A
PDBj 2.2Å
PDBj 2.2Å
CYP101D2 WT
>CYP101D3---- 1234567890 1234567890 1234567890 1234567890 1234567890 ----0001 MGTTRMDTFN PQESRLATNF DEAVRAKVER PANVPEDRVY EIDMYALNGI 00500051 EDGYHEAWKK VQHPGIPDLI WTPFTGGHWI ATNGDTVKEV YSDPTRFSSE 01000101 VIFLPKEAGE KYQMVPTKMD PPEHTPYRKA LDKGLNLAKI RKVEDKVREV 01500151 ASSLIDSFAA RGECDFAAEY AELFPVHVFM ALADLPLEDI PVLSEYARQM 02000201 TRPEGNTPEE MATDLEAGNN GFYAYVDPII RARVGGDGDD LITLMVNSEI 02500251 NGERIAHDKA QGLISLLLLG GLDTVVNFLS FFMIHLARHP ELVAELRSDP 03000301 LKLMRGAEEM FRRFPVVSEA RMVAKDQEYK GVFLKRGDMI LLPTALHGLD 03500351 DAANPEPWKL DFSRRSISHS TFGGGPHRCA GMHLARMEVI VTLEEWLKRI 04000401 PEFSFKEGET PIYHSGIVAA VENVPLVWPI AR
>cytochrome P450 [Novosphingobium aromaticivorans DSM 12444]Sequence ID: ABD25921.1 Length: 417Range 1: 8 to 415
Score:624 bits(1610), Expect:0.0, Method:Compositional matrix adjust., Identities:313/408(77%), Positives:343/408(84%), Gaps:0/408(0%)
Query 10 SAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTPFTGGHWIAT 69 + + + RP +VP D V EIDMYAL+GIE+GYHEAWK+VQ P P L+WTPFTGGHWIATSbjct 8 AVRAKVERPANVPEDRVYEIDMYALNGIEDGYHEAWKKVQHPGIPDLIWTPFTGGHWIAT 67
Query 70 RGALIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDKGLNLAQIRK 129 G + E+Y DP RFSSEVIFLPKEAGEKY+MVPTKMDPPEHTPYRKALDKGLNLA+IRKSbjct 68 NGDTVKEVYSDPTRFSSEVIFLPKEAGEKYQMVPTKMDPPEHTPYRKALDKGLNLAKIRK 127
Query 130 VESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFMALADLPMEDAPKLNALAKEMTR 189 VE VR VA LI+ FAARG CDFAA+YA +FPV VFMALADLP+ED P L+ A++MTRSbjct 128 VEDKVREVASSLIDSFAARGECDFAAEYAELFPVHVFMALADLPLEDIPVLSEYARQMTR 187
Query 190 PSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGEPMPHDKAlg 249 P GNTPEE A LEA N GF+AYV PII AR GG G DLIT+MVNSEINGE + HDKA GSbjct 188 PEGNTPEEMATDLEAGNNGFYAYVDPIIRARVGGDGDDLITLMVNSEINGERIAHDKAQG 247
Query 250 lisllllgglDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRRFPVVSDARM 309 LISLLLLGGLDTVVNFLSF MI+LARHPE VAELRSDPLKL RG EEMFRRFPVVS+ARMSbjct 248 LISLLLLGGLDTVVNFLSFFMIHLARHPELVAELRSDPLKLMRGAEEMFRRFPVVSEARM 307
Query 310 VAKDMEYHGVQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFAQGPHRCAGM 369 VAKD EY GV LK GDMILLPTALHGLDD N +P K+D SRR I+HSTF GPHRCAGMSbjct 308 VAKDQEYKGVFLKRGDMILLPTALHGLDDAANPEPWKLDFSRRSISHSTFGGGPHRCAGM 367
Query 370 HLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWKV 417 HLARMEVIVTL+EWLKRIPEF+ KEG PIYHSGIVAAVENVPLVW +Sbjct 368 HLARMEVIVTLEEWLKRIPEFSFKEGETPIYHSGIVAAVENVPLVWPI 415
Score:624 bits(1610), Expect:0.0, Method:Compositional matrix adjust., Identities:313/408(77%), Positives:343/408(84%), Gaps:0/408(0%)
Query 10 SAKHRAPRPDHVPADLVREIDMYALDGIEEGYHEAWKRVQQPDTPPLVWTPFTGGHWIAT 69 + + + RP +VP D V EIDMYAL+GIE+GYHEAWK+VQ P P L+WTPFTGGHWIATSbjct 8 AVRAKVERPANVPEDRVYEIDMYALNGIEDGYHEAWKKVQHPGIPDLIWTPFTGGHWIAT 67
Query 70 RGALIDEIYRDPDRFSSEVIFLPKEAGEKYEMVPTKMDPPEHTPYRKALDKGLNLAQIRK 129 G + E+Y DP RFSSEVIFLPKEAGEKY+MVPTKMDPPEHTPYRKALDKGLNLA+IRKSbjct 68 NGDTVKEVYSDPTRFSSEVIFLPKEAGEKYQMVPTKMDPPEHTPYRKALDKGLNLAKIRK 127
Query 130 VESDVRAVAIELIEGFAARGHCDFAADYANVFPVKVFMALADLPMEDAPKLNALAKEMTR 189 VE VR VA LI+ FAARG CDFAA+YA +FPV VFMALADLP+ED P L+ A++MTRSbjct 128 VEDKVREVASSLIDSFAARGECDFAAEYAELFPVHVFMALADLPLEDIPVLSEYARQMTR 187
Query 190 PSGNTPEEQAASLEAANKGFFAYVAPIIAARRGGTGTDLITVMVNSEINGEPMPHDKAlg 249 P GNTPEE A LEA N GF+AYV PII AR GG G DLIT+MVNSEINGE + HDKA GSbjct 188 PEGNTPEEMATDLEAGNNGFYAYVDPIIRARVGGDGDDLITLMVNSEINGERIAHDKAQG 247
Query 250 lisllllgglDTVVNFLSFMMIYLARHPETVAELRSDPLKLQRGVEEMFRRFPVVSDARM 309 LISLLLLGGLDTVVNFLSF MI+LARHPE VAELRSDPLKL RG EEMFRRFPVVS+ARMSbjct 248 LISLLLLGGLDTVVNFLSFFMIHLARHPELVAELRSDPLKLMRGAEEMFRRFPVVSEARM 307
Query 310 VAKDMEYHGVQLKAGDMILLPTALHGLDDTQNEDPMKVDLSRRGIAHSTFAQGPHRCAGM 369 VAKD EY GV LK GDMILLPTALHGLDD N +P K+D SRR I+HSTF GPHRCAGMSbjct 308 VAKDQEYKGVFLKRGDMILLPTALHGLDDAANPEPWKLDFSRRSISHSTFGGGPHRCAGM 367
Query 370 HLARMEVIVTLQEWLKRIPEFALKEGAAPIYHSGIVAAVENVPLVWKV 417 HLARMEVIVTL+EWLKRIPEF+ KEG PIYHSGIVAAVENVPLVW +Sbjct 368 HLARMEVIVTLEEWLKRIPEFSFKEGETPIYHSGIVAAVENVPLVWPI 415
CYP101J_Consensus
>Consensus1_J1_55%-100%/1-466 Percentage Identity Consensus MMPDFLEKLQIKTDEKSFICRAWRIRCRAMQGATAPAHGESMMASDMTATAAHLAPRPDNVPADRVFDFDIYRDVPEGSDFHASWRALMAQVPYPLMWTPHNGGHWVALRADISDVVMSDSERFSNHTVLVPKDTAGEAYRKLIPLSLDPPEHRPFRNLLNENLGPKPLKPVEDSIVDLTVSLIEGFRAKGQCNFVHEFAEQLPVRIFMQIVDLPIEDLPKLKHLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRGGDGEDMISRMVNGQVGGRPLTDLEAQNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPSLIDDALLEFFRRFPVVSSSREVRQDIEFEGVELKAGDMVMAPTIVVALDDAMNDDPLDFRLGRKARKHSTFGKGSHTCPGAHLARMEMKIVLREWFARIPEFRLAKGAPLRFTNGIVGSVKPFTLEWDVASTRALSAIGVTACYP101J_Consensus
>Consensus1_trim/1-466 Percentage Identity Consensus MASDMTATAAHLAPRPDNVPADRVFDFDIYRDVPEGSDFHASWRALMAQVPYPLMWTPHNGGHWVALRADISDVVMSDSERFSNHTVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNENLGPKPLKPVEDSIVDLTVSLIEGFRAKGQCNFVHEFAEQLPVRIFMQIVDLPIEDLPKLKHLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRGGDGEDMISRMVNGQVGGRPLTDLEAQNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPSLIDDALLEFFRRFPVVSSSREVRQDIEFEGVELKAGDMVMAPTIVVALDDAMNDDPLDFRLGRKARKHSTFGKGSHTCPGAHLARMEMKIVLREWFARIPEFRLAKGAPLRFTNGIVGSVKPFTLEWDV
>Consensus2_trim/1-466 Percentage Identity Consensus MASAMTAPAAHLAPRPDNVPADRVFDFDIYRDVPEGSDFHASWRALMAQVPYPLMWTPHNGGHWVALRADISDVVMSDSERFSNHTVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNENLGPKPLKPVEDSIVDLTVSLIEGFRAKGQCNFVHEFAEQLPVRIFMQIVDLPIEDLPKLKHLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRGGDGEDMISRMVNGQVGGRPLTDLEAQNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPSLIDDALLEFFRRFPVVSSSREVRQDIEFEGVELKAGDMVMAPTIVVALDEAMNDDPLDFRLGRKARKHSTFGKGSHTCPGAHLARMEMKIVLREWFARIPEFRLAKGAPLRFTNGIVGSVKPFTLEWDV
>Consensus3/1-466 Percentage Identity Consensus MASAMTAPAAHLAPRPDNVPADRVFDFDIYRDVPEGSDFHASWRALMAQVPYPLMWTPHNGGHWVALRADISDVVMSDSERFSNHTVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNENLGPKPLKPVEDSIVDLTVSLIEGFRAKGQCNFVHEFAEQLPVRIFMQIVDLPIEDLPKLKHLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRGGDGEDMISRMVNGQVGGRPLTDLEAQNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPSLIDDALLEFFRRFPVVSSSREVRQDIEFEGVELKAGDMVMAPTIVVALDEAMNDDPLDFRLGRKARKHSTFGKGSHTCPGAHLARMEMKIVLREWFARIPEFRLAKGAPLRFTNGIVGSVKPFTLEWDV
>cytochrome P450 [Sphingobium sp. C100]Sequence ID: WP_052028277.1 Length: 410Range 1: 1 to 410
Score:780 bits(2013), Expect:0.0, Method:Compositional matrix adjust., Identities:377/410(92%), Positives:393/410(95%), Gaps:0/410(0%)
Query 1 MASAMTAPAAHLAPRPDNVPADRVFDFDIYRDVPEGSDFHASWRALMAQVPYPLMWTPHN 60 MAS MTA AA LA +PDNVP DRV DFDIYRDVPEGSDFHASW ALMA+VPYPLMWTPHNSbjct 1 MASEMTATAATLAQQPDNVPDDRVLDFDIYRDVPEGSDFHASWHALMAEVPYPLMWTPHN 60
Query 61 GGHWVALRADISDVVMSDSERFSNHTVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNEN 120 GGHWVA RADISD+VMSDSERFSN TVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNENSbjct 61 GGHWVAFRADISDIVMSDSERFSNRTVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNEN 120
Query 121 LGPKPLKPVEDSIVDLTVSLIEGFRAKGQCNFVHEFAEQLPVRIFMQIVDLPIEDLPKLK 180 LGPKPLKPVEDSIVDLTVSLIEGF+AKGQC+FV EFAEQLPVRIFMQIVDLPI+DLPKLKSbjct 121 LGPKPLKPVEDSIVDLTVSLIEGFQAKGQCHFVEEFAEQLPVRIFMQIVDLPIKDLPKLK 180
Query 181 HLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRGGDGEDMISRMVNGQVGGRPLTDLEA 240 HLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRG DG+DMISRM+NGQVGGRPLTDLEASbjct 181 HLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRGADGQDMISRMINGQVGGRPLTDLEA 240
Query 241 QNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPSLIDDALLEFFRRFPVVSSS 300 QNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPS IDDALLEFFRRFPVVSSSSbjct 241 QNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPSRIDDALLEFFRRFPVVSSS 300
Query 301 REVRQDIEFEGVELKAGDMVMAPTIVVALDEAMNDDPLDFRLGRKARKHSTFGKGSHTCP 360 REV+QD+ FEGVELKAG+MVMAPT+VVALD+AMN+DPL+FRLGR ARKHSTFGKGSHTCPSbjct 301 REVKQDVTFEGVELKAGEMVMAPTVVVALDDAMNEDPLEFRLGRTARKHSTFGKGSHTCP 360
Query 361 GAHLARMEMKIVLREWFARIPEFRLAKGAPLRFTNGIVGSVKPFTLEWDV 410 GAHLARMEMK VLREWFARIPEFRLAK PLRF+NGIVGSVKPFTLEWDVSbjct 361 GAHLARMEMKTVLREWFARIPEFRLAKDEPLRFSNGIVGSVKPFTLEWDV 410
Score:780 bits(2013), Expect:0.0, Method:Compositional matrix adjust., Identities:377/410(92%), Positives:393/410(95%), Gaps:0/410(0%)
Query 1 MASAMTAPAAHLAPRPDNVPADRVFDFDIYRDVPEGSDFHASWRALMAQVPYPLMWTPHN 60 MAS MTA AA LA +PDNVP DRV DFDIYRDVPEGSDFHASW ALMA+VPYPLMWTPHNSbjct 1 MASEMTATAATLAQQPDNVPDDRVLDFDIYRDVPEGSDFHASWHALMAEVPYPLMWTPHN 60
Query 61 GGHWVALRADISDVVMSDSERFSNHTVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNEN 120 GGHWVA RADISD+VMSDSERFSN TVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNENSbjct 61 GGHWVAFRADISDIVMSDSERFSNRTVLVPKDTAGEAYRLIPLSLDPPEHRPFRNLLNEN 120
Query 121 LGPKPLKPVEDSIVDLTVSLIEGFRAKGQCNFVHEFAEQLPVRIFMQIVDLPIEDLPKLK 180 LGPKPLKPVEDSIVDLTVSLIEGF+AKGQC+FV EFAEQLPVRIFMQIVDLPI+DLPKLKSbjct 121 LGPKPLKPVEDSIVDLTVSLIEGFQAKGQCHFVEEFAEQLPVRIFMQIVDLPIKDLPKLK 180
Query 181 HLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRGGDGEDMISRMVNGQVGGRPLTDLEA 240 HLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRG DG+DMISRM+NGQVGGRPLTDLEASbjct 181 HLADQFTRPDGSLTYPEVSQLFRDYLTPVIAARRGADGQDMISRMINGQVGGRPLTDLEA 240
Query 241 QNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPSLIDDALLEFFRRFPVVSSS 300 QNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPS IDDALLEFFRRFPVVSSSSbjct 241 QNICIQVLVGGLDTVVNMLGFTFSYLARDHDLRRAIAADPSRIDDALLEFFRRFPVVSSS 300
Query 301 REVRQDIEFEGVELKAGDMVMAPTIVVALDEAMNDDPLDFRLGRKARKHSTFGKGSHTCP 360 REV+QD+ FEGVELKAG+MVMAPT+VVALD+AMN+DPL+FRLGR ARKHSTFGKGSHTCPSbjct 301 REVKQDVTFEGVELKAGEMVMAPTVVVALDDAMNEDPLEFRLGRTARKHSTFGKGSHTCP 360
Query 361 GAHLARMEMKIVLREWFARIPEFRLAKGAPLRFTNGIVGSVKPFTLEWDV 410 GAHLARMEMK VLREWFARIPEFRLAK PLRF+NGIVGSVKPFTLEWDVSbjct 361 GAHLARMEMKTVLREWFARIPEFRLAKDEPLRFSNGIVGSVKPFTLEWDV 410
Alphafold2
Alphafold2