MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHS
RGEPDELEYRVRMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGV
RDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQ
SIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQSYFSGSTEEDV
LLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEAC
SEFITLRNHTLNRFNNFSSAEIEGKKQPGHEKEHELYKRKDQQENLETVG
NHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKL
MFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSC
WVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIPV
SGGKKAVKTNYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDES
SDHDLLLNIPSHSSFPQAELLLPTQGFGAQASTSSSIYHNHVLH
| Relationships |
|---|
| The polypeptide, XM_017372876.1, derives from mRNA, XM_017372876.1. |

Analysis Date: 2022-01-09
Analysis Name: NCBI peptide blastp to SwissProt and TrEMBL without DCAR
Total hits: 10
Match: A0A5B6ZX47 ((Uncharacterized protein {ECO:0000313|EMBL:MPA49130.1}))
HSP 1 Score: 604.364 bits (1557), Expect = 0.000e+0
Identity = 337/556 (60.61%), Positives = 401/556 (72.12%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQ--SYFSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQP-GHEKEHELYKR-KDQQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIPVSGGKKAVKT--------NYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLPTQGFGAQASTSSSIYHNHV 542
M KQGPC HCGVTSTPLWRNGPP+KP+LCNACGSRWRTKG+L+NYTPLH+R EPDE E YRV R+K++S+K+KE KV+KRKQNH VV +APD YNQGY K LDEDTSNRSSSGSAIS S SC GSADASDLTG QSIVWD+ VPSRKRTCV R K S VEKLTKDLYTIL EQQ SYFSGS+EED+L E DTP+VSVEIGHGSVLIRHP+S+AR+EESEASS SV NKP+ NEA S TL H N+ NF + IE K+P G + EL KR K E L+ +GNH SP+ +L DVLN++EF H+TN E Q+LL LPS D + LPDSLK MF+SPQ+ EN++++QKL+A+GVFDLS + V NE TL RL L +LTKS WVEQYN+LKD S +GGS A H +AS S+ +KR R Q QN P G K +K+ +Y K +D +++ FSP++ PPD+ SLM D F F DESSD DLLL++PS+SSFPQAELLLPT FGA AS +SS + H+
Sbjct: 1 MGKQGPCHHCGVTSTPLWRNGPPDKPILCNACGSRWRTKGTLANYTPLHARAEPDEFEDYRVSRVKSISMKNKETKVIKRKQNHDNAVVGGIAPD------------YNQGYQKGLDEDTSNRSSSGSAISNSESCAQFGSADASDLTGQAQSIVWDTMVPSRKRTCVSRPKQSPVEKLTKDLYTILHEQQQSSYFSGSSEEDLLFESDTPMVSVEIGHGSVLIRHPSSIAREEESEASSLSVDNKPYTVNEAYSHLTTLPVHNDNKGVNFQNLVIEKIKKPTGQGMQQELIKREKAPLEKLQILGNHNSPLRYTDLKDVLNFDEFVSHLTNEEQQQLLKYLPSVDTVRLPDSLKSMFDSPQYMENLTSYQKLLAEGVFDLSFSGVKNEDYRTLKRLALCSLTKSKWVEQYNLLKDVKNKSNIGGSVVAGGHNVIASGNSVNLKRPRDSQYQNYP--GAKMTMKSPKRVMMKGSYEHKELVDNDSSCFSPRSLFALPPDSNSLMLDSFHFADESSDQDLLLDVPSNSSFPQAELLLPTSSFGALASATSSSLYPHL 542
Match: A0A5J5A688 ((Uncharacterized protein {ECO:0000313|EMBL:KAA8526010.1}))
HSP 1 Score: 602.438 bits (1552), Expect = 0.000e+0
Identity = 336/561 (59.89%), Positives = 396/561 (70.59%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQ-SYFSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQP-GHEKEHELYKR-KDQQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKD-----TNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIPVSGGKKAVKTN--------YGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLPTQGFGAQASTSSSIYHNHVL 543
M KQGPC HCGVTSTPLWRNGPPEKP+LCNACGSRWRTKG+L+NYTPLH+R EPDE E YRV R+K++S+K+KEAKVLKRKQNH VV RV PD YNQG+ K LDEDTSNRSS GSAIS S SCV GSADASDLTG QS+VWD+ VPSRKRTCV R KPS VEKLTKDLYTIL EQ+ SYFS S+EED+L E DTP+VSVEIGHGSVLIRHP+S+ R+EESEASS SV NK +P NEA S TL H N+ NF + EIE K+P G + E KR K E L+ +GNH SP+ I+L DVLN++EF H+TN E ++LLN LPS D PDSL+ MF+SPQ+KEN++++QKL+A+GVFDLS + V E TL RL L TKS WVEQYN++KD N S VGG H +A S+ +KR R GQ QN P G K +K+ Y + P+D + + FSPK+ PPDN+SLM D F DESSD DLLL++PS+SSFPQAELLLPT FGAQAS S + H L
Sbjct: 1 MGKQGPCYHCGVTSTPLWRNGPPEKPILCNACGSRWRTKGTLTNYTPLHARAEPDEFEDYRVSRVKSISIKNKEAKVLKRKQNHDNVVVGRVTPD------------YNQGFRKGLDEDTSNRSSPGSAISNSESCVQFGSADASDLTGPAQSVVWDTMVPSRKRTCVSRPKPSPVEKLTKDLYTILHEQESSYFSRSSEEDLLFESDTPMVSVEIGHGSVLIRHPSSITREEESEASSLSVDNKLYPVNEAYSRLTTLPVHNDNKGVNFPTLEIEKIKKPTGQGMQQEQIKRDKAPPEKLQILGNHNSPLRYIDLKDVLNFDEFASHLTNEEQKQLLNYLPSVDTAN-PDSLRSMFDSPQYKENLTSYQKLLAEGVFDLSFSGVITEDYRTLKRLALCNSTKSKWVEQYNLIKDEKNKNNNGESVVGG------HKVIALGNSVNLKRFRDGQHQNYP--GAKMMMKSPKRVMMKGCYEHREPIDNDGSCFSPKSLFALPPDNSSLMLDSFHHADESSDQDLLLDVPSNSSFPQAELLLPTSSFGAQASACGSSVYLHRL 540
Match: A0A5B6ZX92 ((Uncharacterized protein {ECO:0000313|EMBL:MPA49132.1}))
HSP 1 Score: 595.119 bits (1533), Expect = 0.000e+0
Identity = 333/556 (59.89%), Positives = 398/556 (71.58%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQ--SYFSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQP-GHEKEHELYKR-KDQQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIPVSGGKKAVKT--------NYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLPTQGFGAQASTSSSIYHNHV 542
M KQGPC HCGVTSTPLWRNGPP+KP+LCNACGSRWRTKG+L+NYTPLH+R EPDE E YRV R+K++S+K+KE KV+KRKQNH VV +APD YNQGY K LDEDTSNRSSSGSAIS S SC GSADASDLTG QSIVWD+ VPSRKRTCV R K S VEKLTKDLYTIL EQQ SYFSGS+EED+L E DTP+VSVEIGHGSVLIRHP+S+AR+EESEASS SV NKP+ NEA S TL H N+ NF + IE K+P G + EL KR K E L+ +GNH SP+ +L DVLN++EF H+TN E Q+LL LPS D + LPDSLK MF+ PQ+ EN++++QKL+A+GVFDLS + V E TL RL+L TKS WV+QYN+LKD S +GGS A H +AS S+ +KR R Q QN P G K +K+ +Y K +D +++ FSP++ PPD+ SLM D F F DESSD DLLL++PS+SSFPQAELLLPT FGA AS +SS + H+
Sbjct: 1 MGKQGPCHHCGVTSTPLWRNGPPDKPILCNACGSRWRTKGTLANYTPLHARAEPDEFEDYRVSRVKSISMKNKETKVIKRKQNHDNAVVGGIAPD------------YNQGYQKGLDEDTSNRSSSGSAISNSESCAQFGSADASDLTGQAQSIVWDTMVPSRKRTCVSRPKQSPVEKLTKDLYTILHEQQQSSYFSGSSEEDLLFESDTPMVSVEIGHGSVLIRHPSSIAREEESEASSLSVDNKPYTVNEAYSHLTTLPVHNDNKGVNFQNLVIEKIKKPTGQGMQQELIKREKAPLEKLQILGNHNSPLRYTDLKDVLNFDEFVSHLTNEEQQQLLKYLPSVDTVRLPDSLKSMFDGPQYMENLTSYQKLLAEGVFDLSFSGVKTEDCRTLKRLVLCNSTKSKWVDQYNLLKDVKNKSNIGGSVVAGGHNVIASGNSVNLKRPRDSQYQNYP--GAKMTMKSPKRVMMKGSYEHKELVDNDSSCFSPRSLFALPPDSNSLMLDSFHFADESSDQDLLLDVPSNSSFPQAELLLPTSSFGALASATSSSLYPHL 542
Match: A0A5B7A283 ((Uncharacterized protein {ECO:0000313|EMBL:MPA49133.1}))
HSP 1 Score: 593.193 bits (1528), Expect = 0.000e+0
Identity = 326/535 (60.93%), Positives = 383/535 (71.59%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQ--SYFSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQP-GHEKEHELYKR-KDQQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIP------VSGGKKAVKTNYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLP 523
M KQGPC HCGVTSTPLWRNGPPEKPVLCNACGSRWRTKG+L+NYTPLH+R EPDE E YRV R+K++ +K+KEAKVLKRKQNH VV VAP+ YNQG+ K LDEDTSNRSS GSAIS S SCV GSADASDLTG QSIVWD+ VPSRKRTCV R K S VEKLTKDLYTIL EQQ SYFSGS+EED+L E DTP+VSVEIGHGSVLIRHP+S+AR+EESEASS SV NKP+ NEA S TL H N+ NF + EIE K+P G + E KR K E L+ +GNH SP+ I+L DVLN++ F H+TN E +LL LP D LPDSLK MF+SPQ+KEN++++QKL+A+GVFDLS + V NE TL RL L +LTKS WVEQYN+LKD + G S H +AS S+ +KR R GQ QN P S + +K NY + +D +A+ FSP++ PPDN+SLM D F DESSD DLLL++PS+S FPQAELLLP
Sbjct: 1 MGKQGPCYHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGTLTNYTPLHARAEPDEFEDYRVSRVKSICIKNKEAKVLKRKQNHDNAVVGGVAPE------------YNQGFRKGLDEDTSNRSSPGSAISNSESCVQFGSADASDLTGQAQSIVWDTMVPSRKRTCVSRPKQSPVEKLTKDLYTILHEQQQSSYFSGSSEEDLLFESDTPMVSVEIGHGSVLIRHPSSIAREEESEASSLSVDNKPYHVNEAYSRLTTLPVHNDNKGVNFPTLEIEKIKKPTGQGMQQEQIKRDKAPPEKLQILGNHNSPLRYIDLKDVLNFDVFASHLTNEEQHQLLKYLPLVDTARLPDSLKSMFDSPQYKENLTSYQKLLAEGVFDLSFSGVKNEDYRTLKRLALCSLTKSKWVEQYNLLKDVKNKNNNGESVVVGGHNVIASGNSVNLKRFRDGQYQNYPGAQMTMKSPKRVMMKGNYEQRELIDNDASCFSPRSLFALPPDNSSLMLDSFHHADESSDQDLLLDVPSNSYFPQAELLLP 523
Match: A0A5J4ZUZ5 ((Uncharacterized protein {ECO:0000313|EMBL:KAA8522170.1}))
HSP 1 Score: 588.571 bits (1516), Expect = 0.000e+0
Identity = 337/566 (59.54%), Positives = 397/566 (70.14%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQ--SYFSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQP-GHEKEHELYKR-KDQQENLETVGNHKSPI-------------FSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIP------VSGGKKAVKTNYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLPTQGFGAQASTSSSIYHNH 541
M KQGPC HCGVTSTPLWRNGPP+KPVLCNACGSRWRTKG+L+NYTPLH+R EPDE E YRV R+K++S+K+KEAKVLKRKQNH VV APD YNQGY K LDEDTSNRSSSGSAIS S SC GSADASDLTG QSIVWD+ VPSRKRTCV R KPS VEKLTKDLYTIL EQQ SYFSGS+EED+L E DTP+VSVEIGHGSVLIRHP+S+AR+EESEASS SV NKP+ NEA S TL H + NF + IE K+P G + EL KR K E L+ +GNH SP+ S+ L DVLN++ F H+TN E Q+LL LPS D LPDSLK MF+SPQ+ EN++++QKL+A+GVFDLS V E TL RL L LTKS WVEQY +LKD + +GGS A H +AS S+ +KRSR Q QN P S + +K +Y K +D +++ FSP++ PPD++SLM D F + D+SSD DLLL++PS+SSFPQAELLLPT FG QASTSSS + H
Sbjct: 1 MGKQGPCHHCGVTSTPLWRNGPPDKPVLCNACGSRWRTKGTLANYTPLHARAEPDEFEDYRVSRVKSISIKNKEAKVLKRKQNHDNAVVGGFAPD------------YNQGYQKGLDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPAQSIVWDTMVPSRKRTCVSRPKPSPVEKLTKDLYTILHEQQQSSYFSGSSEEDLLFESDTPMVSVEIGHGSVLIRHPSSIAREEESEASSLSVDNKPYNINEAYSRLTTLPVHNDKKGVNFPNLVIENIKKPTGQGMQQELIKRDKAPSEKLQILGNHNSPLHYADLKVSYADLKVSLSLMDVLNFDAFVSHLTNEEQQQLLKHLPSVDTARLPDSLKSMFDSPQYTENITSYQKLLAEGVFDLSFPGVKAEDCRTLKRLALCDLTKSKWVEQYTLLKDVKNKNNIGGSVVAVGHDVIASHYSVNLKRSRDSQHQNYPGAKMTMRSPKRVMMKGSYEHKEHVDNDSSCFSPRSLFALPPDSSSLMLDSFHYADDSSDQDLLLDVPSNSSFPQAELLLPTSSFGIQASTSSSSVYPH 554
Match: A0A2C9WJ72 ((Uncharacterized protein {ECO:0000313|EMBL:OAY60120.1}))
HSP 1 Score: 563.533 bits (1451), Expect = 0.000e+0
Identity = 317/557 (56.91%), Positives = 386/557 (69.30%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSL-KHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQSY-FSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQ---PGHEKEHELYKR-KDQQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLS-SAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIPVSGGKKA-----VKTNYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLPTQGFGAQASTSSSIYHNHVL 543
M KQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKG+L+NYTPLH+R +PD+ E +RV R K++S+ K+K+ K+LKRK N+ VV R+APD Y QGY K LDEDTSNRSSSGSAIS S SC GSADASDLTG QSIVWD+ VPSRKRTCV R KPS VEKLTKDLYTI EQQS FSGS+EED+L E +TP+VSVEIGHGSVLIRHP+S+ARDEESEASS SV NK + +EA S+ +T+ H +N S IE PG + + E KR K E + +GNH SP+ ++LN++LN+EEF +++TN E Q+LL LP D LPDS++ MF+SPQFKEN+S FQ+L+ +GVFDLS S ECN TL RL L L+KS WVE+Y+ LK C ++ G S R V S+ S+ KRSR Q IP K+ +KT Y K MD + + FSP++ PPD SLM D + DESSD DLLL++PS+ SFPQAELL PT FG QASTS+S + H++
Sbjct: 1 MGKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGTLANYTPLHARADPDDYEDHRVSRGKSISINKNKDVKLLKRKANYDNGVVGRIAPD------------YYQGYRKVLDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPAQSIVWDTMVPSRKRTCVNRTKPSPVEKLTKDLYTIWHEQQSSCFSGSSEEDLLFESETPMVSVEIGHGSVLIRHPSSIARDEESEASSLSVENKQYSTSEAYSQTVTVPVHNETINSNIQSIVIEKATNPTGPGMQVQQEQLKRDKSHHERAQILGNHNSPLCDVDLNEILNFEEFAQYLTNEEQQQLLKYLPLVDTAKLPDSIRSMFDSPQFKENISFFQQLLVEGVFDLSFSGAKAEECN-TLKRLTLSNLSKSKWVERYHELK--KCKNSTGKSLVGRGLNVVMSSNSIAAKRSRDNVGQKIPEVKVMKSPKRINMKTTYENKEVMDNDGSCFSPRSLFALPPDGGSLMLDSLHYVDESSDQDLLLHVPSNGSFPQAELLHPTSSFGQQASTSNSSRYPHLV 542
Match: A0A7J7GH12 (DEUBAD domain-containing protein {ECO:0000259|PROSITE:PS51916})
HSP 1 Score: 558.525 bits (1438), Expect = 0.000e+0
Identity = 319/548 (58.21%), Positives = 380/548 (69.34%), Query Frame = 0
Query: 6 PCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQS-YFSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQP-GHEKEHELYKRKD-QQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIP-----VSGGKKAVKTNYGLKAPMDKEATFFSPKTRLVQP-PDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLPTQG-FGAQ--ASTSSSIYH 539
P + T TPLWRNGPPEKPVLCNACGSRWRTKG+L NYTPLH+R EPDE E YRV R+K++S+K+KEAKVLKRKQN+ VV RV PD YNQ K LDEDTSNRSSSGSAIS S SC+ GSADASDLTG QS WD+TVPSRKRTC R KPSSVEKL KDL+TIL EQQS +FSGS+EED+L E +TP+VSVEIGHGSVLIRHP+S+ R+E+SEASS SV NKPHP NEA S L H N+ NF S EIE K+P G E KR + Q E L+ GN+ S + ++L D+ N++EF H+TN E ++LL LPS D LPDSLK MF SPQFKEN+S+FQKL+A+GVFDLS + V E TL +L L TKS WVEQYN+LKD S+ GGS A+ S S +KRSR GQ Q+ P + K+ +K ++ K MD + + FSP ++ V P PD++SLM D FQF DESSD DLLL++PS++SFPQAELLLP FG ASTS S H
Sbjct: 49 PSKYVTTTGTPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHARAEPDEFEDYRVSRVKSISIKNKEAKVLKRKQNNDYAVVGRVDPD------------YNQFSQKGLDEDTSNRSSSGSAISNSESCMQFGSADASDLTGPAQSNAWDTTVPSRKRTCFNRPKPSSVEKLAKDLHTILHEQQSSHFSGSSEEDLLYESNTPMVSVEIGHGSVLIRHPSSIGREEDSEASSLSVDNKPHPTNEAYSHLTALPVHIDNKGANFPSLEIERIKKPNGQGMRQEQIKRDNAQHEKLQISGNYNSLLHYVDLKDIFNFQEFASHLTNEEQKQLLKYLPSIDTARLPDSLKSMFGSPQFKENLSSFQKLLAEGVFDLSLSGVKTEDCRTLKKLALCNFTKSKWVEQYNLLKDVKGKSSNGGSVVPGGPNAIVSGSSTSVKRSRDGQFQSFPGSKTTMKSPKRVMKGSHEHKELMDNDGSCFSPGSQFVLPHPDSSSLMLDSFQFMDESSDQDLLLDVPSNNSFPQAELLLPASSRFGGATLASTSRSSAH 584
Match: A0A2R6QJV3 ((GATA transcription factor {ECO:0000313|EMBL:PSS09684.1}))
HSP 1 Score: 557.755 bits (1436), Expect = 0.000e+0
Identity = 318/534 (59.55%), Positives = 378/534 (70.79%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQ--SYFSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQP-GHEKEHELYKR-KDQQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIP-----VSGGKKAVKTNYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLP 523
M K GPC HCGVTSTPLWRNGPPEKPVLCNACGSRWRTKG+L NYTPLH+R EPDE E +RV R+K++S+K+KEAKVLKRKQ+H +V V PD Y+Q + K LDEDTSNRSSSGSAIS S SCV GSADASDLTG Q IVW++TVPSRKRTCVGR KPS VEKLTKDL+TIL EQQ S+FSGS+EED+L E D P+VSVEIGHGSVLIRHP+S+A++EESEASS SV NK H NEA S L H+ + NFS+ I+ +P G + E KR K QQENL+ +GNH SP+ ++L DVL++++F R++T E Q+LL LP D LPDSLK MF SPQFKEN+S+FQKL A+GVFDLS + V E TL RL L LTKS WVEQYN+LKD S+ GG A AS S +KR R GQ N P + K A+K Y K D + ++FSP++ P DN+SLM D FQF ESSD DLLL++PS+SSFP+AELLLP
Sbjct: 1 MGKHGPCYHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHARAEPDEFEDHRVSRVKSISIKNKEAKVLKRKQSHDNGLVGGVDPD------------YSQYFRKALDEDTSNRSSSGSAISNSESCVQFGSADASDLTGPAQPIVWETTVPSRKRTCVGRPKPSPVEKLTKDLHTILHEQQQSSHFSGSSEEDLLFESDRPMVSVEIGHGSVLIRHPSSIAKEEESEASSLSVDNKLHRTNEAYSRLTALPVHSAKKGANFSTPGIDKNNKPIGQGMQQEQIKRDKAQQENLQILGNHNSPLHYVDLKDVLDFDQFARNLTKEEQQQLLKYLPYADTTALPDSLKSMFASPQFKENISSFQKLHAEGVFDLSLSGVKVEDCRTLKRLTLGNLTKSKWVEQYNLLKDVKSKSSNGGFVVPGGPKAFASGNSTNVKRLRDGQYPNFPGVNTTMKSPKGAMKGIYEHKELTDNDGSYFSPRSLFALPTDNSSLMMDSFQFMGESSDQDLLLDVPSNSSFPEAELLLP 522
Match: A0A6P6TRX9 ((GATA transcription factor 26 isoform X2 {ECO:0000313|RefSeq:XP_027081228.1}))
HSP 1 Score: 554.288 bits (1427), Expect = 0.000e+0
Identity = 321/553 (58.05%), Positives = 384/553 (69.44%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YR-VRMKAMSLKHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQSYFSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQP-GHEKEHELYK-RKDQQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIPV-----SGGKKAVKTNYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLPTQGFGAQASTSSSIYHNHVLH 544
M KQGPC HCGVTSTPLWRNGPPEKPVLCNACGSRWRTKG+L NYTPLH+R EPD+LE YR R+K +S+K+KEAKVLKRK NH ++V +PD YN G+ K LDEDTSNRSSSGSAIS S SC GSADASDLTG Q +VWDS VPSRKRTCV RAKPS VEKLTKDLYTIL E YFSGS+EED+L E D P+VSVEIGHGSVLIRHP+S+AR+EESEASS SV NK HP NEA S T HT N+ N + E K+P G E E K KDQ E L+ +G+H SP+ ++L D+LN+ EF +T E Q+LL LPS D G P+SL+ MF+S QF+EN+S+FQKL+A+GVFD + + V E TL + +L LTKS WV QYN+LKD C S++ S A E AVA+ S+ KRSR GQ Q P S + +K +Y + D + + FSP++ P DN+SL+ D F ESSD DLLL++PS+SSFPQAELLLPT FGAQAST SS + +LH
Sbjct: 1 MGKQGPCYHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGTLVNYTPLHARAEPDDLEDYRNSRVKNISVKNKEAKVLKRKPNHEIEV-GAFSPD------------YNHGFRKGLDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPAQPMVWDSMVPSRKRTCVTRAKPSPVEKLTKDLYTILHEHSPYFSGSSEEDLLFESDKPMVSVEIGHGSVLIRHPSSIAREEESEASSLSVDNKLHPVNEAYSRLTTFSVHTDNKGVNLPNPGAEKVKKPTGQGGEQEQIKWDKDQLEKLQLLGHHSSPLCYVDLKDLLNFGEFVSCLTKEEHQQLLKYLPSIDTSGPPESLRNMFDSIQFEENLSSFQKLLAEGVFDNTLSGVKTEDCRTLKKFVLCNLTKSKWVGQYNLLKDVKCRSSISMSEVAGEFDAVATGHSVNAKRSRDGQYQKFPAKTIMKSPKRVMMKASYEHREVTDNDGSCFSPRSLFALPADNSSLVLDSFNTA-ESSDQDLLLDVPSNSSFPQAELLLPTSSFGAQASTCSSSGYPQLLH 539
Match: A0A2C9WD57 ((Uncharacterized protein {ECO:0000313|EMBL:OAY56783.1}))
HSP 1 Score: 553.903 bits (1426), Expect = 0.000e+0
Identity = 316/554 (57.04%), Positives = 384/554 (69.31%), Query Frame = 0
Query: 1 MVKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGSLSNYTPLHSRGEPDELE-YRV-RMKAMSL-KHKEAKVLKRKQNHGVDVVSRVAPDTASQKGVRDALHYNQGYLKTLDEDTSNRSSSGSAISPSGSCVHLGSADASDLTGATQSIVWDSTVPSRKRTCVGRAKPSSVEKLTKDLYTILQEQQSY-FSGSTEEDVLLECDTPVVSVEIGHGSVLIRHPNSVARDEESEASSFSVYNKPHPANEACSEFITLRNHTLNRFNNFSSAEIEGKKQ---PGHEKEHELYKRKDQQENLETVGNHKSPIFSIELNDVLNYEEFRRHITNHELQELLNLLPSTDILGLPDSLKLMFESPQFKENVSAFQKLIADGVFDLSSAEVNNECNSTLTRLLLHTLTKSCWVEQYNILKDTNCGSTVGGSSDAREHAAVASAQSLEIKRSRVGQPQNIPVSGGKKA-----VKTNYGLKAPMDKEATFFSPKTRLVQPPDNTSLMPDCFQFGDESSDHDLLLNIPSHSSFPQAELLLPTQGFGAQASTSSSIYHNHV 542
M KQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKG+L+NYTPLH+R +PD+ E +RV R K++S+ K+K+ K+LKRK N+ VV APD YNQGY K DEDTSNRSSSGSAIS S SC GSADASDLTG QSIVWDS VPSRKRTCV R KPS VEKLTKDLYTI EQQS FSGS+EED+L E +TP+VSVEIGHGSVLIRHP+S+ARDEESEASS SV NK + NEA S +TL H +NR N S IE K PG ++E +L + K Q E + +GNH S + ++LND+LN+EEF R++TN E Q+LL LP D LP+S++ MF+SPQFKEN+S+FQ+L+ +GVFDLS + V E +TL +L L L+KS W+E Y+ LK C +T G S R + S + KRSR Q +P + K+ +K Y K MD + + FSP++ PPD SLM D F + DESSD DLLL++PS+ SFPQAELL PT FG QAS SSS + H+
Sbjct: 1 MGKQGPCCHCGVTSTPLWRNGPPEKPVLCNACGSRWRTKGTLANYTPLHARTDPDDYEDHRVSRGKSISINKNKDVKLLKRKVNYDNGVVDGFAPD------------YNQGYRKVSDEDTSNRSSSGSAISNSESCAQFGSADASDLTGPAQSIVWDSMVPSRKRTCVNRPKPSPVEKLTKDLYTIWHEQQSSCFSGSSEEDLLFESETPMVSVEIGHGSVLIRHPSSIARDEESEASSLSVENKQYSINEAYSHSVTLPVHNVNRNANIPSLVIEKSKNHAGPGMQQE-QLKRDKSQHEKAQILGNHDSLLCGVDLNDILNFEEFARYLTNEEQQQLLKYLPLVDTAKLPESIRSMFDSPQFKENISSFQQLLGEGVFDLSLSGVKTEDCNTLKKLTLSNLSKSKWMELYHQLK--KCRNTTGKSLVGRGPDVIPSNNLITAKRSRDSLGQKVPETKVMKSPKRINMKATYENKEVMDSDGSCFSPRSLFALPPDGGSLMLDTFHYVDESSDQDLLLDVPSNGSFPQAELLRPTSSFGQQASASSSSTYPHL 539
| Match Name | Stats | Description |
|---|---|---|
| A0A5B6ZX47 | E-Value: 0.000e+0, PID: 60.61 | (Uncharacterized protein {ECO:0000313\|EMBL:MPA4913... |
| A0A5J5A688 | E-Value: 0.000e+0, PID: 59.89 | (Uncharacterized protein {ECO:0000313\|EMBL:KAA8526... |
| A0A5B6ZX92 | E-Value: 0.000e+0, PID: 59.89 | (Uncharacterized protein {ECO:0000313\|EMBL:MPA4913... |
| A0A5B7A283 | E-Value: 0.000e+0, PID: 60.93 | (Uncharacterized protein {ECO:0000313\|EMBL:MPA4913... |
| A0A5J4ZUZ5 | E-Value: 0.000e+0, PID: 59.54 | (Uncharacterized protein {ECO:0000313\|EMBL:KAA8522... |
| A0A2C9WJ72 | E-Value: 0.000e+0, PID: 56.91 | (Uncharacterized protein {ECO:0000313\|EMBL:OAY6012... |
| A0A7J7GH12 | E-Value: 0.000e+0, PID: 58.21 | DEUBAD domain-containing protein {ECO:0000259\|PROS... |
| A0A2R6QJV3 | E-Value: 0.000e+0, PID: 59.55 | (GATA transcription factor {ECO:0000313\|EMBL:PSS09... |
| A0A6P6TRX9 | E-Value: 0.000e+0, PID: 58.05 | (GATA transcription factor 26 isoform X2 {ECO:0000... |
| A0A2C9WD57 | E-Value: 0.000e+0, PID: 57.04 | (Uncharacterized protein {ECO:0000313\|EMBL:OAY5678... |
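
The PID column above repeats the identity percentage from each hit's `Identity = a/b (p%)` line, that is, identical residues divided by alignment length. The following is a minimal sketch of how those figures could be re-derived from the report text; the function name `parse_stats` and the dictionary keys are illustrative assumptions, not part of any BLAST or Tripal API.

```python
import re

# Matches the per-HSP summary line of the report. The optional "i" tolerates
# the "Postives" spelling that appears in some renderings of this report.
STATS_RE = re.compile(
    r"Identity = (\d+)/(\d+) \(([\d.]+)%\), "
    r"Posi?tives = (\d+)/(\d+) \(([\d.]+)%\)"
)


def parse_stats(line):
    """Parse one summary line and recompute its percentages (hypothetical helper)."""
    m = STATS_RE.search(line)
    if m is None:
        raise ValueError(f"not an Identity/Positives line: {line!r}")
    ident, aligned, ident_pct, pos, pos_aligned, pos_pct = m.groups()
    return {
        "identical": int(ident),
        "aligned": int(aligned),
        # Percentages recomputed from the raw counts, rounded as in the report.
        "identity_pct": round(100 * int(ident) / int(aligned), 2),
        "positives_pct": round(100 * int(pos) / int(pos_aligned), 2),
        # Percentages exactly as printed, for cross-checking.
        "reported_identity_pct": float(ident_pct),
        "reported_positives_pct": float(pos_pct),
    }


# First hit (A0A5B6ZX47): 337 identical residues over a 556-column alignment.
stats = parse_stats(
    "Identity = 337/556 (60.61%), Positives = 401/556 (72.12%), Query Frame = 0"
)
print(stats["identity_pct"], stats["positives_pct"])
```

Comparing the recomputed and reported values is a cheap sanity check when these lines are scraped from a web page rather than read from native BLAST output.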
| Name | Description |
|---|---|
| | An orange, doubled-haploid, Nantes-type carrot (DH1) was used for genome sequencing. We used BAC end sequences and a newly developed linkage map with 2,075 markers to correct 135 scaffolds with one or more chimeric regions. The resulting v2.0 assembly spans 421.5 Mb and contains 4,907 scaffolds (N50 of 12.7 Mb), accounting for ∼90% of the estimated genome size of 473 Mb. The scaftig N50 of 31.2 kb is similar to those of other high-quality genome assemblies such as potato and pepper. About 86% (362 Mb) of the assembled genome is included in only 60 superscaffolds anchored to the nine pseudomolecules. The longest superscaffold spans 30.2 Mb, 85% of chromosome 4. This assembly has several naming schemes. Phytozome genome ID 388: the authors' sequences and gene predictions were also submitted to Phytozome and can be accessed at https://phytozome-next.jgi.doe.gov/info/Dcarota_v2_0. LNRQ01: these sequences were then assigned GenBank accession numbers from LNRQ01000001.1, which corresponds to DCARv2_Chr1, up to LNRQ01004826.1, which corresponds to an unincorporated contig, DCARv2_C10750146. These reside in bioproject PRJNA268187, which is a subproject of umbrella project PRJNA285926. Assembly GCA_001625215.1: the genome assembly was later assigned accession number GCA_001625215.1 for assembly ASM162521v1, which consists of only the 9 chromosome sequences and the plastid assembly; these have accession numbers CM004278.1 to CM004286.1 for the chromosomes and CM004358.1 for the plastid. The mitochondrial genome was not included because it is classified as an incomplete sequence. RefSeq: the assembly was later added to RefSeq, where another set of identifiers was defined, from NC_030381.1 to NC_030389.1 for the chromosomes and from NW_016089425.1 to NW_016094239.1 for unincorporated scaffolds and contigs. These reside in bioproject PRJNA326436. Note that NCBI substituted different assembled organellar genomes from different genotypes for the RefSeq records. The NCBI Sequence report lists the correspondences between the various naming methods. The LNRQ01000000.1 master record is held at NCBI, and the raw reads used for the genome assembly are available as SRA accessions. This genome is available in the CarrotOmics Blast Search. |
| | The RefSeq genome records for Daucus carota subsp. sativus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results. View the full report at https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Daucus_carota_subsp._sativus/100/. Data from this analysis can be viewed in JBrowse. |
| | This analysis is a blastp search of all of the NCBI Daucus carota subsp. sativus Annotation Release 100 polypeptide sequences against the combined ExPASy SwissProt and TrEMBL databases from Nov. 17, 2021. Before the blast search was run, the database was filtered to remove organisms outside the Viridiplantae and to remove DCAR gene predictions from the DCAR V1.0 Gene Prediction. |
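
The accession ranges quoted above for the nine chromosomes can be expanded into an explicit correspondence table. The sketch below does this in Python under a loudly stated assumption: that both the GenBank (CM) and RefSeq (NC) ranges are consecutive and ordered by chromosome number, which the description implies but does not state outright. The NCBI Sequence report remains the authoritative mapping, and the helper name `accession_range` is hypothetical.

```python
def accession_range(prefix, first, count, width, version=1):
    """Expand a run of consecutive accessions (hypothetical helper).

    prefix  -- letters before the number, e.g. "CM" or "NC_"
    first   -- numeric part of the first accession in the run
    count   -- how many accessions to generate
    width   -- zero-padded width of the numeric part
    """
    return [f"{prefix}{first + i:0{width}d}.{version}" for i in range(count)]


# Chromosome names follow the DCARv2_Chr1 pattern given in the description.
chromosomes = [f"DCARv2_Chr{i}" for i in range(1, 10)]

# ASSUMPTION: accessions are assigned in chromosome order within each range.
genbank = accession_range("CM", 4278, 9, 6)    # CM004278.1 .. CM004286.1
refseq = accession_range("NC_", 30381, 9, 6)   # NC_030381.1 .. NC_030389.1

mapping = {
    name: {"genbank": gb, "refseq": rs}
    for name, gb, rs in zip(chromosomes, genbank, refseq)
}

for name, ids in mapping.items():
    print(name, ids["genbank"], ids["refseq"])
```

If the ordering assumption fails for any chromosome, the generated pairs will be wrong, so the table should be checked against the NCBI Sequence report before use.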
