MSKASIQQQQQQDEECFHDSLDRILSSTNSSCSPSSSEEEEDEHDVAHSP
NFNPNYAQKSKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFS
RSLALDHGEIQLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDTGKCSSN
GNGFCNDSVCCLPEILCIDSSVVVGNDLGCCHGTVVNKHHNRSCSGSVAL
PGGNGSTVSPNKPPLGKHSRVDSTRKSLSGNCISFVVRESGGGERARVME
GEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGR
QLTMEEFEMCVGHSPIVQELMRRQNVEDGNDKNNSDYNGNGDGAKLKKKG
SWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHG
PERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGED
CVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQL
EKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWS
KSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISG
SLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLY
NTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIR
VVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADS
RPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDD
HPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDR
IATWPEEKLLLAAKKRSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRT
FQNFGMPVCI
| Relationships |
|---|
| The polypeptide, XM_017402963.1, derives from mRNA, XM_017402963.1. |

Analysis Date: 2022-01-09
Analysis Name: NCBI peptide blastp to SwissProt and TrEMBL without DCAR
Total hits: 10
Match: A0A7J7HCY2 ((Uncharacterized protein {ECO:0000313|EMBL:KAF5949718.1}))
HSP 1 Score: 1165.98 bits (3015), Expect = 0.000e+0
Identity = 634/937 (67.66%), Postives = 724/937 (77.27%), Query Frame = 0
Query: 1 MSKASIQQQQQ-QDEECFHDSLDRILSSTNSSCSPSSSEEEEDEHDVAHSPNFNPNYAQKSKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFSR---SLALDH---GEI---QLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDTGKCSSNGNGFCNDSVCCLPEILCIDSS--VVVGNDLGCCHGTVVNKH---HNRSCSGSVALPGGNGSTVSPNKPPLGKHSR------VDSTRKSLSGNCISFVVRESGGGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGNDKNNSDYNGNGDG---AKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKK--RSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
M K S+ Q +D +CF+DSLDRI SS++SS S S E+E D ++S + +PNYA +FPMG+ +NYD+WIS+PSSV+ERR +LLR +GL+RD SR SL+ D G + ++ D + Q +++ N ++RSKSD + S + S C P+IL I+S+ V D G VN H ++S +G + L G SPNKPP GK+SR +DST +L+ N S V + G E + DL C+ I V P CTIKNLDTGKEFVVNEVR DG W K+KEV TG+QLTMEEFEMCVGHSPIVQELMRRQNVEDG+ K+N D N G G +K KKKGSWLKSIKNVASS+TG KERRSSD+RDT+SE+GGRRSSSATDDSQ+V+FHGPER+RVRQYGKSCKE+T LYKSQEI AH GSIWTIKFSL+GKYLASAGEDCVIH+W+V ESERKGDLL+DK EDGNL++ + N SPEP S SPN S EKKRRGRSSISRKSVSLDH+ VPETVFALSEKP CSF+GHLDDVLDLSWSKSQHLLSSSMDKTVRLWHL+ KSCLK FSHSDYVTCIQFNPVDD +FISGSLDAKVRIW+I DRQVVDW+DLHEMVTAACYTPDGQ ALVGSYKGSCRLYNTS+NKLQQK QINL NKKKK +KITGFQFAPGS SEV ITSADSRIRV+DGVDLVHKFKGFRNTNSQISASLTAN KYV+CASEDSHVYIWKHE DSRPS+SKGV VT+SYEHFHCQDVS AIPWPGM D WG QD + DH +E S AN+PPTPVE+ N E SPLA+GC+SSP HGTISSATNSYFFDRI ATWPEEKL+LA+ P VSVDF+NG++Q RSAWGMVIVTA LRGEIRTFQNFG+PV I
Sbjct: 1 MIKPSLNQHHNNKDCDCFYDSLDRIFSSSSSSSSSSPPEDERDRDPNSNSISDSPNYAPNPRFPMGLSNNYDIWISEPSSVEERRIQLLRQMGLSRDPILSRQIPSLSADFDAGGGVFGRSVSADHLIRTQQRAQISSSNFNPGVVRSKSDTE---ASPHDQCNSISSSICSPQILSINSTSPPVFKPD-----GPFVNNHTVVKSQSGNGLLGLNAG-----SPNKPPTGKNSRRVEEIRIDSTSSNLNFNSSSSPVLGNSGDR-----ELDDDLGCNGIVEVDGPVCTIKNLDTGKEFVVNEVREDGMWNKLKEVGTGKQLTMEEFEMCVGHSPIVQELMRRQNVEDGH-KDNVDPNAKGSGGIGSKSKKKGSWLKSIKNVASSMTGHKERRSSDERDTSSEKGGRRSSSATDDSQDVTFHGPERVRVRQYGKSCKELTALYKSQEIQAHNGSIWTIKFSLDGKYLASAGEDCVIHVWQVVESERKGDLLIDKPEDGNLNIFFVANGSPEPISLSPNLDSLPEKKRRGRSSISRKSVSLDHIMVPETVFALSEKPICSFQGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLSGKSCLKVFSHSDYVTCIQFNPVDDRYFISGSLDAKVRIWSIPDRQVVDWNDLHEMVTAACYTPDGQGALVGSYKGSCRLYNTSENKLQQKAQINLQNKKKKSHQKKITGFQFAPGSSSEVHITSADSRIRVIDGVDLVHKFKGFRNTNSQISASLTANAKYVLCASEDSHVYIWKHEGDSRPSRSKGVTVTQSYEHFHCQDVSVAIPWPGMCDAWGFQDAQA-------DHLDEVSTANHPPTPVEEFNDDEKSPLASGCSSSPLHGTISSATNSYFFDRISATWPEEKLVLASNNVGSPPRVSVDFTNGISQSRSAWGMVIVTASLRGEIRTFQNFGLPVRI 911
Match: A0A5B7B0D0 ((Putative WD repeat-containing protein 44-like isoform X2 {ECO:0000313|EMBL:MPA61935.1}))
HSP 1 Score: 1161.75 bits (3004), Expect = 0.000e+0
Identity = 639/928 (68.86%), Postives = 718/928 (77.37%), Query Frame = 0
Query: 16 CFHDSLDRILSSTNSSCSPSSSEEEE-DEHDVAHSPNFNPNYAQK-SKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFSR--------------SLALDHGEI--QLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDTGK-CSSNGNGFCN-DSVCCLPEILCIDSSVVVGNDLGCCHGTVVNKHHNRSCSGSVALPGGNGS----TVSPNKPPLGKHSRV------DSTRKSLSGNCISFVVRESGGGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGNDKN-NSDYNGNG-DGAKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKKRSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
CF++SLDRILSST+S S S+ +++E D + + SPN+ PN +FPMGV +NYDVWIS+PSSV+ERR RLLR +GL+RD SR L GE +++D ++ Q + N + RSKSDG T C+S+ + CN +S C P+IL I+S N+ G G VN H+ V L NGS VSPNKPP GK+ R DST LS NC S V + A E + D C+ I VSDP CTIKNLD GKEFVVNEVR DG W K+KEV TGRQLTMEEFEMC+GHSPIVQEL+RRQNVEDGN N +S+ NG+G G+KLKKKGSWLKSI+NVAS++TG KERRSSD+RDT+SE+GGRRSSSATDDSQ+VSFHGPERIRVRQYGKS KE+T LYKSQEI AH GSIW IKFSL+GKYLASAGED VIH+ +V E+ERKGDLL DK EDGNL+LL+L N SPEP S SP + EKKRRGRSSISRKSVS+DHV VPETVFALSEKP CSF+GHLDDVLDLSWSKSQHLLSSSMDKTVRLWHL+SK+CLK FSHSDYVTCIQFNPVDD++FISGSLDAKVRIW+I D QVVDW+DLHEMVTAACYTPDGQ ALVGSYKGSCRLYNTS+NKLQQK+QINL NKKKK H+KITGFQFAPGS SEVLITSADSRIRV+D VDLVHKFKGFRNTNSQISASLTANGKYVVCASEDS+VY+WKHE DSRPS+SKG+ VTRSYEHFHCQDVSAAIPW GM D LD E S AN+PPTPVE+ N ESSPLA+GCTSSP HGTISSATNSYFFDRI ATWPEEKLLLA K RSPH SVDFS+G+NQ RSAWGMVIVTAGLRGEIRTFQNFG+PV I
Sbjct: 13 CFYESLDRILSSTSSCSSSSAEDDDEKDPNSNSDSPNYAPNRPLPIPRFPMGVSNNYDVWISEPSSVEERRMRLLRQMGLSRDPSLSRQKPSHLAASDQGGSGGGLCRGEFGRSVSSDHLNRSQQRGEVSCSESNPGIQRSKSDGATDHHCNSSSHDQCNSNSSVCSPQILSINSISPSVNETG---GPFVNNHN-----VVVKLRSANGSPITNAVSPNKPPSGKNCRRVEEIRNDSTSLHLSVNCNSLPVLGN-----ASSGEFDDDSGCNGIVRVSDPVCTIKNLDNGKEFVVNEVREDGMWNKLKEVGTGRQLTMEEFEMCLGHSPIVQELLRRQNVEDGNKDNVDSNVNGSGGSGSKLKKKGSWLKSIRNVASTMTGHKERRSSDERDTSSEKGGRRSSSATDDSQDVSFHGPERIRVRQYGKSSKELTALYKSQEIQAHNGSIWIIKFSLDGKYLASAGEDRVIHVRQVVEAERKGDLL-DKLEDGNLNLLILANGSPEPISMSPK-DNHPEKKRRGRSSISRKSVSIDHVAVPETVFALSEKPICSFQGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLSSKTCLKIFSHSDYVTCIQFNPVDDSYFISGSLDAKVRIWSIPDHQVVDWNDLHEMVTAACYTPDGQGALVGSYKGSCRLYNTSENKLQQKSQINLQNKKKKSHHKKITGFQFAPGSSSEVLITSADSRIRVIDCVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSYVYVWKHEGDSRPSRSKGITVTRSYEHFHCQDVSAAIPWLGMCD-------------DLD----EVSTANHPPTPVEEMNGSESSPLASGCTSSPLHGTISSATNSYFFDRISATWPEEKLLLATKNRSPHSSVDFSSGMNQSRSAWGMVIVTAGLRGEIRTFQNFGLPVRI 908
Match: A0A2R6QWT5 ((WD repeat-containing protein {ECO:0000313|EMBL:PSS16200.1}))
HSP 1 Score: 1149.42 bits (2972), Expect = 0.000e+0
Identity = 623/925 (67.35%), Postives = 714/925 (77.19%), Query Frame = 0
Query: 1 MSKASIQQQQQQDEECFHDSLDRILSSTNSSCSPSSSE--EEEDEHDVAHSPNFNPNYAQKSKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFSR---SLALDHGEIQLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDTG---KCSSNGNGFCNDSVCCLPEILCIDSSVVVGNDLGCCHGTVVNKHHNRSCSGSVALPGGNGSTVSPNKPPLGKH-SRVDSTRKSLSGNCISFVVRESGGGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGN-DKNNSDYNGNG-DGAKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVE---DANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKKRSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
MSKA + ++ ++ECF++SLDR++SS SSCS SS+E EE D + + SPNF PN + ++YDVWIS+PSSV++RR RLLR +GL+RD SR SL+ HG + D + N C ++RSKSDG +C+S+ + F ++ ++ S N H +VVN +RS +GS PNKPP GK RV+ R + V + GE V DC+ I V+ CTIKNLD GKEFVVNEVR DG W K+KEV TGRQLTMEEFE+CVGHSPIVQELMRRQNVEDGN D +SD +G+G G+KLKKKGSWLKSIK+VAS++T KERRSSD+RDT+SE+GGRRSSSATDDSQ+VSFHGPER+RVRQYGKS KE+T LYKSQEI H GSIWTIKFSL+G+YLASAGEDCVIH+W+V +S+RKGDLL+DK EDGNL+LL +TN SPEP S SPN S EKKRRGRSSISRKSV ++ + VPET+FALSEKPFCSFEGHL+DVLDLSWSKS+HLLSSSMDKTVRLWHL+SKSCLK FSHSDYVTCIQFNPVD+ +FISGSLDAKVRIW+I DR VVDW+DLHEMVTAACYTPDGQ+ALVGSYKGSCRLYNTS+NKLQQK QINL NKKKK H+KITGFQFAP S SEVL+TSADSRIRVVDG DLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVY+WKHE DSRP++SKGV VTRSYEHFHCQDVS AIPWPGM D WG D SGE N L DH +E S AN+PP+PVE D N ESSP+ATGC+SSP HGTISSATNSYFFDRI ATWPEEKL+LAAK RSP VSVDFSN +NQ R AWGMVIVTAGLRGEIRTFQNFG+PV I
Sbjct: 1 MSKA---RDEEDEDECFYESLDRLVSSATSSCSSSSAEDDEERDPNSNSDSPNFAPNN-------QSLWNSYDVWISEPSSVEDRRMRLLRQMGLSRDPVLSRQTPSLSSAHGGRSASADQLSGGN---------CGPGIVRSKSDGGASVHDQCNSDSSVFSPRNLSGNSISQSVNESE--ENSFVNNHSSVVN---SRSDNGS------------PNKPPAGKICRRVEEIRSDSISTSSNSNVNYNSSGELIHVS------DCNGIVGVNGLVCTIKNLDNGKEFVVNEVREDGMWNKLKEVGTGRQLTMEEFEICVGHSPIVQELMRRQNVEDGNKDSVDSDVHGSGGSGSKLKKKGSWLKSIKSVASTMTSYKERRSSDERDTSSEKGGRRSSSATDDSQDVSFHGPERVRVRQYGKSYKELTALYKSQEIQGHNGSIWTIKFSLDGRYLASAGEDCVIHVWQVEKSQRKGDLLMDKLEDGNLNLLFVTNGSPEPTSMSPNLESHPEKKRRGRSSISRKSVCIEQILVPETLFALSEKPFCSFEGHLNDVLDLSWSKSEHLLSSSMDKTVRLWHLSSKSCLKIFSHSDYVTCIQFNPVDEKYFISGSLDAKVRIWSIPDRLVVDWNDLHEMVTAACYTPDGQSALVGSYKGSCRLYNTSENKLQQKDQINLQNKKKKCHHKKITGFQFAPESSSEVLVTSADSRIRVVDGTDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYVWKHEGDSRPNRSKGVTVTRSYEHFHCQDVSVAIPWPGMCDPWGFHDTLSGEQNQLHDHLDEVSTANHPPSPVEEIIDGN--ESSPMATGCSSSPLHGTISSATNSYFFDRISATWPEEKLILAAKNRSPRVSVDFSNALNQNRPAWGMVIVTAGLRGEIRTFQNFGLPVRI 881
Match: A0A5J5A0P4 ((Uncharacterized protein {ECO:0000313|EMBL:KAA8523869.1}))
HSP 1 Score: 1142.87 bits (2955), Expect = 0.000e+0
Identity = 638/950 (67.16%), Postives = 728/950 (76.63%), Query Frame = 0
Query: 1 MSKASIQQQQQQDEECFHDSLDRILSSTNSSCSPSSSEEEEDEHDVAHSPNFN---PNYAQK-----SKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFSR-------------SLALDHGEI--QLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDTGK-CSSNGNGFCN-DSVCCLPEILCIDSSVVVGNDLGCCHGTVVNKHH----NRSCSGSVALPGGNGSTVSPNKPPLGKHSRV------DSTRKSL--SGNCISFVVRESGGGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDG-NDKNNSDYNGNGD-GAKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKKRSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
MSKA ++ ++ECF++SLDRILSST+S S S+ E++DE D PN N PNYA +FPMGV +NYD+WIS+PSSV+ERR RLLR +GL+RD SR S L GE +++D ++ Q D + N +L SDG T C+S+ + CN +S P+IL I+S N+ G G V+ H+ +RS +GS+ +T S NKPP GK+ R DST +L +GN + + SGG E E D C+ I SDP CTIKNLD GKEFVVNEVR DG W K+KEV TGRQLTMEEFE+C+GHSPIVQELMRRQNVEDG ND +S+ NG+G G+KLKKKGSWL+SIKNVAS++TG KERRSSD+RDT+SE+GGRRSSSATDDSQ+VSFH PERIRVRQYGKSCKE+T LYKSQEI AH+GSIWTIKFSL+GKYLA+AGEDCVI +W+V ESERKGDLL+DKSEDGNL+LL+L N SPEP S SPN S LEKKR+GRSSISRKSVS+DHV VPETVFALSEKP CSF+GHLDDVLDLSWSKSQ LLSSSMDKTVRLWHL+SKSCLK FSHSDYVTCIQFNPVDD +FISGSLDAKVRIW+I DRQVVDW+DLHEMVTAACYTPDGQ ALVGSYKGSCRLYNTS+NKLQ K+QINL NKKKK H+KITGFQFAPGS SEVLITSADSRIRV+D VDLVHKFKGFRN NSQISASLTANGKYV+CASEDSHVY+WKHE DSRPSKSKG+ VTRSYEHFH QDVS AIPWPGM C D+ +E S N+PPTPVE+ N ESSPLA+GCTSSPFHGTISSAT+SYFFDRI ATWPEEKL L+ K SP SVDFS G+N RSAWGMVIVTAGLRGEIRT+QNFG+PV I
Sbjct: 1 MSKAG---DEEDEDECFYESLDRILSSTSSCSSSSA--EDDDEKD----PNSNSDSPNYASNRPLPIPRFPMGVSNNYDMWISEPSSVEERRLRLLRQMGLSRDPSLSRQKPFLLSSSEAGGSGGLCRGEFGRSVSSDHLNRSQQRDEVSYNNSNPGILWPDSDGATDHHCNSSSHDQCNFNSSVFSPQILSINSISPSVNETG---GPFVHNHNVVITSRSDNGSLIT-----NTASLNKPPTGKNCRRVEEIRDDSTSSNLHVNGNSLPVLGNASGG-------ELEDDSGCNGIVRASDPVCTIKNLDNGKEFVVNEVREDGMWNKLKEVGTGRQLTMEEFEICLGHSPIVQELMRRQNVEDGTNDNVDSNVNGSGSSGSKLKKKGSWLRSIKNVASTMTGLKERRSSDERDTSSEKGGRRSSSATDDSQDVSFHCPERIRVRQYGKSCKELTALYKSQEIQAHSGSIWTIKFSLDGKYLATAGEDCVIQVWQVVESERKGDLLMDKSEDGNLNLLILANGSPEPTSMSPNLDSHLEKKRKGRSSISRKSVSVDHVVVPETVFALSEKPICSFQGHLDDVLDLSWSKSQQLLSSSMDKTVRLWHLSSKSCLKIFSHSDYVTCIQFNPVDDRYFISGSLDAKVRIWSIPDRQVVDWNDLHEMVTAACYTPDGQGALVGSYKGSCRLYNTSENKLQPKSQINLQNKKKKSHHKKITGFQFAPGSSSEVLITSADSRIRVIDDVDLVHKFKGFRNPNSQISASLTANGKYVICASEDSHVYVWKHEVDSRPSKSKGITVTRSYEHFHSQDVSVAIPWPGM---------C--------DYLDEVSTTNHPPTPVEETNGSESSPLASGCTSSPFHGTISSATDSYFFDRISATWPEEKLPLSTKNPSPRSSVDFSVGINDSRSAWGMVIVTAGLRGEIRTYQNFGLPVRI 909
Match: A0A4S4EU06 ((Uncharacterized protein {ECO:0000313|EMBL:THG20390.1}))
HSP 1 Score: 1139.02 bits (2945), Expect = 0.000e+0
Identity = 630/970 (64.95%), Postives = 716/970 (73.81%), Query Frame = 0
Query: 1 MSKASIQQQQQ-QDEECFHDSLDRILSSTNSSCSPSSSEEEEDEHDVAHSPNFNPNYAQKSKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFSR---SLALDH---GEI---QLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDTGKCSSNGNGFCNDSVCCLPEILCIDSSVVVGNDLGCCHGTVVNKHHNRSCSGSVALPGGNGSTVSPNKPPLGKHSR------VDSTRKSLSGNCISFVVRESGGGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGNDKNNSDYNGNGDG---AKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKS--------------------------------------QHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKK--RSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
M K S+ Q +D +CF+DSLDRI SS++SS S S E+E D ++S + +PNYA +FPMG+ +NYD+WIS+PSSV+ERR +LLR +GL+RD SR SL+ D G + ++ D + Q +++ N ++RSKSD + S + S C P+IL I+S+ + G VN H L G N SPNKPP GK+SR +DST +L+ N S V + G E + D C+ I V P CTIKNLDTGKEFVVNEVR DG W K+KEV TG+QLTMEEFEMCVGHSPIVQELMRRQNVEDG+ K+N D N G G +K KKKGSWLKSIKNVASS+TG KERRSSD+RDT+SE+GGRRSSSATDDSQ+V+FHGPER+RVRQYGKSCKE+T LYKSQEI AH GSIWTIKFSL+GKYLASAGEDCVIH+W+V ESERKGDLL+DK EDGNL++ + N SPEP S SPN S EKKRRGRSSISRKSVSLDH+ VPETVFALSEKP CSF+GHLDDVLDLSWSKS QHLLSSSMDKTVRLWHL+ KSCLK FSHSDYVTCIQFNPVDD +FISGSLDAKVRIW+I DRQVVDW+DLHEMVTAACYTPDGQ ALVGSYKGSCRLYNTS+NKLQQK QINL NKKKK +KITGFQFAPGS SEV ITSADSRIRV+DGVDLVHKFKGFRNTNSQISASLTAN KYV+CASEDSHVYIWKHE DSRPS+SKGV VT+SYEHFHCQDVS AIPWPGM D WG QD + DH +E S AN+PPTPVE+ N E SPLA+GC+SSP HGTISSATNSYFFDRI ATWPEEKL+LA+ P VSVDF+NG +Q RSAWGMVIVTA LRGEIRTFQNFG+PV I
Sbjct: 1 MIKPSLNQHHNNKDCDCFYDSLDRIFSSSSSSSSSSPPEDERDRDPNSNSISDSPNYAPNPRFPMGLSNNYDIWISEPSSVEERRIQLLRQMGLSRDPILSRQIPSLSADFDAGGGVFGRSVSADHLIRTQQRAQISSSNFNPGVVRSKSDTE---ASPHDQCNSISSSICSPQILSINST---SPPVFKPEGPFVNNHSVVKSQNGNGLLGLNAG--SPNKPPTGKNSRRVEEIRIDSTSSNLNFNSSSSPVLGNSGDR-----ELDDDSGCNGIVEVDGPVCTIKNLDTGKEFVVNEVREDGMWNKLKEVGTGKQLTMEEFEMCVGHSPIVQELMRRQNVEDGH-KDNVDPNAKGSGGIGSKSKKKGSWLKSIKNVASSMTGHKERRSSDERDTSSEKGGRRSSSATDDSQDVTFHGPERVRVRQYGKSCKELTALYKSQEIQAHNGSIWTIKFSLDGKYLASAGEDCVIHVWQVVESERKGDLLIDKPEDGNLNIFFVANGSPEPISLSPNLDSLPEKKRRGRSSISRKSVSLDHIMVPETVFALSEKPICSFQGHLDDVLDLSWSKSQMDICILVKQLSFLWITGSGFWLVQYVINASYYQKFYFQHLLSSSMDKTVRLWHLSGKSCLKVFSHSDYVTCIQFNPVDDRYFISGSLDAKVRIWSIPDRQVVDWNDLHEMVTAACYTPDGQGALVGSYKGSCRLYNTSENKLQQKAQINLQNKKKKSHQKKITGFQFAPGSSSEVHITSADSRIRVIDGVDLVHKFKGFRNTNSQISASLTANAKYVLCASEDSHVYIWKHEGDSRPSRSKGVTVTQSYEHFHCQDVSVAIPWPGMCDAWGFQDAQT-------DHLDEVSTANHPPTPVEEFN-DEKSPLASGCSSSPLHGTISSATNSYFFDRISATWPEEKLVLASNNPGSPPRVSVDFTNGTSQSRSAWGMVIVTASLRGEIRTFQNFGLPVRI 948
Match: A0A2I4H4K7 ((WD repeat-containing protein 44-like {ECO:0000313|RefSeq:XP_018851091.1}))
HSP 1 Score: 1127.85 bits (2916), Expect = 0.000e+0
Identity = 604/911 (66.30%), Postives = 698/911 (76.62%), Query Frame = 0
Query: 13 DEECFHDSLDRILSSTNSSCSPSSSEEEEDEHDVAHSPNFNPNYAQKSKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFSR-SLALDHGEIQLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDTGKCSSNGNGFCNDSVCCLPEILCIDSSVVVGND---LGCCHGTVVNKHHNRSCSGSVALPGGNGSTVSPNKPPLGKHS--RVDSTRKSLS-GNCISFVVRESG-GGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGNDKNNSDYNGNG---DGAKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKKR-SPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
D EC+++SLDRI+SS+ S + +S +E D + SPN + KFPMG + YDVWISQPSSV ERRSRLLR +GL+ D SR A D G DI +D S + M+RSKSDG + + C P +L S +V N+ C H +V S G + + SPNKPP+GK S +VD R + + G ++F G GGE +E EG LDC+ +G S+ C IKNLD GKEFVVNE+R DG+W K+KEV TGRQLTMEEFEM VGHSPIVQELMRRQNVE+GN K++SD N NG GAK+KKKGSW KSI++VASSVTG+KERRSSD+RDT+SE+GGRRSSSATDDSQ+VSFHGPER++VRQYGKSCKE+T LYKSQEI AH GSIW+IKFSL+GKYLASAGEDCVIH+W+V ES+RKGDLL+DK +DGN+SL + N SPEP SP+ + EKKRRGRSS+SRKS+SLDHV VPETVF+LSEKP CSF GHLDDVLDLSWSKSQHLLSSSMDKTVRLWHL+S+SCLK FSH+DYVTCIQFNPVDD +FISGSLDAKVRIW+I DRQVVDW+D+HEMVTAACYTPDGQ ALVGSYKGSC LYNTS+NKLQ QINL NKKKK H+KITGFQFAPGS SEVLITSADSRIRVVD VDL+ KFKGFRN NSQISASLT+NGKYVV ASEDS V++WKHEADSRP++SK V VT SYEHFHCQDVSAAIPWPG+GD WGLQD E NGLD++ +E S AN+PPTPVE+ N S A+G ++SP HGTISSATNSYFFDRI ATWPEEKLL+ + R SP VSVDF NGVNQ SAWG+VI+TAGLRGEIRTFQNFG+PV I
Sbjct: 21 DNECYYESLDRIVSSSCSCSNSNSDNDENDTVSFSASPN----HGSVPKFPMGTSNKYDVWISQPSSVSERRSRLLRQMGLSNDPSLSRDKPARDFGF----GGDIGRSVSSDRLTNHSGASTMVRSKSDGSDRQFDVRPTS----TSACAPPVLSTPSVSLVNNNDDSYKCNHAVLVKARSGNS-------DGESRAAASPNKPPIGKSSSKKVDEIRGNFTVGLDVNFDSNAGGSGGE----VEVEG-LDCNGVGRESEKVCLIKNLDNGKEFVVNEIREDGTWNKLKEVGTGRQLTMEEFEMSVGHSPIVQELMRRQNVEEGN-KDSSDSNANGVDGSGAKMKKKGSWFKSIRSVASSVTGQKERRSSDERDTSSEKGGRRSSSATDDSQDVSFHGPERVKVRQYGKSCKELTALYKSQEIQAHNGSIWSIKFSLDGKYLASAGEDCVIHVWQVVESDRKGDLLMDKPDDGNMSLFFVANGSPEPTLLSPSMDNHPEKKRRGRSSMSRKSLSLDHVVVPETVFSLSEKPICSFRGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLSSESCLKIFSHNDYVTCIQFNPVDDRYFISGSLDAKVRIWSIPDRQVVDWNDVHEMVTAACYTPDGQGALVGSYKGSCHLYNTSENKLQLNVQINLQNKKKKSNHKKITGFQFAPGSSSEVLITSADSRIRVVDSVDLIRKFKGFRNQNSQISASLTSNGKYVVSASEDSCVFVWKHEADSRPNRSKCVTVTHSYEHFHCQDVSAAIPWPGLGDSWGLQDNYCREQNGLDNNLDEVSTANHPPTPVEEIN--GGSQSASGFSNSPLHGTISSATNSYFFDRISATWPEEKLLVPTRNRSSPRVSVDFFNGVNQNVSAWGLVIITAGLRGEIRTFQNFGLPVRI 904
Match: A0A2R6QDW6 ((WD repeat-containing protein {ECO:0000313|EMBL:PSS06321.1}))
HSP 1 Score: 1123.61 bits (2905), Expect = 0.000e+0
Identity = 621/928 (66.92%), Postives = 712/928 (76.72%), Query Frame = 0
Query: 1 MSKASIQQQQQQDEECFHDSLDRILSSTNSSCSPSSSEEEEDEHDV--AHSPNFNPNYAQKSKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFSR---SLALDHGEIQLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDT---GKCSSNGNGFCNDSVCCLPEILC---IDSSVVVG--NDLGCCHGTVVNKHHNRSCSGSVALPGGNGSTVSPNKPPLGKH-SRVDSTRKSLSGNCISFVVRESGGGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGN-DKNNSDYNGNGDGA-KLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVED-ANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKKRSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
MSKA ++ ++E F++SLDR++SST SSCS SS+E+++D+ +HSPNF PN + ++YDVWIS+PSSV++RR RLLR +GL+RD SR SL+ HG + D + N C ++RSKSDG +C+S+ + F P IL I SV N H +VVN +RS +GS PNKPP GK RV+ R + V + GE V DC+ + V+ ACTIKNLD GKEFVVNEVR DG W K KEV T RQLTMEEFE+CVGHSPIVQELMRRQNVEDGN D +SD N +G A KLKKKGSWLKSIK+VA ++TG KERRSSD+RDT+SE+GGRRSSSATDDSQ+VSFHGPER+RVRQ GKS KE+T LYKSQEI AH GSIWTIKFSL+G+YLASAGEDCVIH+W+V ES+RKGDLL+DK EDGNL++L + N SPEP S SPN S+LEKKRRGRSSISRKS+SL+ + VPET+FALSEKPFCSFEGHL+DVLDLSWSKSQ LLSSSMDKTVRLW+L+SKSCLK FSHSDYVTCIQFNPVDD +FISGSLDAKVRIW+I DR VVDW+DL+EMVTAACY PDGQ+ALVGSYKGSCRLYNTS+NKLQQK QINL NKKKK H+KITGFQFAPGS SEVL+TSADSRIRVVDG DLVHKFKGFRN NSQISASLTANGKYVVCASEDSHVY+WKHEADSRP++SKGV VTRSYEHFHCQDVS AIPWPGM D WG D SGE N L DH +E AN+PPTPVE+ N ESSP+ATG ++SP HGTIS+A+NSYFFDRI ATWPEEKL+LAAK RSP VSVDFSNG+NQ AWGMVIVTAGLRGEIRTFQNFG+PV I
Sbjct: 1 MSKAG---DEEDEDERFYESLDRLVSSTTSSCSSSSAEDDDDDERDPNSHSPNFAPNNP-------SLWNSYDVWISEPSSVEDRRMRLLRQMGLSRDPVLSRQTPSLSSAHGGRSASADQLCGGN---------CVPGIVRSKSDGGASVHAQCNSDSSVFS-------PRILSGNSISQSVNQSEENSFVNNHSSVVN---SRSDNGS------------PNKPPTGKICRRVEEIRNDSVSTSSNLNVNCNSSGELNHVS------DCNGVVGVNGLACTIKNLDNGKEFVVNEVREDGMWNKFKEVGTERQLTMEEFEICVGHSPIVQELMRRQNVEDGNKDSVDSDVNESGGSASKLKKKGSWLKSIKSVAGTMTGYKERRSSDERDTSSEKGGRRSSSATDDSQDVSFHGPERVRVRQCGKSSKELTALYKSQEIQAHNGSIWTIKFSLDGRYLASAGEDCVIHVWQVVESQRKGDLLMDKQEDGNLNILFVANGSPEPTSLSPNLESRLEKKRRGRSSISRKSMSLEQILVPETLFALSEKPFCSFEGHLNDVLDLSWSKSQ-LLSSSMDKTVRLWNLSSKSCLKIFSHSDYVTCIQFNPVDDKYFISGSLDAKVRIWSIPDRLVVDWNDLNEMVTAACYAPDGQSALVGSYKGSCRLYNTSENKLQQKDQINLQNKKKKFHHKKITGFQFAPGSSSEVLVTSADSRIRVVDGTDLVHKFKGFRNPNSQISASLTANGKYVVCASEDSHVYVWKHEADSRPNRSKGVTVTRSYEHFHCQDVSVAIPWPGMCDSWGFHDTFSGEQNQLHDHLDEVPTANHPPTPVEEIINSNESSPMATGYSNSPLHGTISTASNSYFFDRISATWPEEKLILAAKNRSPRVSVDFSNGLNQNTPAWGMVIVTAGLRGEIRTFQNFGLPVRI 880
Match: A0A5J5AC58 ((Uncharacterized protein {ECO:0000313|EMBL:KAA8526851.1}))
HSP 1 Score: 1123.23 bits (2904), Expect = 0.000e+0
Identity = 586/777 (75.42%), Postives = 647/777 (83.27%), Query Frame = 0
Query: 147 CSSNGNGFCN-DSVCCLPEILCIDSSVVVGNDLGCCHGTVVNKHHN--RSCSGSVALPGGNGSTVSPNKPPLGKHSRV------DSTRKSLSGNCISF-VVRESGGGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGN-DKNNSDYNGNGD-GAKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKKRSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
C+S+ + CN +S +IL I+S N+ G G+ VN H+ +S SGS P N SPNKPP GK+SR DS LS NC S V+ + GGE + D C I SVSDP CTIKNLD GKEFVVNEVR DG W K+KEV TGRQLTMEEFEMC+GHSPIVQEL+RRQNVEDGN D +S+ NG+G G+KLKKKGSWLKSI+NVAS++TG KERRSSD+RDT+SE+GGRRSSSATDDSQ+VSFHGPERIRVRQYGKSCKE+T LYKSQEI AH GSIWTIKFSL+GKYLASAGED VIH+ +V ESERKGDLL+DK EDGNL+LL+ N SPEPNS SPN S EKKRRGRSSISRKSVS+DHV VPETVFALSEKP CSF+GHLDDVLDLSWSKSQHLLSSSMDKTVRLW+L+SKSCLK FSHSDYVTCIQFNPVDD +FISGSLDAKVRIW+I +RQVVDW+DLHEMVTAACYTPDGQ ALVGSYKGSC LYNTS+NKLQQK+QINL NKKKK H+KITGFQFAPGS SEVLITSADSRIRV+D ++LVHKFKGFRNTNSQISASLTANGKYVVCASEDS+VY+WKHE DSRPS+SKG+ VTRSYEHFHCQDVS AIPWPGM D WGLQD CSGE NG+ DH +E S AN+PPTPVE+ N ESSPLA+ C SSP HGTISSATNSYFFDRI ATWPEEKLLLA K RSP SVDFS+G+NQ RSAWGMVIVTAGLRGEIRTFQNFG+PV I
Sbjct: 7 CNSSSHDHCNSNSSVYSSQILSINSISPSVNEAG---GSFVNNHNVDVKSRSGS---PITN--VASPNKPPTGKNSRRVEEIRNDSKSMHLSFNCNSLPVLGNATGGEL------DDDSGCYGIVSVSDPVCTIKNLDNGKEFVVNEVRDDGMWNKLKEVGTGRQLTMEEFEMCLGHSPIVQELLRRQNVEDGNKDNAHSNVNGSGSSGSKLKKKGSWLKSIRNVASTMTGHKERRSSDERDTSSEKGGRRSSSATDDSQDVSFHGPERIRVRQYGKSCKELTALYKSQEIQAHNGSIWTIKFSLDGKYLASAGEDRVIHVRQVVESERKGDLLMDKLEDGNLNLLI-ANGSPEPNSMSPNLDSNPEKKRRGRSSISRKSVSMDHVVVPETVFALSEKPICSFQGHLDDVLDLSWSKSQHLLSSSMDKTVRLWNLSSKSCLKIFSHSDYVTCIQFNPVDDRYFISGSLDAKVRIWSIPERQVVDWNDLHEMVTAACYTPDGQGALVGSYKGSCHLYNTSENKLQQKSQINLQNKKKKSHHKKITGFQFAPGSTSEVLITSADSRIRVIDWIELVHKFKGFRNTNSQISASLTANGKYVVCASEDSYVYVWKHEGDSRPSRSKGITVTRSYEHFHCQDVSVAIPWPGMCDEWGLQDACSGEQNGIGDHLDEVSTANHPPTPVEEINGSESSPLASVCNSSPLHGTISSATNSYFFDRISATWPEEKLLLATKNRSPRSSVDFSSGMNQSRSAWGMVIVTAGLRGEIRTFQNFGLPVRI 768
Match: A0A4S4DWK9 ((Uncharacterized protein {ECO:0000313|EMBL:THG07730.1}))
HSP 1 Score: 1106.28 bits (2860), Expect = 0.000e+0
Identity = 628/995 (63.12%), Postives = 723/995 (72.66%), Query Frame = 0
Query: 1 MSKASIQQQQQQDEECFHDSLDRILSSTNSSCSPSSSEEEEDEHDVAHSPNFNPNYAQK------SKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARDSKFSRSLALDHGEIQ-------LNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDT-------GKCSSNGNGFCNDSVCCLPEILCIDSSVV---VGNDLGCCHGTVVNKHHNRSCSGSVALPGGNGSTV-----SPNKPPLGKHSRV------DSTRKSLSGNCISFVV--RESGGGERARVMEGEGDLDCDLIGSVSDPACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGN-DKNNSDYNGNG-DGAKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHL-------LSSS------------------MDKTVRLWHLT----SKSCLKTFSHSD-----------------YVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKKRSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
MSKA ++ ++ECF++SLDRILSS+NSSCS SS+E+ +D+ S + +PNYA +FPMGV +NYDVWIS+PSSV++RR RLLR +GL+RD SR A D G ++ D ++ Q + N ++RSKSDG T +C+SN S P+IL I+++ + V N++G G VN +NRS V P GNGS V S NKPP K+ R DST + GNC S V SGGGE +GD C+ IG P CTIKNLDTGKEFVVNEVR DG+W K+KEV TGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGN D +S+ NG+G G+K KKKGSWLKSI+NVAS++TG K+RRSSD+RDT+SE+GGRRSSSATDDSQ+VSFHGPER+RVRQYGKSCKE+T LYKSQEI AH+GSIWTIKFSL+GKYLASAGEDCVIH+W+V E+ERKGDLL+DK DGN +LL N SPEP S SPN + EKKRRGRSS SRKSVSLD + VPETVFALSEKP CSF+GHL+DVLDLSWSKSQ +SS+ + K++ HL+ K+ L + +S+ YVTCIQFNPVDD FFISGSLDAKVRIW+I DRQVVDW+DLHEMVTAACYTPDGQ ALVGSYKGSC LYNTS+NKLQQK QINL NKKKK H+KITGFQFAPGS SEVLITSADSRIRV++GVDLVHKFKGFRNTNSQISASLTANG YVV ASEDSHVY+WKHE DS+PS+SKGV VTRSYEHFHCQDVS AIPWPGM D WG QD SGE + L D +E S AN+PPTPVE+ N ES+ LA C +SP HGTISSATNSYFFDRI ATWPEEKLLL AK SP VSVDFSN +NQ RSAWGMVIVTAGLRGEIRTFQNFG+PV I
Sbjct: 1 MSKAG---DEEDEDECFYESLDRILSSSNSSCSSSSAEDYDDDDKDQKSISDSPNYASDRPPLPIPRFPMGVSNNYDVWISEPSSVEDRRMRLLRQMGLSRDPILSRQRASDEGGTSRGDFGRSVSADQLNRSQQRGDISCSDSNPGIVRSKSDGATDHHHRIKSQCNSN-------SPLHSPQILLINNNSISPPVVNEIG---GPFVN--NNRSV---VKSPSGNGSPVINAAASLNKPPTSKNCRRVEDNMNDSTSLNPGGNCNSMSVSGNASGGGEL------DGDSGCNGIG----PVCTIKNLDTGKEFVVNEVREDGTWNKLKEVGTGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGNRDDVDSNANGSGGSGSKFKKKGSWLKSIRNVASTMTGYKDRRSSDERDTSSEKGGRRSSSATDDSQDVSFHGPERVRVRQYGKSCKEITALYKSQEIRAHSGSIWTIKFSLDGKYLASAGEDCVIHVWQVVETERKGDLLIDKPVDGNSNLLFFANGSPEPTSVSPNLENHPEKKRRGRSSTSRKSVSLDQILVPETVFALSEKPICSFQGHLNDVLDLSWSKSQTAPLNCICSISSADQTRSVGSTPPALLDWLHLVKSLHQIHLSRAIRKKNYLAGYHNSETVFVALLHWQLSDELFPYVTCIQFNPVDDRFFISGSLDAKVRIWSIPDRQVVDWNDLHEMVTAACYTPDGQGALVGSYKGSCHLYNTSENKLQQKGQINLQNKKKKSHHKKITGFQFAPGSSSEVLITSADSRIRVIEGVDLVHKFKGFRNTNSQISASLTANGNYVVSASEDSHVYVWKHEGDSQPSRSKGVTVTRSYEHFHCQDVSMAIPWPGMCDSWGFQDRSSGEQSRLGDDVDEVSTANHPPTPVEEINGNESTLLACACPTSPLHGTISSATNSYFFDRISATWPEEKLLLTAKNHSPRVSVDFSNSINQSRSAWGMVIVTAGLRGEIRTFQNFGLPVRI 967
Match: A0A5E4FQ28 ((Full=PREDICTED: WD repeat-containing {ECO:0000313|EMBL:VVA29568.1}))
HSP 1 Score: 1105.51 bits (2858), Expect = 0.000e+0
Identity = 602/932 (64.59%), Postives = 703/932 (75.43%), Query Frame = 0
Query: 1 MSKASIQQQQQQDEECFHDSLDRILSSTNSSCSPSSSEEEEDEHDVAHSPNFNPNYAQKSKFPMGVLHNYDVWISQPSSVQERRSRLLRNLGLARD-------------SKFSRSLALDHGEIQLNNDDIHNDNQNDVRVAASCNAEMLRSKSDGDTGKCSSNGNGFCNDSVCCLPEILCIDSSVVVGNDLGCCHGTVVNKHHNRSCSGSVALPGGNGSTVSPNKPPLGKHSR-VDSTRKSLSGNCISFVVRESGGGERARVMEGEGDLDCDLIGSVSD-----PACTIKNLDTGKEFVVNEVRGDGSWEKVKEVATGRQLTMEEFEMCVGHSPIVQELMRRQNVEDGN-DKNNSDYNG-NGDGAKLKKKGSWLKSIKNVASSVTGRKERRSSDDRDTASERGGRRSSSATDDSQEVSFHGPERIRVRQYGKSCKEVTGLYKSQEILAHTGSIWTIKFSLNGKYLASAGEDCVIHIWKVGESERKGDLLLDKSEDGNLSLLLLTNDSPEPNSGSPNFGSQLEKKRRGRSSISRKSVSLDHVFVPETVFALSEKPFCSFEGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLTSKSCLKTFSHSDYVTCIQFNPVDDNFFISGSLDAKVRIWNISDRQVVDWSDLHEMVTAACYTPDGQAALVGSYKGSCRLYNTSDNKLQQKTQINLHNKKKKGRHRKITGFQFAPGSKSEVLITSADSRIRVVDGVDLVHKFKGFRNTNSQISASLTANGKYVVCASEDSHVYIWKHEADSRPSKSKGVNVTRSYEHFHCQDVSAAIPWPGMGDIWGLQDGCSGELNGLDDHPEEASLANNPPTPVEDANVIESSPLATGCTSSPFHGTISSATNSYFFDRI-ATWPEEKLLLAAKKRSPHVSVDFSNGVNQRRSAWGMVIVTAGLRGEIRTFQNFGMPVCI 910
MSKA + ++ +D ECF++SLDRI+SS+ S + +S + E + NPNYA KFPMG YDVWIS+PSSV ERRS+LL +GL D F RS++ D+ QL++ N ++RSKSDG G +N + S + I C +S + G+ VN++ +++C + G ++ PNKPP GK+SR D R ++V E +LDC+ I V+D CTI+NLD GKEFVVNE+R DG W K+KEV TG+QLTMEEFEM VGHSPIVQELMRRQNVE+G+ D S+ NG NG +KLKK+G W KSIK+VAS++TG ++RRSSD+RDT+SE+GGRRSSSATDDSQ+VSFHGPER+RVRQYGKSCKE+T +YKSQEI AH GSIW+IKFSL+GKYLASAGEDCVIH+WKV ESERKGDLL++KSED N +LL +N SPEP+S SPN + +EKKRRGRSSISRKSVSLDH +PETVFALSEKP SF+GHLDDVLDLSWSKSQHLLSSSMDKTVRLWHL++++CLK FSHSDYVTCIQFNPVDD +FISGSLDAKVRIW+I DRQVVDW+DLHEMVTAACYTPDGQ ALVGSYKGSCRLYNTS+NKLQQK+QINL NKKKK +KITGFQFAPGS SEVLITSADSRIRVVD +DLVHKFKGFRN NSQISA+LTANGKYVV ASEDSHVYIWKHEADSRPS+SK V VTRSYEHFHC DVS AIPWPG+GD WGLQD E NGLD++ +E S AN+PPTPVE AN E S A+GCT+SP HGTISSA+N+YFFDRI ATWPEEKLLLA + RSP VS DF+NG NQ SAWGMVIVTAGLRGEIRTFQNFG+P+ I
Sbjct: 1 MSKA--RGEEDEDTECFYESLDRIVSSSCSCSTSNSGSDTESDP--------NPNYAVP-KFPMGASIKYDVWISEPSSVSERRSKLLSEMGLTGDPVLTRAKPHLGYAGDFGRSVSSDYLISQLSSG------------GGGVNG-IVRSKSDG--GDQCNNACSTSSISSPPILSIRC--ASGEAETEPEPETGSFVNRN-SKNCV--LKSSSGKSNSSPPNKPPSGKNSRRADEIRSD------------------SKVDE---ELDCNGIVKVTDGNGNAQVCTIRNLDNGKEFVVNEIREDGMWNKLKEVGTGKQLTMEEFEMSVGHSPIVQELMRRQNVEEGHKDGLESNANGGNGGVSKLKKRGGWFKSIKSVASTMTGHRDRRSSDERDTSSEKGGRRSSSATDDSQDVSFHGPERVRVRQYGKSCKELTAMYKSQEIQAHNGSIWSIKFSLDGKYLASAGEDCVIHVWKVMESERKGDLLMEKSEDSNFNLLF-SNGSPEPSSVSPNVDNHVEKKRRGRSSISRKSVSLDHYVIPETVFALSEKPISSFQGHLDDVLDLSWSKSQHLLSSSMDKTVRLWHLSTQTCLKIFSHSDYVTCIQFNPVDDRYFISGSLDAKVRIWSIPDRQVVDWNDLHEMVTAACYTPDGQGALVGSYKGSCRLYNTSENKLQQKSQINLQNKKKKSHQKKITGFQFAPGSSSEVLITSADSRIRVVDSIDLVHKFKGFRNANSQISATLTANGKYVVSASEDSHVYIWKHEADSRPSRSKSVTVTRSYEHFHCHDVSVAIPWPGVGDSWGLQD---AEQNGLDNNLDEVSTANHPPTPVEVANGNEGSRSASGCTNSPLHGTISSASNTYFFDRISATWPEEKLLLATRNRSPRVSFDFTNGFNQNMSAWGMVIVTAGLRGEIRTFQNFGLPIRI 876
| Match Name | Stats | Description |
|---|---|---|
| A0A7J7HCY2 | E-Value: 0.000e+0, PID: 67.66 | (Uncharacterized protein {ECO:0000313|EMBL:KAF5949... [more] |
| A0A5B7B0D0 | E-Value: 0.000e+0, PID: 68.86 | (Putative WD repeat-containing protein 44-like iso... [more] |
| A0A2R6QWT5 | E-Value: 0.000e+0, PID: 67.35 | (WD repeat-containing protein {ECO:0000313|EMBL:PS... [more] |
| A0A5J5A0P4 | E-Value: 0.000e+0, PID: 67.16 | (Uncharacterized protein {ECO:0000313|EMBL:KAA8523... [more] |
| A0A4S4EU06 | E-Value: 0.000e+0, PID: 64.95 | (Uncharacterized protein {ECO:0000313|EMBL:THG2039... [more] |
| A0A2I4H4K7 | E-Value: 0.000e+0, PID: 66.30 | (WD repeat-containing protein 44-like {ECO:0000313... [more] |
| A0A2R6QDW6 | E-Value: 0.000e+0, PID: 66.92 | (WD repeat-containing protein {ECO:0000313|EMBL:PS... [more] |
| A0A5J5AC58 | E-Value: 0.000e+0, PID: 75.42 | (Uncharacterized protein {ECO:0000313|EMBL:KAA8526... [more] |
| A0A4S4DWK9 | E-Value: 0.000e+0, PID: 63.12 | (Uncharacterized protein {ECO:0000313|EMBL:THG0773... [more] |
| A0A5E4FQ28 | E-Value: 0.000e+0, PID: 64.59 | (Full=PREDICTED: WD repeat-containing {ECO:0000313... [more] |
| Name | Description |
|---|---|
An orange, doubled-haploid, Nantes-type carrot (DH1) was used for genome sequencing. We used BAC end sequences and a newly developed linkage map with 2,075 markers to correct 135 scaffolds with one or more chimeric regions. The resulting v2.0 assembly spans 421.5 Mb and contains 4,907 scaffolds (N50 of 12.7 Mb), accounting for ∼90% of the estimated genome size of 473 Mb. The scaftig N50 of 31.2 kb is similar to those of other high-quality genome assemblies such as potato and pepper. About 86% (362 Mb) of the assembled genome is included in only 60 superscaffolds anchored to the nine pseudomolecules. The longest superscaffold spans 30.2 Mb, 85% of chromosome 4. There are a few different naming schemes for this assembly. First there is the Phytozome genome ID 388: The authors' sequences and gene predictions were also submitted to Phytozome, and can be accessed at this address: https://phytozome-next.jgi.doe.gov/info/Dcarota_v2_0 LNRQ01: These sequences were then assigned GenBank accession numbers starting at LNRQ01000001.1 which corresponds to DCARv2_Chr1, up to LNRQ01004826.1 which corresponds to an unincorporated contig, DCARv2_C10750146. These reside in bioproject PRJNA268187, which is a subproject of umbrella project PRJNA285926. Assembly GCA_001625215.1: The genome assembly was later defined an accession number GCA_001625215.1 for assembly ASM162521v1 which consists of only the 9 chromosome sequences and the plastid assembly, which have accession numbers from CM004278.1 to CM004286.1 for the chromosomes and CM004358.1 for the plastid. The mitochondrial genome was not included because it is classified as an incomplete sequence. RefSeq: The assembly was then later added to RefSeq, and there another new set of identifiers was defined from NC_030381.1 to NC_030389.1 for the chromosomes, and from NW_016089425.1 to NW_016094239.1 for unincorporated scaffolds and contigs. These reside in bioproject PRJNA326436. Note that NCBI substituted different assembled organellar genomes from different genotypes for the RefSeq records. The NCBI Sequence report lists the correspondences between the various naming methods Link to the LNRQ01000000.1 master record at NCBI Raw Reads: Link to SRA accessions used for the genome assembly This genome is available in the CarrotOmics Blast Search | |
The RefSeq genome records for Daucus carota subsp. sativus were annotated by the NCBI Eukaryotic Genome Annotation Pipeline, an automated pipeline that annotates genes, transcripts and proteins on draft and finished genome assemblies. This report presents statistics on the annotation products, the input data used in the pipeline and intermediate alignment results. View the full report at https://www.ncbi.nlm.nih.gov/genome/annotation_euk/Daucus_carota_subsp._sativus/100/ Data from this analysis can be viewed in JBrowse here. | |
This analysis is a blastp search of all of the NCBI Daucus carota subsp. sativus Annotation Release 100 polypeptide sequences against combined ExPASy SwissProt and TrEMBL databases from Nov. 17, 2021. Prior to performing the blast search, the database was filtered to remove organisms not in the Viridiplantae, and also filtered to remove DCAR gene predictions from DCAR V1.0 Gene Prediction. |
