Logo
Proteus Structure Prediction Server
Comprehensive Secondary Structure Predictions
 
 
 
Proteus prediction (ID=7685641) complete
Summary:
 
  • Time of Submission: 21:26:16 Apr 01, 2006
  • Sequence Name: 1W7D:A|PDBID|CHAIN|SEQUENCE
  • Number of residues read in: 137
  • Number of useable PDB homologs found: 2
    • 1X3BA, e-value = 4.0E-18 SOLUTION STRUCTURE OF THE FAS1 DOMAIN OF HUMAN TRANSFORMING
    • 1NYOA, e-value = 1.0E-17 SOLUTION STRUCTURE OF THE ANTIGENIC TB PROTEIN MPT70/MPB70
  • Number of sequence alignments used for ab-initio predictions: 49
  • Overall confidence value: 81.6%
  • Predicted % Helix content: 34 % (46 residues)
  • Predicted % Beta sheet content: 34 % (46 residues)
  • Predicted % Coil content: 33 % (45 residues)
Graphical Alignment of PDB Homologs:

Conserved Domains
Legend:
  H = Helix
E = Beta Strand
C = Coil
Line 1 = sequence (single letter IUPAC code, 60 characters per line)
Line 2 = secondary structure (H, E or C)
Line 3 = confidence score (0-9, 0 = low, 9 = high)

A '*' character above the overall prediction indicates the homolog's structure was used at this residue.

Predicted Secondary Structure:
 
   1    ********************************************************** 60
      ETGDIVETATGAGSFTTLLTAAEAAGLVDTLKGDGPFTVFAPTDAAFAALPEGTVEDLLK
     
CCCCCCHHHHHCCCCHHHHHHHHHHHHHHHHCCCCEEEEEEECHHHHHHCCCHHHHHHHH
      974778877987777789999989999999998778999999999999998887777775

  61  ********************************************** ************* 120
      PENKEKLTEILTYHVVPGEVMSSDLTEGMTAETVEGGALTVTLEGGPKVNGVSISQPDVD
     
HCCCCHHHHHHHHCEEEEEEECCCCCCCEEEEECCCEEEEEECCCCEEECCEEEECCCCC
      776788899999999998776578888867778777789889998798887799777889

 121  ***************** 137
      ASNGVIHVIDGVLMPGA
     
CCCCEEEEEECCCCCCC
      99999999999999998



    Proteus uses a "Jury of Experts" approach involving predictions from PSIPRED ( Jones, 1999 ), JNET ( Barton et al., 1999 ), TRANSSEC (a locally developed tool), and structural alignment ( XALIGN ). Following is the predicted secondary structure from each component.
Detailed Prediction Information:
 

PSIPRED
   1  ETGDIVETATGAGSFTTLLTAAEAAGLVDTLKGDGPFTVFAPTDAAFAALPEGTVEDLLK 60
     
CCHHHHHHHHHCCCHHHHHHHHHHCCCHHHHCCCCCEEEEEECHHHHHHCCHHHHHHHCC
      966889999978885799999988687668648998899975989997569878888658

  61  PENKEKLTEILTYHVVPGEVMSSDLTEGMTAETVEGGALTVTLEGGPKVNGVSISQPDVD 120
     
CCCHHHHHHHHHHCCCCCEEECCCCCCCCEEEEECCCEEEEEECCCEEEEEEEEEECCCC
      779889999999655676345454457877776579868999889878865676546756

 121  ASNGVIHVIDGVLMPGA 137
     
CCCCEEEEECEEECCCC
      68998998575758999



JNET
   1  ETGDIVETATGAGSFTTLLTAAEAAGLVDTLKGDGPFTVFAPTDAAFAALPEGTVEDLLK 60
     
CCCCHHHHHHHCCCCHHHHHHHHHCCCCCCCCCCCCEEEEECCCHHHHHCCCCHHHHHHC
      861378999855980589999986375210079996288714830442152220565411

  61  PENKEKLTEILTYHVVPGEVMSSDLTEGMTAETVEGGALTVTLEGGPKVNGVSISQPDVD 120
     
CCCHHHHHHHHHHHEECCCCECCCCCCCCEEEEECCCEEEEEECCCCEEEEEEEEEECCC
      577588888886312067100287776761677368716787347616733056650223

 121  ASNGVIHVIDGVLMPGA 137
     
CCCCEEEEEEEEECCCC
      38955888644527999



TRANSSEC
   1  ETGDIVETATGAGSFTTLLTAAEAAGLVDTLKGDGPFTVFAPTDAAFAALPEGTVEDLLK 60
     
CCCCEEEEEEECCCHHHHHHHHHHHCEEEEEECCCEEEEEECCCCHHHHHCCHHHHHHHH
      998578898755764678887776446788657976899877887677644455566655

  61  PENKEKLTEILTYHVVPGEVMSSDLTEGMTAETVEGGALTVTLEGGPKVNGVSISQPDVD 120
     
CCCCCCEEEEEEEEEECCCEEECCCCCCCEEEEECCCEEEEEECCCEEEEEEEEEEECCC
      576644467778888658756554547886899878668998758889864899976444

 121  ASNGVIHVIDGVLMPGA 137
     
CCCCEEEEEEEEECCCC
      56888999888767999



JURY-OF-EXPERTS PREDICTION
   1  ETGDIVETATGAGSFTTLLTAAEAAGLVDTLKGDGPFTVFAPTDAAFAALPEGTVEDLLK 60
     
CCCHHHHHHHHHCCHHHHHHHHHHHCCCCCCCCCCEEEEEEECCHHHHHHHHHHHHHHHC
      974678999984876899999999465445568879999996869999866579999865

  61  PENKEKLTEILTYHVVPGEVMSSDLTEGMTAETVEGGALTVTLEGGPKVNGVSISQPDVD 120
     
CCCHHHHHHHHHHHEECCEEECCCCCCCEEEEEECCCEEEEEECCCEEEEEEEEEEECCC
      776799999999733366576578888869999589889999779868899999985788

 121  ASNGVIHVIDGVLMPGA 137
     
CCCEEEEEEEEEEECCC
      89879999879995899

3D-TO-2D MAPPING FOR TOP-SCORING HOMOLOG:

QUERY ETGDIVETATGAGSFTTLLTAAEAA------GLVDTLKGDGPFTVFAPTDAAFAALPEGT 54
1X3BA ~~GTVMDVLKGDNRFSMLVAAIQSA------GLTETLNREGVYTVFAPTNEAFRALPPRE
      ~~CCHHHHHHHCCCCHHHHHHHHHH~~~~~~CHHHHHHCCCEEEEEEEEHHHHHHCCCHH
1NYOA ~~~DPVAVAASNNPELTTLTAALSGQLNPQVNLVDTLNS-GQYTVFAPTNAAFSKLPAST
      ~~~CCCHHHHCCCCCHHHHHHHHHCCCCCCCCCHHHHCC~CEEEEEEEEHHHHHHCCHHH


  55  VEDLLKPENKEKLTEILTYHVVPGEVMSSDLTEGMTAETVEGGALTVTLEGG-P-KVNGV 112
1X3BA RSRLL--GDAKELANILKYHIGDEILVSGGIGALVRLKSLQGDKLEVSLKNNVV-SVNKE
      HHHHH~~HHHHHHHHHHHHHEEEECCCCCCCCEEEEEECCCCEEEEEEECCCEE~EECCE
1NYOA IDEL--KTNSSLLTSILTYHVVAGQTSPANVVG--TRQTLQGASVTVTGQGN-SLKVGNA
      HHHH~~HCCCCHHHHHHHHCEEEECCCCCCCCC~~EEEECCCEEEEEECCCC~EEEECCE


 113  SISQPDVDASNGVIHVIDGVLMPGA 137
1X3BA PVAEPDIMATNGVVHVITNVLQP~~
      EEECCCCCCCCCEEEEEECCCCC~~
1NYOA DVVCGGVSTANATVYMIDSVLMPPA
      EEEEECCCCCEEEEEEEECCCCCCC




PSI-BLAST ALIGNMENTS - 49 SEQUENCES USED BY JNET, TRANSSEC, and PSIPRED
Blast e-value: 4.66345E-41
1   >gi|23130027|ref|ZP_00111848.1| COG2335: Secreted and surface protein containing fasciclin-like repeats [Nostoc punctiforme]
match: --ADIVDIAVTA---ESFKTLVAAVQAA-----GLVETLK--S-PG--PF-TVFA-PNDDAFA---K-LP-PGTIQ-T-LLQ--NI-P-Q-LT-R--ILKY-HVVPGK-LL-KADLA-----ELG--T-VNS--V-EG-SP-IK--I---HS-LD-G------FEV-K---N--ATVL-AAD-IEADN--G-V-V-HVIDTVI-LPG------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.32331E-40
2   >gi|17231289|ref|NP_487837.1| hypothetical protein [Nostoc sp. PCC 7120] >gnl|BL_ORD_ID|547150 hypothetical protein all3797 [imported] - Nostoc sp. (strain PCC 7120) >gnl|BL_ORD_ID|547150 ORF_ID:all3797~hypothetical protein [Nostoc sp. PCC 7120]
match: DSKNLLELVESN---SSFTTLNKALQAA-----GLTETLK--G-KD--NL-TIFA-PTDAAFA---K-LP-QDALQ-A-LLQPDNK-E-V-LL-K--VLTY-HVVPGN-VL-STDL------KSG--E-VKS--V-EG-GT-IN--V---K--VD-TQ----GVSV-N---D--AKVT-QAD-IKASN--G-V-I-HVIDTVI-LPA------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.66716E-40
3   >gi|23129937|ref|ZP_00111758.1| COG2335: Secreted and surface protein containing fasciclin-like repeats [Nostoc punctiforme]
match: --KNLLALAESN---ASFTTLTKALKAA-----GLTGALQ--G-KD--NL-TIFA-PTDAAFA---K-LP-ADALQ-E-LLNPANK-E-V-LL-K--ILTY-HVVPGK-VL-STDL------KSG--E-VKS--L-EG-GA-IN--V---K--VD-PST---GVTV-N---D--AKVT-QPD-ITASN--G-V-I-HAIDQVI-LPP------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 8.67628E-40
4   >gi|37521719|ref|NP_925096.1| hypothetical protein glr2150 [Gloeobacter violaceus] >gnl|BL_ORD_ID|1094514 glr2150 [Gloeobacter violaceus PCC 7421]
match: --GDIVDTAVKA---GDFKTLVTALQAT-----GLDKTLK--T-KG--PF-TVFA-PTDEAFK---K-LP-PGTLD-A-LLK--DK-A-K-LT-K--ILTY-HVVSGK-VL-SSAL------KPG--S-VKT--V-EG-AP-VK--V---Q--IE-GG----KVEV-N---E--AYVT-KAD-ITADN--G-V-I-HVIDSVL-LPP------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 9.51305E-39
5   >gi|37523621|ref|NP_926998.1| hypothetical protein glr4052 [Gloeobacter violaceus] >gnl|BL_ORD_ID|433265 glr4052 [Gloeobacter violaceus PCC 7421]
match: ----IVDTAVQA---GTFKTLAQALTAA-----DLVDTLK--G-SG--PF-TVFA-PTDDAFQ---S-LP-AGTLN-D-LLKPENK-S-K-LA-N--ILKY-HVVSGK-VM-SSDI------KPG--N-VAT--V-AG-ES-IS--I---Q--TQ-GQ----QVMV-N---E--ARVT-KAD-IAADN--G-V-I-HVIDKVL-LP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.43467E-38
6   >gi|22297764|ref|NP_681011.1| ORF_ID:tll0220~hypothetical protein [Thermosynechococcus elongatus BP-1] >gnl|BL_ORD_ID|826646 ORF_ID:tll0220~hypothetical protein [Thermosynechococcus elongatus BP-1]
match: --ATIVDIAVNTP--G-FSTLVTAVKVA-----NLVEALQ--S-PG--PF-TVFA-PNDDAFA---K-LP-DGTIT-S-LVQ--NP-P-Q-LG-R--ILKY-HVVAGA-YK-ATDL-----KRMGI---VTS--L-EG-ST-IP--I---H-----GDN---PLEVKN------ATVL-AAD-IEAEN--G-I-I-HVIDTVI-LMG------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 4.9897E-38
7   >gi|32474254|ref|NP_867248.1| conserved hypothetical protein-putative fasciclin domain [Pirellula sp. 1] >gnl|BL_ORD_ID|706933 conserved hypothetical protein-putative fasciclin domain [Pirellula sp.]
match: --KNIVETAISA----KFNTLVAAVKAG-----GLVETLS--G-EG--PF-TVFA-PTDEAFD---K-LP-EGTLD-S-LLKPENK-D-Q-LV-A--ILKY-HVVSGK-VP-AKTVV-----TLD--S-AET--L--G-GK-VS--I---E--VK-DGT----VIL-N---D-KVKVV-KTD-VMASN--G-I-I-HVIDSVI-LPPS-----------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.15894E-37
8   >gi|27377585|ref|NP_769114.1| blr2474 [Bradyrhizobium japonicum] >gnl|BL_ORD_ID|443909 bll5191 [Bradyrhizobium japonicum] >gnl|BL_ORD_ID|443909 blr2474 [Bradyrhizobium japonicum USDA 110] >gnl|BL_ORD_ID|443909 bll5191 [Bradyrhizobium japonicum USDA 110]
match: ---DIVDTAVGA---GQFKTLAAALKAA-----DLVATLK--G-PG--PF-TVFA-PTDEAFA---K-LP-AGTVE-N-LLKPENK-A-K-LT-A--ILTY-HVVPGA-VK-AEQVT-----KLD--Q-AKT--V-NG-AM-VK--V---T--TK-GG----KVTI-N---D--ATVV-KAD-IPASN--G-M-I-HVIDKVI-LP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 3.77224E-35
9   >gi|17232756|ref|NP_489304.1| hypothetical protein [Nostoc sp. PCC 7120] >gnl|BL_ORD_ID|346221 hypothetical protein all5264 [imported] - Nostoc sp. (strain PCC 7120) >gnl|BL_ORD_ID|346221 ORF_ID:all5264~hypothetical protein [Nostoc sp. PCC 7120]
match: ENRNLAELANSAANQGQFATLIQAVKAA-----GLTDQLA--A-PG--PY-TVFA-PTDAAFA---A-LP-KNTLN-N-LLQPANK-Q-Q-LV-K--LLAY-HVIPGS-FT-SNQL------KSG--Q-VKT--V-EG-SP-VN--I---NV-DP-TNN---TVTV-N---G--ARVT-QAD-IPASN--G-I-V-HVVDQVI-LPP------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 4.93384E-30
10  >gi|2996636|gb|AAC08449.1| BIGH3 [Homo sapiens]
match: ---------------NLYETLGVVGSTTTQLYTDRTEKLR-PEMEGPGSF-TIFA-PSNEAWA---S-LP-AEVLD-S-LVSNVNI-E-L-L--N--ALRY-HMV-GRRVL-TDEL------KHGMT--LTS--MYQN-SN-IQ--I---HH-YP--NG---IVTV-NC-----ARLL-KADH-HATN--G-V-V-HLIDKVI----------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 7.75516E-29
11  >gi|25026619|ref|NP_736673.1| conserved hypothetical protein [Corynebacterium efficiens YS-314] >gnl|BL_ORD_ID|549188 conserved hypothetical protein [Corynebacterium efficiens YS-314]
match: ---DIVDTAAGA---GSFNTLVTAIQAA-----GLEETLR--G-DG--PF-TVFA-PTDEAFN---A-LP-EGTLD-A-LLADPQG-D---LT-E--ILTY-HVVDGE-VF-AADVL-----EMDGQT-VET--L-QG-GT-FT--V---E--IE-GEN---VVLV-DTAGN-RVNVT-DTD-IEASN--G-V-I-HVVDTVL-SP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.12017E-26
12  >gi|27375618|ref|NP_767147.1| bll0507 [Bradyrhizobium japonicum] >gnl|BL_ORD_ID|55144 bll0507 [Bradyrhizobium japonicum USDA 110]
match: ---NIVQNAVNS---KDHTTLVAAVKAA-----GLVPTLE--S-KG--PF-TVFA-PTNAAFG---K-LP-AGTVD-N-LVKPENK-A-T-LT-K--ILTY-HVVPGK-LE-ASDLT-----DGK--K-LKT--A-EG-EE-LT--V---KK-MD-GKT---WI-V-DAKGG-TSMVT-ISN-VNQSN--G-V-I-HVVDTVL-MPA------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.07322E-25
13  >gi|2257601|dbj|BAA21479.1| RGD-CAP [Gallus gallus]
match: ----IVET------EESLETLRAAVAAS-----DLNSLLE--S-EG--QY-TLLA-PTNEAFE---K-IP-REMLN-R-ILG--DP-E-A-LR-D--LLNH-HIL--KSAMCAEAII------AGL-T-MET--L-EG-TT-LD--V---GC--S-GES----VTL-N---G-RAIIA-NKD-ILATN--G-V-V-HFVNELL-IP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 5.51474E-25
14  >gi|15887818|ref|NP_353499.1| AGR_C_835p [Agrobacterium tumefaciens] >gnl|BL_ORD_ID|536085 hypothetical 18.5K protein sll1483 [imported] - Agrobacterium tumefaciens (strain C58, Cereon) >gnl|BL_ORD_ID|536085 AGR_C_835p [Agrobacterium tumefaciens str. C58 (Cereon)]
match: ENKNIVENAMNS---KDHTTLVAAVKAA-----GLVETLQ--G-KG--PF-TVFA-PTNEAFA---A-LP-KGTVE-N-LLKPENK-A-Q-LT-K--VLTC-HVVEAD-AM-SKTIEKMIKDDKGTHD-VKT--V--G-GC-I---L---KA-KE-SMD---KITLTDEMGG-VAHVT-IAD-VKQSN--G-V-I-HVIDKVL-LP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.02966E-24
15  >gi|20090575|ref|NP_616650.1| hypothetical protein (multi-domain) [Methanosarcina acetivorans str. C2A] >gnl|BL_ORD_ID|710549 hypothetical protein (multi-domain) [Methanosarcina acetivorans str. C2A]
match: ---TIIQAAQN---QTRLTTFVNATKAA-----NLTQALN--E-TG--PF-TVFV-PSNEAFD---Q-LP-AQTRD-Q-LMN--NT-T-L-LR-K--VLSY-HVASGE-YT-REQL-------ASMDA-VDN--I-QG-GT-LD--I---NM-V--GDN----ITIQN------STIE-QI--IIVKN--G-V-I-YIIDKVI-IPP------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 4.39754E-24
16  >gi|34864902|ref|XP_235015.2| similar to stabilin-2 [Rattus norvegicus]
match: -------------------TF-----ASNPQKTNVGQILD-EG--G--PY-TIFV-PSNEALS---N-MK-ADILD--YLLSPEGS-R-K-LL-E--LVRY-HIVAFTQLEVAT-LVS--------TPHIRS--MAN--QI-IK--F---NI-TS-KGQ----I-LAN---N-VA-MD-ETE-VAAKN--G-R-I-YTLTGVL-IPPS-----------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 4.86064E-24
17  >gi|23009052|ref|ZP_00050246.1| COG2335: Secreted and surface protein containing fasciclin-like repeats [Magnetospirillum magnetotacticum]
match: -SKTIVENAVNS---KDHTTLVAAVKAA-----GLVDTLN--G-PG--PF-TVFA-PTNAAFA---K-LP-AGTVE-T-LVQPQNK-A-T-LT-G--ILTY-HVVPGT-YT-AKDLM-----A------------------------------------------------------------------------------------------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.83147E-22
18  >gi|17229311|ref|NP_485859.1| hypothetical protein [Nostoc sp. PCC 7120] >gnl|BL_ORD_ID|20720 hypothetical protein alr1819 [imported] - Nostoc sp. (strain PCC 7120) >gnl|BL_ORD_ID|20720 ORF_ID:alr1819~hypothetical protein [Nostoc sp. PCC 7120]
match: ---NIVALAASS---NSFSTLTTLLRTA-----GLTDILE--Q-PG--PY-TVFA-PTNEAFA---A-LP-AGTLE-Q-LQQPQNR-E-L-LV-R--ILRY-HVVPGQ-LT-ANQL------SSG--Q-LTT--A-SD-AP-VN--V---RV-DT-ANN---QIAV-N---E--ARVV-QAN-IQASN--G-V-I-HAINEVL-IP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.17696E-21
19  >gi|6469702|gb|AAF13400.1| Mpt83 [Mycobacterium tuberculosis]
match: ------------------------------------DTLN--G--G--EY-TVFA-PTNAAFD---K-LP-AATID-Q-L-KTDAK-----LLSS--ILTY-HVIAGQ--ASPSRID-------GTHQ---T--L-QG-AD-LT--V------IG-ARD---DLMVNN------AGLV-CGG-VHTAN--A-T-V-YMIDTVL-MPPA-----------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 5.74836E-21
20  >gi|149926|gb|AAA25355.1| antigen MPB70
match: ----------AASNNPELTTLTAALSGQLNPQVNLVDTLN----SG--QY-TVFA-RTNAAFS---K-LP-ASTID-E-L-KT-NS-S-L-LT-S--ILTY-HVVAGQ--TSPANVV-------GTRQ---T--L-QG-AS-VT--V---TG--Q-GNS----LKVGN------ADVV-CGG-VSTAN--A-T-V-YMIDSVL-MPPA-----------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.56318E-20
21  >gi|7638019|gb|AAF65308.1| putative cell adhesion protein Sym32 [Anthopleura elegantissima]
match: -----------------FSTLLTAVLAA-----KLQGVLA--G-PG--PF-TVFA-PTNEAFA---K-IP-AEKLK-E-ILK--NI-P-L-LT-K--ILKY-HVVSGT-FC-SAGLT-----NGA--T-VPT--L-EG-SD-VT--V---H--IS-GGS----VTV-N-----NAVVV-FVD-IPVTN--G-V-V-HVIDTVL-IP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.19826E-19
22  >gi|23509668|ref|NP_702335.1| hypothetical protein [Plasmodium falciparum 3D7] >gnl|BL_ORD_ID|687453 hypothetical protein [Plasmodium falciparum 3D7]
match: ---TIINLIYSHNELKIFSNL---LNHPT-VGSSLIHEL---SLDG--PY-TAFF-PSNEAMQ-----LINIESFN-K-LYNDENK-----LSEF--VLN--HVT--KEYWLYRDLY-------GSS--YQPWLMYNE-KREAPEKL---RN-L-LNNDL--IVKIEGEFKHCNHSIYLNGSKIIRPNMKCHNGVVHIVDKPI-I--------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.97614E-18
23  >gi|31236019|ref|XP_319340.1| ENSANGP00000012360 [Anopheles gambiae] >gnl|BL_ORD_ID|782536 ENSANGP00000012360 [Anopheles gambiae str. PEST]
match: ----------AILKNNGLFAMAKYLRQS-----GLDTILN-ET--G--PY-TIFV-PTDKAFR--SLLVQ-LGGPE-KAEEKFRNNP--R-LLSG--LLLH-HVIPG-----SFEIASLQDEMTGV-----S--L-AG-TQ-LR--V---NQ-YNMHDSEWNDVKVTT--IN-GAMVVPDKQDIVIPQ--G-IA--HAVDRVM-FP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.97614E-18
24  >gi|24417458|gb|AAN60339.1| unknown [Arabidopsis thaliana]
match: ---NLTAILEKG---GQFTTFIHLLNIT-QVGSQVNIQVN-SSSEG---M-TVFA-PTDNAFQ----NLK-PGTLN-Q--LSPDDQ--VK-L-----IL-Y-HVSP--KYYSMDDLL-------SVSNPVRTQ--ASG-RDNGVYGL---NF-TGQTNQ----INVSTGYVE--TRI---SNSL--RQQRP-LAV-YVVDMVL-LPG------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.04912E-17
25  >gi|21553523|gb|AAM62616.1| arabinogalactan protein-like [Arabidopsis thaliana]
match: --TNITAILEKA---GQFTLFIRLLKST-QASDQINTQLN-SSSSN--GL-TVFA-PTDNAF----NSLK-SGTLN-S--LSDQQK--VQ-L------VQF-HVLP--TLITMPQF-------QTVSNPLRTQ-AGDG-QN-GKFPL---NI-TSSGNQ----VNITTGVVS--ATV---ANSVYSDK---QLAV-YQVDQVL-LPLA-----------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.76042E-16
26  >gi|2129473|pir||S52995 arabinogalactan-like protein - loblolly pine >gnl|BL_ORD_ID|478331 arabinogalactan-like protein
match: NLSGILDKA------GQFNTFLSLLKST-QVGMQLQSQLN-NSQQG---I-TIFA-PSDAAFA---A-LK-PGALN-S--ITDQDK--IA-L------LQY-HALP-SYYTFS-QF-------QTVSNPVRT--MASG-NG-GPFGV---NV-TAFGNS----VNVSTGLVN--TPVN-SA--VYSQS--P-VAV-YQVDKVL-LP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.35745E-16
27  >gi|25143043|ref|NP_740906.1| Beta-Ig-H3/Fasciclin domain containing protein (79.2 kD) (1K355Co) [Caenorhabditis elegans] >gnl|BL_ORD_ID|493813 hypothetical protein F26E4.7 - Caenorhabditis elegans
match: ---------------------------------------------GL--V-TFFA-PWNEAFD---R-IP--EQIE-RRLLR--DR-IW--L-EQ--ILKL-HIVPAKELT-SDEITN--------ETIVNT--VDNM-RQ-LYF-I---KG-EW-PTNN-VTYYVIGGGIK-TA-IM-M-DNVAATN--G-I-V-HYIERVL----------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 4.67285E-16
28  >gi|480007|pir||S36111 osteoblast-specific factor 2 - human >gnl|BL_ORD_ID|1071223 osteoblast specific factor 2 [Homo sapiens]
match: -----------------FSTFLSLLEAA-----DLKELLT--Q-PG--DW-TLFV-PTNDAFK---G-MT-SEEKE-I-LIR--DK-N-A-LQNI--IL-Y-HLTPG--VFIGKGF------EPGVTNILKT--T-QG-SK-IF--L---KE-V---ND---TLLV-NEL---KSK---ESD-IMTTN--G-V-I-HVVDKLL-YPA------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 8.81007E-16
29  >gi|29349574|ref|NP_813077.1| putative lipoprotein [Bacteroides thetaiotaomicron VPI-5482] >gnl|BL_ORD_ID|579256 putative lipoprotein [Bacteroides thetaiotaomicron VPI-5482]
match: ----------------DFSEWVKVLK-----YGDLFNAVN-RAED---AF-TVLA-PTNDAV------LR-FYE-K-KGVTSIEDL-GYE-YART--LVTY-HVINDS--IDREDFV-----KSGELPG-RT--L-SG-DA-LK--V---SF-GN-EGGD-KSVYINKE-----AHVS-EL-AIRTAN--G-R-I-YVLDDVL-SPA------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.13643E-15
30  >gi|3282161|gb|AAC24944.1| BIGH3 [Homo sapiens]
match: --GTVMDVLKGD---NRFSMLVAAIQSA-----GLTETLN--R-EG--VY-TVFA-PTNEAFR---A-LP-PRERS-R-LLGDAK----E-LA-N--ILKY-HI--GDEILVSGGI-------GALVR-LKS--L-QG-DK-LE--V----S-LK--NN---VVSVNKEP------VA-EPD-IMATN--G-V-V-HVITNVL-QPPA-----------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.82122E-14
31  >gi|31209493|ref|XP_313713.1| ENSANGP00000025197 [Anopheles gambiae] >gnl|BL_ORD_ID|916865 ENSANGP00000025197 [Anopheles gambiae str. PEST]
match: ------ETVER-LGAKEF---VSTLKKS-----GLDATLEN-------DV-TLFVVP-DSAYTTFAEQMWENNLVAFDPMARAKRAIDFASMTARDMILA--HTVNGTYFI--EDI--------SNEQLLKT-DLP-GAN--IR--I---NI-YPRG-PMAATVR-DQSHAEHPYRYTANCAPLLKLNRVADRGIVHVVDRVL-VPA------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.6852E-12
32  >gi|15239344|ref|NP_200857.1| fasciclin-like arabinogalactan-protein (FLA12) [Arabidopsis thaliana] >gnl|BL_ORD_ID|1038599 contains similarity to surface protein~gene_id:MUF9.12 [Arabidopsis thaliana]
match: ---NVTKILEKA---GQFTVFIRLLKSTGVAN-QLYGQLN-NSDNG---I-TIFA-PSDSSFT---G-LK-AGTLN-S-L--TDEQ-QVE-L-----I-QF-HVIP--SYVSSSNF-------QTISNPLRT---QAG-DS-ADGHF---P--LNVTTSG-NTVNITSGVTN--TTV---SGNVY-SD--GQLAV-YQVDKVL-LP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.33331E-12
33  >gi|30686588|ref|NP_850253.1| expressed protein [Arabidopsis thaliana] >gnl|BL_ORD_ID|652806 unknown protein [Arabidopsis thaliana] >gnl|BL_ORD_ID|652806 At2g35860 [Arabidopsis thaliana]
match: ------NSVLVALLDSHYTELAELVEKAL-----LLQTLE-EA-VGKHNI-TIFA-PRNDALE--RN-LD-P-LFK-SFLLEPRNL-K-S-L-QS--LLMF-HILP-KRIT-SPQ-------WPSLSHHHRT--L-SN-DH-LH--L---T--VD-VN----TLKVDS------AEII-RPDDVIRPD--G-I-I-HGIERLL-IP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.93198E-12
34  >gi|28573043|ref|NP_788652.1| midline fasciclin CG3359-PF [Drosophila melanogaster] >gnl|BL_ORD_ID|720707 midline fasciclin CG3359-PG [Drosophila melanogaster] >gnl|BL_ORD_ID|720707 midline fasciclin CG3359-PM [Drosophila melanogaster] >gnl|BL_ORD_ID|720707 CG3359-PF [Drosophila melanogaster] >gnl|BL_ORD_ID|720707 CG3359-PG [Drosophila melanogaster] >gnl|BL_ORD_ID|720707 CG3359-PM [Drosophila melanogaster]
match: -TKNLMDIIRER---ADMSIMRTVLEKT-----NLSAMLE-DD----KPV-TIFV-PTDAAFD---K-LE-P-HLR-RAL-----KEGRG-CASN--ILKN-HLLD---LTFCS-LATVPGAKTTAYNLL-------G-EPLL-LNR---TH-RAANQTGPTPIYINN-----LAKII-DAD-IMGTN--G-V-L-HVIDTIL----------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 3.40712E-12
35  >gi|33504507|ref|NP_878282.1| transforming growth factor, beta-induced [Danio rerio] >gnl|BL_ORD_ID|797843 beta ig-h3 [Danio rerio]
match: --GTVMDV-LKA--DNRFSTLVGAIQKA-----GLTELLN--K-KG--TY-TFFA-PTNAAFS---A-LP-SADLN-K-LMRDPK----E-LA-N--ILKY-HI--GDEFLVSGAVTSHT-------R-LKP--L-AG-DK-LE--L---G---T-RNS---TIYV-N-----RVPVV-ESD-LMATN--G-V-V-HAVNVII-KP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.72066E-11
36  >gi|552160|gb|AAA29437.1| bep4
match: --------------VQTVSD-VEAFRRSLISEAKLVS--Q-KFLADADPI-TVLV-PTNKAF----KALP-AGVLE-D--LKKKNRPNSK-TCSN--ISDL-DV---NTVILS-------------RQRIRA--L-QG-DE-ISVTL---NG-------------------------------------------------------------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.63212E-10
37  >gi|1518107|gb|AAB07015.1| transforming growth factor induced protein
match: -SGTVMDVLKGD---NRFSMLVAAIQFR-----RLTETLN--R-EG--AY-TVFA-PTNEAFQ---A-LP-PGELN-K-LLG--NA-K-E-LA-D--ILKY-HV--GEEILVSGGI-------GTLVR-LKS--L-QG-DK-LE--V---SS----KNN---AVSVNKEP------VA-ESD-IMATN--G-V-V-YAITSVL-QPPA-----------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 1.81978E-6
38  >gi|21928113|gb|AAM78144.1| maternal butanol extracted protein 2 [Paracentrotus lividus]
match: -------------------------------------------------F-TLCL-PSNEAVQKWRQELP-NE-------LKPE-----A-L-KQ--LVKA-WIVPG--MIRSSSV--------SHNMMVQS--L-NG----VEMRF---KF-TR----------IRKDHHC-KWCTNRRSDRI-TSS--G-I-I-HVMDKVI-YP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.03742E-6
39  >gi|10440526|dbj|BAB15793.1| FLJ00112 protein [Homo sapiens]
match: ----------------------------------FVKDLV--G-PG--PF-TVFA-PLSAAFD---E--------E-A-RVKDWDK-Y-G-LMPQ--VLRY-HVVACHQLLL-ENL------KL-ISN-ATS--L-QG-EP-I---V---IS-VS--QS---TVYINN-----KAKII-SSD-IISTN--G-I-V-HIIDKLL-SP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 3.68433E-6
40  >gi|24285893|gb|AAG13634.1| hyaluronan receptor for endocytosis HARE precursor [Rattus norvegicus]
match: ------------------------------------------------PV-TVFW-PTDKALE---A-LP-PEQQD-F-LFNQDNK-D-K-LK-S--YLKF-HVIRDSKAL-ASDL-----PRSASW---KT--L-QG-SE-LS--V---RC-GT-GSD----I-------------------------------------------------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 3.77248E-6
41  >gi|21428480|gb|AAM49900.1| LD25556p [Drosophila melanogaster]
match: -----TQFLQENAENGALRKFYEVIMDNGGAVLDDINSLTE--------V-TILA-PSNEAWN--------SSNIN-N-VLRDRNK-----M-RQ--ILNM-HIIK-DRL----NVDKIRQK-------------------------------------------------------------------------------------------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 5.22334E-6
42  >gi|22779437|dbj|BAC15606.1| FELE-1 [Homo sapiens]
match: ------------------------------------EIL---TTAG--PF-TVLV-PSVSSFS--------SRTMN-A-----------S-LAQQ--LCRQ-HIIAGQHIL--ED---------TRTQQTRRWWTLAG-QE-IT--V---TF-NQFTK---YSYKYKDQPQQ-TFNI-YKANNI-AAN--G---VFHVVTGL-----------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 5.5375E-6
43  >gi|7486009|pir||T08462 hypothetical protein F22O6.250 - Arabidopsis thaliana
match: --------------------LVN-LTS-------LATEMGRLVSEGYV-L-TVLA-PNDEAMA---K-L----TTD-Q--LSEPGAPE------Q--IMYY-HIIP--EYQTEESMY------NSVRRFGK---IRYD-SLRFPHKV---EA-QEADGS----VKFGHG--D-GSAYLFDPD-IY-TD--GRISV-QGIDGVL-FP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 9.13551E-6
44  >gi|15235459|ref|NP_193009.1| fasciclin-like arabinogalactan-protein (FLA2) [Arabidopsis thaliana] >gnl|BL_ORD_ID|993583 pollen surface protein homolog T20K18.80 - Arabidopsis thaliana >gnl|BL_ORD_ID|993583 putative pollen surface protein [Arabidopsis thaliana] >gnl|BL_ORD_ID|993583 putative pollen surface protein [Arabidopsis thaliana] >gnl|BL_ORD_ID|993583 AT4g12730/T20K18_80 [Arabidopsis thaliana] >gnl|BL_ORD_ID|993583 At4g12730/T20K18_80 [Arabidopsis thaliana]
match: ----ILTTILEKQGCKAFSDI---LKST-----GADKTFQ-DTVDG--GL-TVFC-PSDSAVG---KFMP-K--FK-S--LSPANKT--A-L-----VL-Y-HGMP---VYQSLQML-----RSG-NGAVNTL-ATEG-NNKFDFTV---Q---NDGED----VTLETDVV--TAKVMGT----LKDQ--EPLIV-YKIDKVL-LP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 9.44555E-6
45  >gi|31200425|ref|XP_309160.1| ENSANGP00000018604 [Anopheles gambiae] >gnl|BL_ORD_ID|722120 ENSANGP00000018604 [Anopheles gambiae str. PEST]
match: ------------------------------------------------PL-TIFA-PTNQAF-------------Q-RYL---NNKTH----------LNY-H-MSSTPLRLS-QL--------G--DTVRS--LNDD-GTPL--YI---TR-RRPLSGMQEDLYVNS------AKIVRERSNMEFKNTRGYTQILHIIDDVL-TP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 6.49969E-5
46  >gi|21410441|gb|AAH31166.1| Stab1 protein [Mus musculus]
match: --------------------------------------LK--G-DG--PF-TVFV-PHADLIS---N-MS-Q---D-E-LARIRAH-R-Q-L-----VFRY-HVV-GCRKLWSQEML-----DQG---YITT--L-SG-HT-LR--V---SE-RE-GS-----IYLND-----FARVV-SSD-LEVVN--G-V-L-HFIDHVL-LP-------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 2.22453E-4
47  >gi|1076222|pir||S51614 Algal-CAM - Volvox carteri >gnl|BL_ORD_ID|102075 Algal-CAM [Volvox carteri]
match: ----IWDFLVK---NNSFPTISLALSTA-----NEVATFNDSSQE----V-TFFL-PTETAFDKLSDALGVARSNR-AGLL-----PYLP-VIKR--ALSY-HVLPTR---IS--LQSVANQSVGGTEYYNTT-LTMGQSSSIGVRV---SP-PSSPPATSPEIFILG--VSSTAKVL-QAD-VAA----GASCI-NVVDTVL----------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 4.46017E-4
48  >gi|35193023|gb|AAH58607.1| Unknown (protein for MGC:67891) [Mus musculus]
match: --KTVGQILASTEVFTRFETILE--------NCGLPSILD-----GPGPF-TVFA-PSNEAVD---S-LR-DGRL--IYLF-TAGL-S-K-L-QE--LVRY-HIYNHGQLTV-EKLI-----SKG--RVL-T--MAN--QV-LT--V---NI-SE-EG----RILLGPEGIP-VRRV----D-VPAAN--G-V-I-HMLEGIL-LPP------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

Blast e-value: 4.83322E-4
49  >gi|321050|pir||A60672 blastula butanol-extractable protein 1 - sea urchin  (Paracentrotus lividus) (fragment)
match: ----------------AFSMIVDLLQ-----QRELSLALQ-VS----DPI-TVLV-PTNKAFK---A-LP-AVVLE-D-L---------K-LKKN----K-----PSSR-TCSNTL---HPYASG--QRIAA--L-QG-DE-IS--V---DH-EN-GK-----VILNKERDI-CTKCV-QAD-IPTTN--G-V-I-HVIDRVL-V--------------------------------------------------
query: ETGDIVETATGA---GSFTTLLTAAEAA-----GLVDTLK--G-DG--PF-TVFA-PTDAAFA---A-LP-EGTVE-D-LLKPENK-E-K-LT-E--ILTY-HVVPGE-VM-SSDLT-----EGM--T-AET--V-EG-GA-LT--V---T--LE-GG----P-KV-N---G--VSIS-QPD-VDASN--G-V-I-HVIDGVL-MPGA-----------------------------------------------

                        

3D Structures:
Click below to view the 3D structure of a PDB homolog related to your protein.
  • 1X3BA (e-value = 4.0E-18) - Matches to residues 3 - 143
  • 1NYOA (e-value = 1.0E-17) - Matches to residues 4 - 144