Example procedure for a single CEL file (CL2001031609AA.CEL from lung cancer (small cels)):


execute num_c_perm_2_cel CL2001031609AA.CEL d > d_CL2001031609AA.txt
execute num_c_perm_2_cel CL2001031609AA.CEL c > c_CL2001031609AA.txt
produce matching columns table from these two files , having perm codes and values in each row using eg scalc and paste its conents into eg a.txt


execute mSearch4_103 -s555 -fa.txt > b.txt
extract lines and values rows having perm code 555
extract values onlu columns from b.txt using eg scalc
produce one values column replacing \t by \n
sort values list
produce unique sorted list n.txt


execute ts_ce_53 using n.txt and Lung_DATASETA_scans_noscale.res
out.txt will have all single or multiple range values matches in _.res file



And the region image of the corresponding aminoacid :
Full=Hepatitis A virus cellular receptor 2; Short=HAVcr-2; AltName: Full=T-cell immunoglobulin and mucin domain-containing protein 3; Short=TIMD-3;
AltName: Full=T-cell membrane protein 3; AltName: Full=TIM-3;











    101 atggcgtcttctatcagaaggggccgaggggcctggacacggctg
        M  A  S  S  I  R  R  G  R  G  A  W  T  R  L 
    146 ctctcgcttctgctcctcgcagcctgggaggtggggagcggccag
        L  S  L  L  L  L  A  A  W  E  V  G  S  G  Q 
    191 ctccgctactccgtccccgaggaggccaaacacggcaccttcgtg
        L  R  Y  S  V  P  E  E  A  K  H  G  T  F  V 
    236 ggccgcatcgcgcaggacctggggctggagctggaggagctggtg
        G  R  I  A  Q  D  L  G  L  E  L  E  E  L  V 
    281 ccgcgcctgttccgggtggcgtccaaaagacacggggaccttctg
        P  R  L  F  R  V  A  S  K  R  H  G  D  L  L 
    326 gaggtaaatctgcagaatggcattttgtttgtgaattctcggatc
        E  V  N  L  Q  N  G  I  L  F  V  N  S  R  I 
    371 gaccgggaggagctgtgcgggcggagcgcggaatgtagcatccac
        D  R  E  E  L  C  G  R  S  A  E  C  S  I  H 
    416 gtggaggtgatcgtggacaggccgctgcaggttttccatgtggaa
        V  E  V  I  V  D  R  P  L  Q  V  F  H  V  E 
    461 gtggaggtgaaggacattaacgacaacccgccaatatttccaatg
        V  E  V  K  D  I  N  D  N  P  P  I  F  P  M 
    506 acagtaaagactatccggtttcccgaatcaaggctgcttgattct
        T  V  K  T  I  R  F  P  E  S  R  L  L  D  S 
    551 cggtttcctctagagggagcatctgatgcagatataggagtaaat
        R  F  P  L  E  G  A  S  D  A  D  I  G  V  N 
    596 gctcttctctcctacaagctcagctccagtgagtttttcttccta
        A  L  L  S  Y  K  L  S  S  S  E  F  F  F  L 
    641 gatatacaggcaaatgatgaactaagcgaatctttgtctctcgtg
        D  I  Q  A  N  D  E  L  S  E  S  L  S  L  V 
    686 ctggggaaatcgctggacagagaggaaactgctgaggttaatttg
        L  G  K  S  L  D  R  E  E  T  A  E  V  N  L 
    731 ttactggtggctactgatgggggcaaacctgagctcacgggcacc
        L  L  V  A  T  D  G  G  K  P  E  L  T  G  T 
    776 gttcaaatacttattaaggtattagatgtaaatgacaatgaacca
        V  Q  I  L  I  K  V  L  D  V  N  D  N  E  P 
    821 acttttgcccaatcagtttacaaagtaaaattgttagagaatacg
        T  F  A  Q  S  V  Y  K  V  K  L  L  E  N  T 
    866 gcaaatgggaccttagtggttaagttaaacgcttctgatgcagat
        A  N  G  T  L  V  V  K  L  N  A  S  D  A  D 
    911 gaaggaccgaacagcgagattgtgtattcactcggtagtgatgtg
        E  G  P  N  S  E  I  V  Y  S  L  G  S  D  V 
    956 tcctccactatacagactaagtttaccatagatcccatctcaggg
        S  S  T  I  Q  T  K  F  T  I  D  P  I  S  G 
   1001 gaaatcagaactaagggaaaattagattatgaagaagcaaagtcc
        E  I  R  T  K  G  K  L  D  Y  E  E  A  K  S 
   1046 tacgagattcaggtcactgcaactgacaaaggaaccccttcaatg
        Y  E  I  Q  V  T  A  T  D  K  G  T  P  S  M 
   1091 tcaggacattgtaaaatttcattaaaacttgtggacatcaatgat
        S  G  H  C  K  I  S  L  K  L  V  D  I  N  D 
   1136 aacacaccagaagtctcaataacgtctctctcacttcccatctca
        N  T  P  E  V  S  I  T  S  L  S  L  P  I  S 
   1181 gagaacgcttccctgggcactgtcattgctctcatcacggtgtcg
        E  N  A  S  L  G  T  V  I  A  L  I  T  V  S 
   1226 gatcgcgactctggtacgaatggacatgtcacctgctccctgacg
        D  R  D  S  G  T  N  G  H  V  T  C  S  L  T 
   1271 ccccacgtccctttcaagctggtgtccaccttcaagaattactac
        P  H  V  P  F  K  L  V  S  T  F  K  N  Y  Y 
   1316 tcgttggtgctggacagcgccctggaccgcgagagcgtgtcagcc
        S  L  V  L  D  S  A  L  D  R  E  S  V  S  A 
   1361 tatgagctggtggtgaccgcacgggacgggggctcgccttcactg
        Y  E  L  V  V  T  A  R  D  G  G  S  P  S  L 
   1406 tgggccaccaccagcgtgtccatcgaggtggccgacgtgaacgac
        W  A  T  T  S  V  S  I  E  V  A  D  V  N  D 
   1451 aacgcgccggcgttcgcacagcctgagtacacagtattcgtgaag
        N  A  P  A  F  A  Q  P  E  Y  T  V  F  V  K 
   1496 gagaacaacccgccgggctgccacatcttcacggtgtcagcgtgg
        E  N  N  P  P  G  C  H  I  F  T  V  S  A  W 
   1541 gatgcggacgcgcaggagaacgcgctggtgtcctactcgctggtg
        D  A  D  A  Q  E  N  A  L  V  S  Y  S  L  V 
   1586 gagcggcgggtgggcgagcgcgcgttgtcgagctacgtttcggtg
        E  R  R  V  G  E  R  A  L  S  S  Y  V  S  V 
   1631 cacgcggagagcggcaaggtgtacgcgctgcagccgctggaccac
        H  A  E  S  G  K  V  Y  A  L  Q  P  L  D  H 
   1676 gaggaagtggagctgctgcagttccaggtgagcgcgcgggatgcg
        E  E  V  E  L  L  Q  F  Q  V  S  A  R  D  A 
   1721 ggcgtgccgcctctgggcagcaacgtgacgctgcaggtgttcgtg
        G  V  P  P  L  G  S  N  V  T  L  Q  V  F  V 
   1766 ctggacgagaacgacaacgcgccggcactgttggcgcctagggct
        L  D  E  N  D  N  A  P  A  L  L  A  P  R  A 
   1811 ggcaccgctgctggcgcagtgagtgagctggtgccgtggtcggtg
        G  T  A  A  G  A  V  S  E  L  V  P  W  S  V 
   1856 ggtgcagggcacgtggtggcgaaggtgcgcgcagtggacgctgac
        G  A  G  H  V  V  A  K  V  R  A  V  D  A  D 
   1901 tcaggctacaacgcgtggctttcgtacgagcttcagctgggtact
        S  G  Y  N  A  W  L  S  Y  E  L  Q  L  G  T 
   1946 ggcagcgctcgcatcccgttccgcgtggggctatacacgggtgag
        G  S  A  R  I  P  F  R  V  G  L  Y  T  G  E 
   1991 atcagcacgacacgtgccctagacgaggctgactcccctcgacac
        I  S  T  T  R  A  L  D  E  A  D  S  P  R  H 
   2036 cgcctactcgtgctggtgaaggaccacggcgaaccagcgttgaca
        R  L  L  V  L  V  K  D  H  G  E  P  A  L  T 
   2081 gccacggccaccgtgttagtgtcgttggtggaaagtggccaggca
        A  T  A  T  V  L  V  S  L  V  E  S  G  Q  A 
   2126 cccaaggcctcgtcgcgggcgtgggtgggcgccgcgggctcagag
        P  K  A  S  S  R  A  W  V  G  A  A  G  S  E 
   2171 gctacgctggtggatgtcaacgtgtacctgatcatcgccatctgc
        A  T  L  V  D  V  N  V  Y  L  I  I  A  I  C 
   2216 gcggtatccagcctgttggtgctcacggtgctgctgtacactgcg
        A  V  S  S  L  L  V  L  T  V  L  L  Y  T  A 
   2261 ctgcggtgctcggtgccacccaccgagggtgcgcgcgcgccagga
        L  R  C  S  V  P  P  T  E  G  A  R  A  P  G 
   2306 aagcccacgctggtgtgctccagcgccgtggggagctggtcttac
        K  P  T  L  V  C  S  S  A  V  G  S  W  S  Y 
   2351 tcgcagcagaggcggcagagggtgtgctctggggaggaccccccc
        S  Q  Q  R  R  Q  R  V  C  S  G  E  D  P  P 
   2396 aagacggacctcatggccttcagccctagcttatctcaaggtcca
        K  T  D  L  M  A  F  S  P  S  L  S  Q  G  P 
   2441 gactccgcagaagagaaacagctctcagaatcagaatacgtagga
        D  S  A  E  E  K  Q  L  S  E  S  E  Y  V  G 
   2486 aaggtgagtcttaaatatggaggatgcagctgcacttga 2524   
        K  V  S  L  K  Y  G  G  C  S  C  T  * 


NP_113683.1 protocadherin alpha-2 isoform 2 precursor [Homo sapiens]
>gb|AAD43741.1|AF152480_1 protocadherin alpha 2 short form protein [Homo sapiens]
>gb|AAC34324.1| KIAA0345-like 12 [Homo sapiens]

EAW61999.1 hCG1982192, isoform CRA_n [Homo sapiens]

AAH03126.1 Protocadherin alpha 2 [Homo sapiens]

NP_061728.1 protocadherin alpha-2 isoform 1 precursor [Homo sapiens]
>sp|Q9Y5H9.1|PCDA2_HUMAN RecName: Full=Protocadherin alpha-2; Short=PCDH-alpha-2; Flags: Precursor
>gb|AAD43704.1| protocadherin alpha 2 [Homo sapiens]

EAW61993.1 hCG1982192, isoform CRA_h [Homo sapiens]








    155 atgagaaatttacatgatcctatagcacactttagggaaacttat
        M  R  N  L  H  D  P  I  A  H  F  R  E  T  Y 
    200 gcaacttgccagttcactttgtttgcagtagtcaaagccctggag
        A  T  C  Q  F  T  L  F  A  V  V  K  A  L  E 
    245 acatcagcatttaactgtgatttgaaattgctgaagtgggccgtt
        T  S  A  F  N  C  D  L  K  L  L  K  W  A  V 
    290 ccactaggctgcatggcctacacagcaagtctaataacagacggc
        P  L  G  C  M  A  Y  T  A  S  L  I  T  D  G 
    335 cctgtttggggggaaatgaccagtctgcagattggctacccaact
        P  V  W  G  E  M  T  S  L  Q  I  G  Y  P  T 
    380 gttgcatcagtaccccattctatcatcaacgggtacaaacgagtc
        V  A  S  V  P  H  S  I  I  N  G  Y  K  R  V 
    425 ctggccttgtctgtggagacggattacaccttcccactcgctgaa
        L  A  L  S  V  E  T  D  Y  T  F  P  L  A  E 
    470 aaggtcaaggccttcttggctgatccatctgcctttgtggctgct
        K  V  K  A  F  L  A  D  P  S  A  F  V  A  A 
    515 gcccctgtggctgctgccaccacagctgctcctgctgctgctgca
        A  P  V  A  A  A  T  T  A  A  P  A  A  A  A 
    560 gccccagctaaggttgaagccaaggaagagtcggaggagtcggac
        A  P  A  K  V  E  A  K  E  E  S  E  E  S  D 
    605 gaggatatgggatttggtctctatggagagatgaaggagcttgca
        E  D  M  G  F  G  L  Y  G  E  M  K  E  L  A 
    650 ggtttgaagtgcagagaagaatgctggctccaaggagaaagaaga
        G  L  K  C  R  E  E  C  W  L  Q  G  E  R  R 
    695 gaggagagctcttcaactagctgcttaaatctccgttgcccctcg
        E  E  S  S  S  T  S  C  L  N  L  R  C  P  S 
    740 gggcacctcgatgccacattaaaggcacatccatcattggacaga
        G  H  L  D  A  T  L  K  A  H  P  S  L  D  R 
    785 tctgcagtggaggcaggcgctggggcatttagtgaactggcctgc
        S  A  V  E  A  G  A  G  A  F  S  E  L  A  C 
    830 caggttctgttttctgctgtagcctctgagagcgtgcgtcatcat
        Q  V  L  F  S  A  V  A  S  E  S  V  R  H  H 
    875 tgtcacctaattccactttatgctaatcaggacagatcatgggaa
        C  H  L  I  P  L  Y  A  N  Q  D  R  S  W  E 
    920 ggcttaattgggaatgacatgaagccattcattctactgcagtgg
        G  L  I  G  N  D  M  K  P  F  I  L  L  Q  W 
    965 aaaactaaactatgctatagtcgaatatacgggttggggatgggg
        K  T  K  L  C  Y  S  R  I  Y  G  L  G  M  G 
   1010 ggtggggtggaatcttggctggcaggaaaggacacaggcctttta
        G  G  V  E  S  W  L  A  G  K  D  T  G  L  L 
   1055 atctcagaaccaaaaatactctgcgctctgatttctccaacttca
        I  S  E  P  K  I  L  C  A  L  I  S  P  T  S 
   1100 tttcactatgaagggcattctgctcctccaagtcggtgtgacata
        F  H  Y  E  G  H  S  A  P  P  S  R  C  D  I 
   1145 tcacagggcactcgcgagtgcttctggaagccctgccgcggaggt
        S  Q  G  T  R  E  C  F  W  K  P  C  R  G  G 
   1190 cgactcttttaccagagacagaaggtcgggaagatgcagcgagta
        R  L  F  Y  Q  R  Q  K  V  G  K  M  Q  R  V 
   1235 cagcgttcttattataactcgcctcctcacaggtttctgggcaga
        Q  R  S  Y  Y  N  S  P  P  H  R  F  L  G  R 
   1280 gcactaacctgccctaacctgttacctggaataatggatgtaaag
        A  L  T  C  P  N  L  L  P  G  I  M  D  V  K 
   1325 gaccggcgacaccgctctttgaccagaggacgctgtggcaaagag
        D  R  R  H  R  S  L  T  R  G  R  C  G  K  E 
   1370 tgtcgctacacaagctcctctctggacagtgaggactgccgcgtg
        C  R  Y  T  S  S  S  L  D  S  E  D  C  R  V 
   1415 cccacacagaaatcctacagctccagtgagactctgaaggcctat
        P  T  Q  K  S  Y  S  S  S  E  T  L  K  A  Y 
   1460 gaccatgacagcaggatgcactatggaaaccgagtcacagacctc
        D  H  D  S  R  M  H  Y  G  N  R  V  T  D  L 
   1505 atccaccgggagtcagatgagtttcccaccctaggaaccaacttc
        I  H  R  E  S  D  E  F  P  T  L  G  T  N  F 
   1550 acccttgccgaactgggcatctgtgagccctccccacaccgaagc
        T  L  A  E  L  G  I  C  E  P  S  P  H  R  S 
   1595 ggctactgctccgacatggggatccttcaccagggctactccctt
        G  Y  C  S  D  M  G  I  L  H  Q  G  Y  S  L 
   1640 agcacagggtctgacgccgactccgacaccgagggagggatgtct
        S  T  G  S  D  A  D  S  D  T  E  G  G  M  S 
   1685 ccagaacacgccatcagactgtggggcagagggataaaatccagg
        P  E  H  A  I  R  L  W  G  R  G  I  K  S  R 
   1730 cgcagttccggcctgtccagtcgtgaaaactcggcccttaccctg
        R  S  S  G  L  S  S  R  E  N  S  A  L  T  L 
   1775 actgactctgacaacgaaaacaaatcagatgatgcctattatgct
        T  D  S  D  N  E  N  K  S  D  D  A  Y  Y  A 
   1820 tttccttcccagctggagaaggcctcagctgtgtcagactgggac
        F  P  S  Q  L  E  K  A  S  A  V  S  D  W  D 
   1865 tgctggtga 1873   
        C  W  * 


CCD17866.1 odz, odd Oz/ten-m homolog 2 (Drosophila) [Homo sapiens]

CCD17867.1 odz, odd Oz/ten-m homolog 2 (Drosophila) [Homo sapiens]

XP_003582533.1 PREDICTED: teneurin-2 isoform 6 [Bos taurus]

NP_001116151.1 teneurin-2 [Homo sapiens]

XP_003268681.1 PREDICTED: teneurin-2 isoform 3 [Nomascus leucogenys]

XP_003582531.1 PREDICTED: teneurin-2 isoform 4 [Bos taurus]

Q9NT68.3 RecName: Full=Teneurin-2; Short=Ten-2; AltName: Full=Protein Odd Oz/ten-m homolog 2; AltName: Full=Tenascin-M2; Short=Ten-m2








   2501 atggagttttcctggggaagcggccaggaatcccggcgtctgctg
        M  E  F  S  W  G  S  G  Q  E  S  R  R  L  L 
   2546 ctcttacttcttctcctcgcagcctgggaggcagggaacggtcag
        L  L  L  L  L  L  A  A  W  E  A  G  N  G  Q 
   2591 ctccactactcggtctccgaggaggccaaacacggcaccttcgtg
        L  H  Y  S  V  S  E  E  A  K  H  G  T  F  V 
   2636 ggccgcatcgcgcaggacctgggactggagctggcggagctggtg
        G  R  I  A  Q  D  L  G  L  E  L  A  E  L  V 
   2681 ccgcgcctgttccgggtggcgtccaagggccgcggaggccttctg
        P  R  L  F  R  V  A  S  K  G  R  G  G  L  L 
   2726 gaggtaaatctgcagaatggcattttgtttgtgaattctcggatc
        E  V  N  L  Q  N  G  I  L  F  V  N  S  R  I 
   2771 gaccgggaggagctgtgccggcggagcgcggagtgcagcatccac
        D  R  E  E  L  C  R  R  S  A  E  C  S  I  H 
   2816 ctggaggtgatcgtagacaggccgctgcaggttttccatgtggac
        L  E  V  I  V  D  R  P  L  Q  V  F  H  V  D 
   2861 gtggaggtgagggacattaacgataacccgccggtgttcccagca
        V  E  V  R  D  I  N  D  N  P  P  V  F  P  A 
   2906 acacaaaagaacctgtccatcgcggaatccaggccgcttgactct
        T  Q  K  N  L  S  I  A  E  S  R  P  L  D  S 
   2951 cggtttccactagagggcgcctcggatgcagatatcggggagaac
        R  F  P  L  E  G  A  S  D  A  D  I  G  E  N 
   2996 gccctgctcacttacagactgagcccaaatgaatacttttctctg
        A  L  L  T  Y  R  L  S  P  N  E  Y  F  S  L 
   3041 gaaaaaccacctgatgacgagctggtaaaaggtcttgggcttata
        E  K  P  P  D  D  E  L  V  K  G  L  G  L  I 
   3086 ttacggaaatctttagacagagaagaagctccggagattttttta
        L  R  K  S  L  D  R  E  E  A  P  E  I  F  L 
   3131 gtgctcacagccactgatggaggcaaacccgagttgactggcacc
        V  L  T  A  T  D  G  G  K  P  E  L  T  G  T 
   3176 gttcagttactcatcacagtactggatgccaatgacaatgcccca
        V  Q  L  L  I  T  V  L  D  A  N  D  N  A  P 
   3221 gcttttgacagaaccatttataaggtgagattactagaaaatgtt
        A  F  D  R  T  I  Y  K  V  R  L  L  E  N  V 
   3266 cctaatggaacattggtaattaaacttaacgcctcagatttagac
        P  N  G  T  L  V  I  K  L  N  A  S  D  L  D 
   3311 gaaggattgaatggggacattgtttattcattctcaaatgatatt
        E  G  L  N  G  D  I  V  Y  S  F  S  N  D  I 
   3356 tcgccaaatgtgaaatccaagtttcacatagatccaattactgga
        S  P  N  V  K  S  K  F  H  I  D  P  I  T  G 
   3401 caaattattgtaaagggatatattgactttgaagaaagcaaatcc
        Q  I  I  V  K  G  Y  I  D  F  E  E  S  K  S 
   3446 tatgaaattattgtagagggcattgataagggacagctcccactt
        Y  E  I  I  V  E  G  I  D  K  G  Q  L  P  L 
   3491 tctggccattgtagagttattgtggaagtagaagacaacaacgat
        S  G  H  C  R  V  I  V  E  V  E  D  N  N  D 
   3536 aatgtcccagatttggaattcaagtctttatcacttccaattaga
        N  V  P  D  L  E  F  K  S  L  S  L  P  I  R 
   3581 gaggacgctccactgggtacagtcatcgccctgatcagcgtgtcc
        E  D  A  P  L  G  T  V  I  A  L  I  S  V  S 
   3626 gacaaagacatgggtgtcaatgggctggtcacctgctccttgacg
        D  K  D  M  G  V  N  G  L  V  T  C  S  L  T 
   3671 tcccacgtccccttcaagctggtgtccaccttcaagaattactac
        S  H  V  P  F  K  L  V  S  T  F  K  N  Y  Y 
   3716 tcgttggtgctggacagtgccctggaccgcgagagcgtgtcagcc
        S  L  V  L  D  S  A  L  D  R  E  S  V  S  A 
   3761 tatgagctggtggtgaccgcgcgagacgggggctcgccttcgctg
        Y  E  L  V  V  T  A  R  D  G  G  S  P  S  L 
   3806 tgggccacggccagtgtttctgtggaggtggctgatgtgaacgac
        W  A  T  A  S  V  S  V  E  V  A  D  V  N  D 
   3851 aacgctccggcgttcgcgcagcccgagtacacagtgttcgtgaag
        N  A  P  A  F  A  Q  P  E  Y  T  V  F  V  K 
   3896 gagaacaacccgccgggctgccacatcttcactgtgtctgcgtgg
        E  N  N  P  P  G  C  H  I  F  T  V  S  A  W 
   3941 gacgcggacgcgcaggagaacgcgctggtgtcctactcgctggta
        D  A  D  A  Q  E  N  A  L  V  S  Y  S  L  V 
   3986 gagcggcgggtaggggagcgcgcgctgtcgagctacgtttcggtg
        E  R  R  V  G  E  R  A  L  S  S  Y  V  S  V 
   4031 catgcggagagcggcaaggtgtacgcgctgcagccgctggaccac
        H  A  E  S  G  K  V  Y  A  L  Q  P  L  D  H 
   4076 gaggagctagagctgctgcagtttcaggtgaccgctcgcgatgcc
        E  E  L  E  L  L  Q  F  Q  V  T  A  R  D  A 
   4121 ggcgtgccacctctgggcagcaacgtgacgctgcaggtgttcgtg
        G  V  P  P  L  G  S  N  V  T  L  Q  V  F  V 
   4166 ctggacgaaaacgacaacgcgccagcactgctagcgcctcgggcg
        L  D  E  N  D  N  A  P  A  L  L  A  P  R  A 
   4211 ggtggcactggtggcgcagtgagcgagctggtgccatggtcggtg
        G  G  T  G  G  A  V  S  E  L  V  P  W  S  V 
   4256 ggtgtgggccacgtggtggcaaaggtgcgcgcggtggatgctgac
        G  V  G  H  V  V  A  K  V  R  A  V  D  A  D 
   4301 tcgggctacaacgcgtggctttcgtacgagctgcagccggggact
        S  G  Y  N  A  W  L  S  Y  E  L  Q  P  G  T 
   4346 ggtggcgcgcgcatcccgttccgcgtggggctgtacactggcgag
        G  G  A  R  I  P  F  R  V  G  L  Y  T  G  E 
   4391 atcagcacaacgcgtgccctggacgaaacggacgctccgcgccac
        I  S  T  T  R  A  L  D  E  T  D  A  P  R  H 
   4436 cgcctactggtactggtgaaggaccacggcgagcccgcgctgacg
        R  L  L  V  L  V  K  D  H  G  E  P  A  L  T 
   4481 gccacggccactgtgctggtgtcacttgtggagagtggacaggcg
        A  T  A  T  V  L  V  S  L  V  E  S  G  Q  A 
   4526 ccaaaggcctcctcacgggcgttggtgggcgctgtgggtcccgat
        P  K  A  S  S  R  A  L  V  G  A  V  G  P  D 
   4571 gctgcgctggtggatgtcaacgtatacctgatcattgccatctgc
        A  A  L  V  D  V  N  V  Y  L  I  I  A  I  C 
   4616 gcggtgtccagccttttggtgctcacgctgctgctgtacaccgcg
        A  V  S  S  L  L  V  L  T  L  L  L  Y  T  A 
   4661 ctgcggtgctctgcgctgcccaccgagggcgcgtgcgctccgggc
        L  R  C  S  A  L  P  T  E  G  A  C  A  P  G 
   4706 aagcccacgctggtgtgctccagtgcggtggggagctggtcatac
        K  P  T  L  V  C  S  S  A  V  G  S  W  S  Y 
   4751 tcgcagcagaggaggccgagggtgtgctctggtgagggcccaccc
        S  Q  Q  R  R  P  R  V  C  S  G  E  G  P  P 
   4796 aagaccgacctcatggccttcagccccagtttacctgactctagg
        K  T  D  L  M  A  F  S  P  S  L  P  D  S  R 
   4841 gacagagaagatcagctgcagacaactgaggaatcctttgcaaag
        D  R  E  D  Q  L  Q  T  T  E  E  S  F  A  K 
   4886 gtggaaataaaaccagaggtatttgacatggtgtttaccccggag
        V  E  I  K  P  E  V  F  D  M  V  F  T  P  E 
   4931 gatagattgggaaagcaatgtctgctcctcccgcttctgctcctc
        D  R  L  G  K  Q  C  L  L  L  P  L  L  L  L 
   4976 gcagcctggaaggtggggagcggccagctccactactccgtaccc
        A  A  W  K  V  G  S  G  Q  L  H  Y  S  V  P 
   5021 gaggaggccaaacacggcaccttcgtgggccggatcgcgcaggac
        E  E  A  K  H  G  T  F  V  G  R  I  A  Q  D 
   5066 ctggggctggagctggcggagctggtgccgcgcctgttcaggatg
        L  G  L  E  L  A  E  L  V  P  R  L  F  R  M 
   5111 gcctccaaagaccgcgaggaccttctggaggtaaatctgcagaat
        A  S  K  D  R  E  D  L  L  E  V  N  L  Q  N 
   5156 ggcattttgtttgtgaattctcggatcgaccgcgaggagctgtgc
        G  I  L  F  V  N  S  R  I  D  R  E  E  L  C 
   5201 gggcggagcgcggagtgcagcatccacctggaggtgatcgtggac
        G  R  S  A  E  C  S  I  H  L  E  V  I  V  D 
   5246 aggccgctgcaggttttccatgtggacgtggaggtgagggacatt
        R  P  L  Q  V  F  H  V  D  V  E  V  R  D  I 
   5291 aacgacaacccgcccttgttcccggtagaggaacaaagagtgctg
        N  D  N  P  P  L  F  P  V  E  E  Q  R  V  L 
   5336 atttacgaatctaggctgccagattctgtgtttccactggagggc
        I  Y  E  S  R  L  P  D  S  V  F  P  L  E  G 
   5381 gcgtccgatgcagatgttggctcaaattccatcttaacctataaa
        A  S  D  A  D  V  G  S  N  S  I  L  T  Y  K 
   5426 ctcagttctagcgaatacttcgggctagatgtgaaaataaacagt
        L  S  S  S  E  Y  F  G  L  D  V  K  I  N  S 
   5471 gatgacaataaacaaattgggctcttattaaagaaatccttggac
        D  D  N  K  Q  I  G  L  L  L  K  K  S  L  D 
   5516 agagaggaagctcctgcacacaacttattcctgacagccacagat
        R  E  E  A  P  A  H  N  L  F  L  T  A  T  D 
   5561 gggggcaaacctgagctcacaggcactgttcagctgctggtcaca
        G  G  K  P  E  L  T  G  T  V  Q  L  L  V  T 
   5606 gtgctggatgtgaatgataatgctcccactttcgaacagtctgaa
        V  L  D  V  N  D  N  A  P  T  F  E  Q  S  E 
   5651 tacgaagtaagaatattcgaaaatgcagacaacggaacaacagtt
        Y  E  V  R  I  F  E  N  A  D  N  G  T  T  V 
   5696 atcagactgaatgcttctgatcgggatgaaggagcgaatggggca
        I  R  L  N  A  S  D  R  D  E  G  A  N  G  A 
   5741 atttcatattcttttaatagccttgttgcagccatggttattgac
        I  S  Y  S  F  N  S  L  V  A  A  M  V  I  D 
   5786 cactttagcatagatcgaaatacgggagaaatagtgattcggggt
        H  F  S  I  D  R  N  T  G  E  I  V  I  R  G 
   5831 aatttggattttgaacaagaaaacttatacaaaatcctcattgac
        N  L  D  F  E  Q  E  N  L  Y  K  I  L  I  D 
   5876 gccacggacaaaggccatcctcccatggcgggtcattgcaccgtt
        A  T  D  K  G  H  P  P  M  A  G  H  C  T  V 
   5921 ttagtgagaattttggataaaaatgataacgtccctgagatagca
        L  V  R  I  L  D  K  N  D  N  V  P  E  I  A 
   5966 ctgacttccttatccttgcctgtacgtgaagacgctcaatttggt
        L  T  S  L  S  L  P  V  R  E  D  A  Q  F  G 
   6011 actgtcatcgccctaattagcgtgaacgacctcgattcaggtgcc
        T  V  I  A  L  I  S  V  N  D  L  D  S  G  A 
   6056 aacgggcaggtgaactgctcgctgacgcctcacgtccctttcaag
        N  G  Q  V  N  C  S  L  T  P  H  V  P  F  K 
   6101 ctggtgtccaccttcaagaattactactcgttggtgctggacagt
        L  V  S  T  F  K  N  Y  Y  S  L  V  L  D  S 
   6146 gccctggaccgcgagagcgtgtcggcctatgagttggtggtaacc
        A  L  D  R  E  S  V  S  A  Y  E  L  V  V  T 
   6191 gcgcgggacgggggctcgccttcgctgtgggccaccgccagcttg
        A  R  D  G  G  S  P  S  L  W  A  T  A  S  L 
   6236 tctgtggaggtggccgacatgaatgacaatgctccggcgttcgcg
        S  V  E  V  A  D  M  N  D  N  A  P  A  F  A 
   6281 cagcccgagtacacagtgttcgtgaaggagaacaacccgccgggc
        Q  P  E  Y  T  V  F  V  K  E  N  N  P  P  G 
   6326 tgccacatcttcacggtgtctgcgcgagacgcggacgcgcaggag
        C  H  I  F  T  V  S  A  R  D  A  D  A  Q  E 
   6371 aacgcgctggtgtcctactcgctggtggagcggcgggtgggcgag
        N  A  L  V  S  Y  S  L  V  E  R  R  V  G  E 
   6416 cgcgcgttgtcgagctacatttcggtgcacgcggagagcggcaag
        R  A  L  S  S  Y  I  S  V  H  A  E  S  G  K 
   6461 gtgtacgcgctgcagccgctggaccacgaggagctagagctgctg
        V  Y  A  L  Q  P  L  D  H  E  E  L  E  L  L 
   6506 cagtttcaggtgagcgcgcgcgacgcgggcgtgccgcctctgggc
        Q  F  Q  V  S  A  R  D  A  G  V  P  P  L  G 
   6551 agcaacgtgacgctgcaggtgttcgtgctggacgagaacgacaac
        S  N  V  T  L  Q  V  F  V  L  D  E  N  D  N 
   6596 gcgccggcgctgctggcgcctcgggtgggtggtactggtggtgca
        A  P  A  L  L  A  P  R  V  G  G  T  G  G  A 
   6641 gtgagcgagctggtgccgcggtcactgggtgcaggccaagtggtg
        V  S  E  L  V  P  R  S  L  G  A  G  Q  V  V 
   6686 gcgaaggtgcgcgcagttgacgccgactcaggctacaacgcgtgg
        A  K  V  R  A  V  D  A  D  S  G  Y  N  A  W 
   6731 ctttcgtatgagctgcagcccccggcaagcagcgctcgcttcccg
        L  S  Y  E  L  Q  P  P  A  S  S  A  R  F  P 
   6776 tttcgcgtggggctgtacacgggcgagatcagcaccactcgtgtc
        F  R  V  G  L  Y  T  G  E  I  S  T  T  R  V 
   6821 ctggacgaagcggactctccgcgccaccggctgctggtgctggtg
        L  D  E  A  D  S  P  R  H  R  L  L  V  L  V 
   6866 aaagaccacggtgagccggcgctgacagcgacggccacggttctg
        K  D  H  G  E  P  A  L  T  A  T  A  T  V  L 
   6911 gtgtcgctggtggagagtggccaggctccaaaggcgtcatcacgg
        V  S  L  V  E  S  G  Q  A  P  K  A  S  S  R 
   6956 gcgtcggtgggcgccgcgggcccagaggcggcgctggtggatgtc
        A  S  V  G  A  A  G  P  E  A  A  L  V  D  V 
   7001 aacgtgtacctgatcatcgccatctgcgcggtatccagcctgctg
        N  V  Y  L  I  I  A  I  C  A  V  S  S  L  L 
   7046 gtcctcacgctactgctgtacacagcgctgcggtgctcggcgcca
        V  L  T  L  L  L  Y  T  A  L  R  C  S  A  P 
   7091 cccaccgagggcgcgtgcacggcggacaagcccacgctggtgtgc
        P  T  E  G  A  C  T  A  D  K  P  T  L  V  C 
   7136 tccagcgcagtggggagctggtcgtactcgcagcagaggcggcag
        S  S  A  V  G  S  W  S  Y  S  Q  Q  R  R  Q 
   7181 agggtgtgctccggggagggcccacccaagatggatctcatggcc
        R  V  C  S  G  E  G  P  P  K  M  D  L  M  A 
   7226 tttagccccagcctttcaccttgtcctattatgatgggtaaggcg
        F  S  P  S  L  S  P  C  P  I  M  M  G  K  A 
   7271 gagaatcaggatttaaatgaagatcatgatgccaaagtggtgtgc
        E  N  Q  D  L  N  E  D  H  D  A  K  V  V  C 
   7316 ccgaatggatacgacccagggggccgacatctactgctgtttatt
        P  N  G  Y  D  P  G  G  R  H  L  L  L  F  I 
   7361 ataattctagcagcttgggaggcagggagaggccagctccactac
        I  I  L  A  A  W  E  A  G  R  G  Q  L  H  Y 
   7406 tcggtccccgaggaggctaaacatggcaacttcgtgggccgcatc
        S  V  P  E  E  A  K  H  G  N  F  V  G  R  I 
   7451 gcgcaggacctggggctggagctggcggagctggtgccgcgcctg
        A  Q  D  L  G  L  E  L  A  E  L  V  P  R  L 
   7496 ttccgggcggtgtgcaaattccgtggggatcttctggaggtaaat
        F  R  A  V  C  K  F  R  G  D  L  L  E  V  N 
   7541 ctgcagaatggcattttgtttgtgaattctcggatcgaccgcgag
        L  Q  N  G  I  L  F  V  N  S  R  I  D  R  E 
   7586 gagctgtgcgggcggagcgcggagtgcagcatccacctggaggtg
        E  L  C  G  R  S  A  E  C  S  I  H  L  E  V 
   7631 atcgtggaaaggccgctgcaggttttccatgtggacgtggaggtg
        I  V  E  R  P  L  Q  V  F  H  V  D  V  E  V 
   7676 aaggacattaacgacaaccctccggtgttcccagcgacacaaagg
        K  D  I  N  D  N  P  P  V  F  P  A  T  Q  R 
   7721 aatctgttcatcgcggaatccaggccgcttgactctcggtttcca
        N  L  F  I  A  E  S  R  P  L  D  S  R  F  P 
   7766 ctagagggcgcgtccgatgcagatatcggggagaacgccctgctc
        L  E  G  A  S  D  A  D  I  G  E  N  A  L  L 
   7811 acttacagactgagccccaatgagtatttcttcctggacgtgcca
        T  Y  R  L  S  P  N  E  Y  F  F  L  D  V  P 
   7856 accagcaaccagcaggtaaaacctcttggacttgtattacggaaa
        T  S  N  Q  Q  V  K  P  L  G  L  V  L  R  K 
   7901 cttttagacagagaagaaactccggagcttcatttattgctcacg
        L  L  D  R  E  E  T  P  E  L  H  L  L  L  T 
   7946 gccaccgatggaggcaaacccgagctgactggcaccgttcaatta
        A  T  D  G  G  K  P  E  L  T  G  T  V  Q  L 
   7991 ctcatcacggtactggacaacaatgacaatgccccagtgttcgac
        L  I  T  V  L  D  N  N  D  N  A  P  V  F  D 
   8036 agaaccctgtatacggtgaaattaccagaaaacgtttctatcgga
        R  T  L  Y  T  V  K  L  P  E  N  V  S  I  G 
   8081 acgctggtgattcaccccaatgcctcagatttagacgaaggcttg
        T  L  V  I  H  P  N  A  S  D  L  D  E  G  L 
   8126 aatggggatattatttactccttctccagtgatgtttctccagat
        N  G  D  I  I  Y  S  F  S  S  D  V  S  P  D 
   8171 ataaaatccaagttccacatggaccccttaagtggggcaatcaca
        I  K  S  K  F  H  M  D  P  L  S  G  A  I  T 
   8216 gtgataggacatatggattttgaagaaagtagagcacacaagatc
        V  I  G  H  M  D  F  E  E  S  R  A  H  K  I 
   8261 ccagtcgaggctgtcgataaaggcttcccacccctggctggtcat
        P  V  E  A  V  D  K  G  F  P  P  L  A  G  H 
   8306 tgtacagttcttgtggaagttgtggatgtaaatgacaatgctcca
        C  T  V  L  V  E  V  V  D  V  N  D  N  A  P 
   8351 cagttgactctcacttccctgtctctccctattccagaggacgcc
        Q  L  T  L  T  S  L  S  L  P  I  P  E  D  A 
   8396 caaccaggtaccgtcatcacattgattagcgtgtttgaccgagat
        Q  P  G  T  V  I  T  L  I  S  V  F  D  R  D 
   8441 tttggagtcaacggacaggttacctgctccctgacgccccgcgtt
        F  G  V  N  G  Q  V  T  C  S  L  T  P  R  V 
   8486 cccttcaagttggtgtccaccttcaagaattactattcattggtg
        P  F  K  L  V  S  T  F  K  N  Y  Y  S  L  V 
   8531 ctggacagcgctctggaccgcgagagtgtgtccgcctatgagctg
        L  D  S  A  L  D  R  E  S  V  S  A  Y  E  L 
   8576 gtggttaccgcgcgggacgggggctcgccttctctgtgggccact
        V  V  T  A  R  D  G  G  S  P  S  L  W  A  T 
   8621 gctagcgtgtccgtggaggtggccgacgtgaacgacaacgccccg
        A  S  V  S  V  E  V  A  D  V  N  D  N  A  P 
   8666 gcgttcgcgcagcccgagtatacggtgttcgtgaaggagaacaac
        A  F  A  Q  P  E  Y  T  V  F  V  K  E  N  N 
   8711 ccgccgggctgccacatcttcactgtgtcggcgggggacgcggac
        P  P  G  C  H  I  F  T  V  S  A  G  D  A  D 
   8756 gcgcagaagaacgcgctggtgtcctactcgctggtggagctgcgg
        A  Q  K  N  A  L  V  S  Y  S  L  V  E  L  R 
   8801 gtgggcgagcgcgcgctgtcgagctacgtgtcagtgcacgcggag
        V  G  E  R  A  L  S  S  Y  V  S  V  H  A  E 
   8846 agcggcaaggtgtacgcgctgcagccgttggaccacgaggagctg
        S  G  K  V  Y  A  L  Q  P  L  D  H  E  E  L 
   8891 gagctgttgcagttccaggtgagcgcgcgcgatgcgggcgtgccg
        E  L  L  Q  F  Q  V  S  A  R  D  A  G  V  P 
   8936 cctctgggcagcaacgtgacgctgcaggtgttcgtgctggacgag
        P  L  G  S  N  V  T  L  Q  V  F  V  L  D  E 
   8981 aacgacaacgcgccggcactgctggcgcctcgggtgggtggcact
        N  D  N  A  P  A  L  L  A  P  R  V  G  G  T 
   9026 ggtggcgcagtgagagagcttgtgccgcggtctgtgggcgcgggc
        G  G  A  V  R  E  L  V  P  R  S  V  G  A  G 
   9071 catgtggtggcgaaggtacgtgcagttgacgctgactcaggctac
        H  V  V  A  K  V  R  A  V  D  A  D  S  G  Y 
   9116 aacgcgtggctttcgtatgagttgcaaccggtggcggccggtgcg
        N  A  W  L  S  Y  E  L  Q  P  V  A  A  G  A 
   9161 agcatcccgttccgcgtggggctgtacactggtgagatcagcacg
        S  I  P  F  R  V  G  L  Y  T  G  E  I  S  T 
   9206 acacgagccctagatgagacggacgcaccgcgccaccgccttctg
        T  R  A  L  D  E  T  D  A  P  R  H  R  L  L 
   9251 gtgcttgtgaaggaccacggggagccctcgctgacagccacagcc
        V  L  V  K  D  H  G  E  P  S  L  T  A  T  A 
   9296 accgtgctggtgtcgctggtggaaagcggccaggcaccaaaggcg
        T  V  L  V  S  L  V  E  S  G  Q  A  P  K  A 
   9341 tcgtcgcgggcatcgttgggcattgcaggcccagagaccgagctg
        S  S  R  A  S  L  G  I  A  G  P  E  T  E  L 
   9386 gtggatgtcaacgtgtacctgatcatcgccatctgcgcggtgtcc
        V  D  V  N  V  Y  L  I  I  A  I  C  A  V  S 
   9431 agtctgttggtgcttaccctgctgctgtacacggcgttgcggtgc
        S  L  L  V  L  T  L  L  L  Y  T  A  L  R  C 
   9476 tcagcgccgtcctctgagggcgcatgtagtttggtaaagcccact
        S  A  P  S  S  E  G  A  C  S  L  V  K  P  T 
   9521 ctggtgtgctccagcgcggtggggagctggtcattctcccagcag
        L  V  C  S  S  A  V  G  S  W  S  F  S  Q  Q 
   9566 aggcggcagagggtgtgctctggggagggcccacccaagacagac
        R  R  Q  R  V  C  S  G  E  G  P  P  K  T  D 
   9611 ctcatggccttcagtcccagccttcctcagggtccatcctctaca
        L  M  A  F  S  P  S  L  P  Q  G  P  S  S  T 
   9656 gacaatgtggattatcactggcgaggagagctgggatcctggcga
        D  N  V  D  Y  H  W  R  G  E  L  G  S  W  R 
   9701 ctactactcttgcttctgctcctcgcagcctggaaggtggggagc
        L  L  L  L  L  L  L  L  A  A  W  K  V  G  S 
   9746 ggccagctccactactccgtccccgaggaggccaaacacggcacc
        G  Q  L  H  Y  S  V  P  E  E  A  K  H  G  T 
   9791 ttcgtgggccggatcgcgcaggacctggggctggagctggcggag
        F  V  G  R  I  A  Q  D  L  G  L  E  L  A  E 
   9836 ctggtgccgcgcctgttccgggtggcgtccaaaagacaccgggac
        L  V  P  R  L  F  R  V  A  S  K  R  H  R  D 
   9881 cttctggaggtaagtctgcagaatggcattttgtttgtgaattct
        L  L  E  V  S  L  Q  N  G  I  L  F  V  N  S 
   9926 cggatcgaccgcgaggagctgtgcgggcggagcgcggagtgcagc
        R  I  D  R  E  E  L  C  G  R  S  A  E  C  S 
   9971 atccacctggaggtgatcgtggacaggccgctgcaggttttccat
        I  H  L  E  V  I  V  D  R  P  L  Q  V  F  H 
  10016 gtggacgtggaggtgaaggatgttaatgacaacccgccagtgttc
        V  D  V  E  V  K  D  V  N  D  N  P  P  V  F 
  10061 cgggtaaaagaccaaaagctgtttgtttcagaatccagaatgcca
        R  V  K  D  Q  K  L  F  V  S  E  S  R  M  P 
  10106 gactctcggtttccgctagagggcgcgtccgatgcagatgttgga
        D  S  R  F  P  L  E  G  A  S  D  A  D  V  G 
  10151 gctaactccgtgttaacctacaggcttagctctcatgattacttc
        A  N  S  V  L  T  Y  R  L  S  S  H  D  Y  F 
  10196 atgctagatgtgaattcaaagaacgatgagaataaactggttgag
        M  L  D  V  N  S  K  N  D  E  N  K  L  V  E 
  10241 ctcgtattaagaaaatccttggacagagaggacgctcctgcgcac
        L  V  L  R  K  S  L  D  R  E  D  A  P  A  H 
  10286 cacttattcctgacagccacagatgggggcaaacctgagctcaca
        H  L  F  L  T  A  T  D  G  G  K  P  E  L  T 
  10331 ggcactgttcagctgctggtcacagtgctggatgtgaatgataat
        G  T  V  Q  L  L  V  T  V  L  D  V  N  D  N 
  10376 gctcccactttcgaacagtctgaatacgaagtaagaatattcgaa
        A  P  T  F  E  Q  S  E  Y  E  V  R  I  F  E 
  10421 aacgcagacaacggaacaacagttatcaaactgaatgcttctgat
        N  A  D  N  G  T  T  V  I  K  L  N  A  S  D 
  10466 ccggatgaaggagccaatggggcaatttcatattcttttaatagc
        P  D  E  G  A  N  G  A  I  S  Y  S  F  N  S 
  10511 cttgttgaaactatggttattgaccactttagcatagatcgaaat
        L  V  E  T  M  V  I  D  H  F  S  I  D  R  N 
  10556 acgggagaaatagtgattcggggtaatttggattttgaacaagaa
        T  G  E  I  V  I  R  G  N  L  D  F  E  Q  E 
  10601 aacttatacaaaatcctcattgacgccacggacaaaggccatcct
        N  L  Y  K  I  L  I  D  A  T  D  K  G  H  P 
  10646 cccatggcgggtcattgcaccgttttagtgagaattttggataaa
        P  M  A  G  H  C  T  V  L  V  R  I  L  D  K 
  10691 aatgataacgtccctgagatagcactgacttccttatccttgcct
        N  D  N  V  P  E  I  A  L  T  S  L  S  L  P 
  10736 gtacgtgaagacgctcaatttggtactgtcatcgccctaattagc
        V  R  E  D  A  Q  F  G  T  V  I  A  L  I  S 
  10781 gtgaacgacctcgattcaggtgccaacgggcaggtgacctgctcc
        V  N  D  L  D  S  G  A  N  G  Q  V  T  C  S 
  10826 ctgatgccccatgtccccttcaagctggtgtccaccttcaagaat
        L  M  P  H  V  P  F  K  L  V  S  T  F  K  N 
  10871 tactactcgttggtgctggacagcgccctggaccgcgagagagtg
        Y  Y  S  L  V  L  D  S  A  L  D  R  E  R  V 
  10916 tcggcctatgagttggtggtaaccgcgcgggacgggggctcgcct
        S  A  Y  E  L  V  V  T  A  R  D  G  G  S  P 
  10961 tcgctgtgggccaccgccagcttgtctgtggaggtggccgacgtg
        S  L  W  A  T  A  S  L  S  V  E  V  A  D  V 
  11006 aacgacaatgctccggcgttcgcgcagcccgagtacacggtgttc
        N  D  N  A  P  A  F  A  Q  P  E  Y  T  V  F 
  11051 gtgaaggagaacaacccgccgggctgccacatcttcacggtgtct
        V  K  E  N  N  P  P  G  C  H  I  F  T  V  S 
  11096 gcgcgagacgcggacgcgcaggagaacgcgctggtgtcctactcg
        A  R  D  A  D  A  Q  E  N  A  L  V  S  Y  S 
  11141 cttgtggagcggcgggtgggcgagcgctcgctgtcgagctacatt
        L  V  E  R  R  V  G  E  R  S  L  S  S  Y  I 
  11186 tcggtgcacacggagagcggcaaggtgtacgcgctgcagccgctg
        S  V  H  T  E  S  G  K  V  Y  A  L  Q  P  L 
  11231 gaccacgaggagctagagctgctgcagttccaggtgagcgcgcgc
        D  H  E  E  L  E  L  L  Q  F  Q  V  S  A  R 
  11276 gacgcgggcgtgccgcctctgggcagcaacgtgacgctgcaggtg
        D  A  G  V  P  P  L  G  S  N  V  T  L  Q  V 
  11321 ttcgtgctggacgagaatgacaacgcgccggcactgctggagcct
        F  V  L  D  E  N  D  N  A  P  A  L  L  E  P 
  11366 cgggtgggtggcactggtggcgcagcgagcaagctggtgccgcgg
        R  V  G  G  T  G  G  A  A  S  K  L  V  P  R 
  11411 tctgtgggcgcgggccacgtggtagcgaaggtgcgcgcagtggac
        S  V  G  A  G  H  V  V  A  K  V  R  A  V  D 
  11456 gccgactcgggctacaacgcgtggctttcgtatgagctgcagcca
        A  D  S  G  Y  N  A  W  L  S  Y  E  L  Q  P 
  11501 gctgcaagcagccctcgcatcccgttccgcgtggggctgtacacg
        A  A  S  S  P  R  I  P  F  R  V  G  L  Y  T 
  11546 ggcgagatcagcaccactcgtgtcctggacgaagcggactctccg
        G  E  I  S  T  T  R  V  L  D  E  A  D  S  P 
  11591 cgccaccgtctgctggtcctggtgaaggatcatggtgaacctgcg
        R  H  R  L  L  V  L  V  K  D  H  G  E  P  A 
  11636 ctgaccgccacggccacggttctggtgtcgctggtggagagcggc
        L  T  A  T  A  T  V  L  V  S  L  V  E  S  G 
  11681 caggctccaaaagcgtcatcgaggcagtcggctggcgttttgggt
        Q  A  P  K  A  S  S  R  Q  S  A  G  V  L  G 
  11726 ccggaagcggcgctggtggatgtcaacgtgtacctgatcatcgcc
        P  E  A  A  L  V  D  V  N  V  Y  L  I  I  A 
  11771 atctgcgcggtatccagcctgctggtgctcacgctgctgctgtac
        I  C  A  V  S  S  L  L  V  L  T  L  L  L  Y 
  11816 actgcgctgcggtgctcagcactgcccactgagggcgggtgccgg
        T  A  L  R  C  S  A  L  P  T  E  G  G  C  R 
  11861 gcgggcaagcccactctggtgtgctccagtgcggtggggagctgg
        A  G  K  P  T  L  V  C  S  S  A  V  G  S  W 
  11906 tcatactcgcaacaacagccgcagagggtgtgctctggtgagggg
        S  Y  S  Q  Q  Q  P  Q  R  V  C  S  G  E  G 
  11951 ccaccgaagacggacctcatggccttcagcccctgccttcctcct
        P  P  K  T  D  L  M  A  F  S  P  C  L  P  P 
  11996 gatctgggatcagttgatgtaggcgaagagcaagatttaaatgtt
        D  L  G  S  V  D  V  G  E  E  Q  D  L  N  V 
  12041 gatcatggcctcaaagtggtttccagatgtagctgcctgggggtc
        D  H  G  L  K  V  V  S  R  C  S  C  L  G  V 
  12086 cagtgtctgctgctctcgcttcttctcctcgcagcctgggaggtg
        Q  C  L  L  L  S  L  L  L  L  A  A  W  E  V 
  12131 gggagcggccagctccactactcagtctacgaggaggccagacac
        G  S  G  Q  L  H  Y  S  V  Y  E  E  A  R  H 
  12176 ggcaccttcgtgggccgcatcgcgcaggacctggggctggagctg
        G  T  F  V  G  R  I  A  Q  D  L  G  L  E  L 
  12221 gcggagctggtgcagcgcctgttccgggtggcgtccaaaagacac
        A  E  L  V  Q  R  L  F  R  V  A  S  K  R  H 
  12266 ggggaccttctggaggtaaatctgcagaatggcattttgtttgtg
        G  D  L  L  E  V  N  L  Q  N  G  I  L  F  V 
  12311 aattctcggattgaccgcgaggagctgtgcgggcggagcgtggag
        N  S  R  I  D  R  E  E  L  C  G  R  S  V  E 
  12356 tgcagcatccacctggaggtgatcgtggacaggccgctgcaggtt
        C  S  I  H  L  E  V  I  V  D  R  P  L  Q  V 
  12401 ttccatgtggacgtggaagtgaaggacattaacgacaacccgccc
        F  H  V  D  V  E  V  K  D  I  N  D  N  P  P 
  12446 aggttctccgtaacagaacaaaagctctcaatacctgaatccaga
        R  F  S  V  T  E  Q  K  L  S  I  P  E  S  R 
  12491 ctgcttgactctcgatttccactagaaggcgcatctgatgcggat
        L  L  D  S  R  F  P  L  E  G  A  S  D  A  D 
  12536 gttggagagaacgcattgcttacttacaaactcagtccaaatgag
        V  G  E  N  A  L  L  T  Y  K  L  S  P  N  E 
  12581 tattttgttcttgatattataaacaaaaaagacaaagacaaattc
        Y  F  V  L  D  I  I  N  K  K  D  K  D  K  F 
  12626 ccagtgcttgttctgcggaagctgctggatcgtgaagaaaatcct
        P  V  L  V  L  R  K  L  L  D  R  E  E  N  P 
  12671 cagctaaagttgttgttgacagcaactgatggaggcaaacctgaa
        Q  L  K  L  L  L  T  A  T  D  G  G  K  P  E 
  12716 tttaccggatctgtttctctgctgatcctggtgttagatgccaat
        F  T  G  S  V  S  L  L  I  L  V  L  D  A  N 
  12761 gataacgcccctatctttgacagaccggtttatgaagttaagatg
        D  N  A  P  I  F  D  R  P  V  Y  E  V  K  M 
  12806 tatgaaaatcaagtgaaccaaacattagtaatacggctcaacgct
        Y  E  N  Q  V  N  Q  T  L  V  I  R  L  N  A 
  12851 tctgattcggatgaaggaataaacaaggaaatgatgtattcattt
        S  D  S  D  E  G  I  N  K  E  M  M  Y  S  F 
  12896 agctctttggtcccacccacgataagaaggaaattttggataaac
        S  S  L  V  P  P  T  I  R  R  K  F  W  I  N 
  12941 gaaaggacgggagaaataaaagtaaatgatgctattgactttgag
        E  R  T  G  E  I  K  V  N  D  A  I  D  F  E 
  12986 gacagtaacacttatgaaattcatgtagatgttacagataaggga
        D  S  N  T  Y  E  I  H  V  D  V  T  D  K  G 
  13031 aacccacctatggttggtcactgcacggtcctagtggaactactg
        N  P  P  M  V  G  H  C  T  V  L  V  E  L  L 
  13076 gatgaaaatgataattcacctgaggtgattgtcacttctctgtct
        D  E  N  D  N  S  P  E  V  I  V  T  S  L  S 
  13121 ctcccagtgaaagaagatgctcaagtgggcaccgtcattgcccta
        L  P  V  K  E  D  A  Q  V  G  T  V  I  A  L 
  13166 atcagcgtttctgaccatgattcaggagccaacggacaggtcacc
        I  S  V  S  D  H  D  S  G  A  N  G  Q  V  T 
  13211 tgctctctgacgcctcacgttccgttcaagctggtgtccacctac
        C  S  L  T  P  H  V  P  F  K  L  V  S  T  Y 
  13256 aagaattactactcattggtgctggacagcgctctggaccgcgag
        K  N  Y  Y  S  L  V  L  D  S  A  L  D  R  E 
  13301 agggtgtcggcctatgagctggtggtgaccgcgcgggacgggggc
        R  V  S  A  Y  E  L  V  V  T  A  R  D  G  G 
  13346 tcgcctccgctgtgggccacggccagcgtgtctgtggaggtggcc
        S  P  P  L  W  A  T  A  S  V  S  V  E  V  A 
  13391 gacgtgaacgacaacgcgcctgcgttcgcgcagtccgagtacacg
        D  V  N  D  N  A  P  A  F  A  Q  S  E  Y  T 
  13436 gtgttcgtgaaggagaacaacccgccaggctgccacatcttcacg
        V  F  V  K  E  N  N  P  P  G  C  H  I  F  T 
  13481 gtgtctgcgtgggacgcggacgcgcaggagaacgccctggtgtcc
        V  S  A  W  D  A  D  A  Q  E  N  A  L  V  S 
  13526 tactctctggtggagcggcggttgggcgagcgctcgctgtcgagc
        Y  S  L  V  E  R  R  L  G  E  R  S  L  S  S 
  13571 tacgtgtcggtgcacgcggagagcggcaaggtgtacgcgctgcag
        Y  V  S  V  H  A  E  S  G  K  V  Y  A  L  Q 
  13616 ccgctggaccacgaggagctggagctgctacagttccaggtgagc
        P  L  D  H  E  E  L  E  L  L  Q  F  Q  V  S 
  13661 gcgcgcgatgggggcgtgccgcctctgggcagcaacttgacgctg
        A  R  D  G  G  V  P  P  L  G  S  N  L  T  L 
  13706 caggtgttcgtgctggacgagaacgacaacgctcccgcgctgctg
        Q  V  F  V  L  D  E  N  D  N  A  P  A  L  L 
  13751 gcgtctcccgctggcagcgcgggcggtgcagtcagtgagctggtg
        A  S  P  A  G  S  A  G  G  A  V  S  E  L  V 
  13796 ctgcggtcggtggttgcgggtcacgtggtggctaaggtgcgcgca
        L  R  S  V  V  A  G  H  V  V  A  K  V  R  A 
  13841 gtggacgctgactctggatacaacgcgtggctgtcgtatgaattg
        V  D  A  D  S  G  Y  N  A  W  L  S  Y  E  L 
  13886 cagtcggcggcggttggtgcacgcatcccgtttcgcgtggggctg
        Q  S  A  A  V  G  A  R  I  P  F  R  V  G  L 
  13931 tacacgggcgagatcagtacgacgcgcgctctggatgagactgac
        Y  T  G  E  I  S  T  T  R  A  L  D  E  T  D 
  13976 tcgccacgccagcgcctactggtgctggtgaaggaccatggcgag
        S  P  R  Q  R  L  L  V  L  V  K  D  H  G  E 
  14021 ccgtcgctgacggccacggccactgtgcttgtgtcgcttgtggag
        P  S  L  T  A  T  A  T  V  L  V  S  L  V  E 
  14066 ggcagccaggcacccaaggcctcgtcgcgggcttcagtgggcgtg
        G  S  Q  A  P  K  A  S  S  R  A  S  V  G  V 
  14111 gcgcccgaggtggccctggtggatgtcaacgtgtacctgatcatc
        A  P  E  V  A  L  V  D  V  N  V  Y  L  I  I 
  14156 gccatctgcgcggtgtccagcttgctggtgctcacgctgctgctg
        A  I  C  A  V  S  S  L  L  V  L  T  L  L  L 
  14201 tacactgcactgaggtgctcggcggcgcccaccgagggcgcatgt
        Y  T  A  L  R  C  S  A  A  P  T  E  G  A  C 
  14246 gggccggtgaagcccacgctggtgtgctctagcgcggtggggagc
        G  P  V  K  P  T  L  V  C  S  S  A  V  G  S 
  14291 tggtcttactcgcagcagaggcggcagagggtgtgttctggggag
        W  S  Y  S  Q  Q  R  R  Q  R  V  C  S  G  E 
  14336 ggcctgcccaaggcggacctcatggccttcagccccagccttcca
        G  L  P  K  A  D  L  M  A  F  S  P  S  L  P 
  14381 ccatgcccaatggtagatgtggacggggaagatcagtctattgga
        P  C  P  M  V  D  V  D  G  E  D  Q  S  I  G 
  14426 ggggaccactctaggaaggttagacttttcctttgtggattcttt
        G  D  H  S  R  K  V  R  L  F  L  C  G  F  F 
  14471 tttaaaactaccaagtttgagaatatgaatatatttgtttttcat
        F  K  T  T  K  F  E  N  M  N  I  F  V  F  H 
  14516 attgttctgcaatga 14530  
        I  V  L  Q  * 


EGW01360.1 Protocadherin alpha-4 [Cricetulus griseus]

XP_848914.2 PREDICTED: protocadherin alpha-4 [Canis lupus familiaris]

NP_114062.1 protocadherin alpha-8 isoform 2 precursor [Homo sapiens]
>gb|AAD43747.1|AF152486_1 protocadherin alpha 8 short form protein [Homo sapiens]
>gb|AAC34318.1| KIAA0345-like 6 [Homo sapiens]
>gb|AAI36751.1| Protocadherin alpha 8 [Homo sapiens]

NP_114036.1 protocadherin alpha-6 isoform 2 precursor [Homo sapiens]
>gb|AAD43745.1|AF152484_1 protocadherin alpha 6 short form protein [Homo sapiens]
>gb|AAC34320.1| KIAA0345-like 8 [Homo sapiens]

NP_061734.1 protocadherin alpha-8 isoform 1 precursor [Homo sapiens]
>sp|Q9Y5H6.1|PCDA8_HUMAN RecName: Full=Protocadherin alpha-8; Short=PCDH-alpha-8; Flags: Precursor
>gb|AAD43710.1| protocadherin alpha 8 [Homo sapiens]

NP_113688.1 protocadherin alpha-4 isoform 2 precursor [Homo sapiens]
>gb|AAD43743.1|AF152482_1 protocadherin alpha 4 short form protein [Homo sapiens]
>gb|AAC34322.1| KIAA0345-like 10 [Homo sapiens]
>gb|AAI12103.1| Protocadherin alpha 4, isoform 2 precursor [Homo sapiens]
>gb|AAI13610.1| Protocadherin alpha 4 [Homo sapiens] >gb|ADR83254.1| protocadherin alpha 4 [synthetic construct]

NP_114040.1 protocadherin alpha-7 isoform 2 precursor [Homo sapiens]
>gb|AAD43746.1|AF152485_1 protocadherin alpha 7 short form protein [Homo sapiens]
>gb|AAC34319.1| KIAA0345-like 7 [Homo sapiens]

NP_061724.1 protocadherin alpha-10 isoform 1 precursor [Homo sapiens]
>sp|Q9Y5I2.1|PCDAA_HUMAN RecName: Full=Protocadherin alpha-10; Short=PCDH-alpha-10; Flags: Precursor
>gb|AAD43700.1| protocadherin alpha 10 [Homo sapiens]












    910 atgctttttcctgatgagaaagaattcacaggagcacaaagtggg
        M  L  F  P  D  E  K  E  F  T  G  A  Q  S  G 
    955 ggaccgcagcagaatcctggggtattagatgggcctcagaaaaaa
        G  P  Q  Q  N  P  G  V  L  D  G  P  Q  K  K 
   1000 ccagaagggccaatacaggccatgatggcccaatcccaaagccta
        P  E  G  P  I  Q  A  M  M  A  Q  S  Q  S  L 
   1045 ggtaagggacctgggccccggacagacgtgggagctccatttggc
        G  K  G  P  G  P  R  T  D  V  G  A  P  F  G 
   1090 cctcaaggacatagagatgtacccttttctccagatgaaatggtt
        P  Q  G  H  R  D  V  P  F  S  P  D  E  M  V 
   1135 ccaccttctatgaactcccagtctgggaccataggacccgaccac
        P  P  S  M  N  S  Q  S  G  T  I  G  P  D  H 
   1180 cttgaccatatgactcccgagcagatagcgtggctgaaactgcag
        L  D  H  M  T  P  E  Q  I  A  W  L  K  L  Q 
   1225 caggagttttatgaagagaagaggaggaagcaggaacaagtggtt
        Q  E  F  Y  E  E  K  R  R  K  Q  E  Q  V  V 
   1270 gtccagcagtgttccctccaggacatgatggtccatcagcacggg
        V  Q  Q  C  S  L  Q  D  M  M  V  H  Q  H  G 
   1315 cctcggggagtggtccgaggacccccccctccataccagatgacc
        P  R  G  V  V  R  G  P  P  P  P  Y  Q  M  T 
   1360 cctagtgaaggctgggcacctgggggtacagagccattttctgat
        P  S  E  G  W  A  P  G  G  T  E  P  F  S  D 
   1405 ggtatcaacatgccacattctctgcccccgaggggcatggctccc
        G  I  N  M  P  H  S  L  P  P  R  G  M  A  P 
   1450 caccccaacatgccagggagccagatgcgcctccctggatttgca
        H  P  N  M  P  G  S  Q  M  R  L  P  G  F  A 
   1495 ggcatgataaactctgaaatggaagggccgaatgtccccaaccct
        G  M  I  N  S  E  M  E  G  P  N  V  P  N  P 
   1540 gcatctagaccaggtctttctggagtcagttggccagatgatgtg
        A  S  R  P  G  L  S  G  V  S  W  P  D  D  V 
   1585 ccaaaaatcccagatggtcgaaattttcctcctggccagggcatt
        P  K  I  P  D  G  R  N  F  P  P  G  Q  G  I 
   1630 ttcagcggtcctggccgaggggaacgcttcccaaacccccaagga
        F  S  G  P  G  R  G  E  R  F  P  N  P  Q  G 
   1675 ttgtctgaagagatgtttcagcagcagctggcagagaaacagctg
        L  S  E  E  M  F  Q  Q  Q  L  A  E  K  Q  L 
   1720 ggtctccccccagggatggccatggaaggcatcaggcccagcatg
        G  L  P  P  G  M  A  M  E  G  I  R  P  S  M 
   1765 gagatgaacaggatgattccaggctcccagcgccacatggagcct
        E  M  N  R  M  I  P  G  S  Q  R  H  M  E  P 
   1810 gggaataaccccattttccctcgaataccagttgagggccctctg
        G  N  N  P  I  F  P  R  I  P  V  E  G  P  L 
   1855 agtccttctaggggtgactttccaaaaggaattcccccacagatg
        S  P  S  R  G  D  F  P  K  G  I  P  P  Q  M 
   1900 ggccctggtcgggaacttgagtttgggatggttcctagtgggatg
        G  P  G  R  E  L  E  F  G  M  V  P  S  G  M 
   1945 aagggagatgtcaatctaaatgtcaacatgggatccaactctcag
        K  G  D  V  N  L  N  V  N  M  G  S  N  S  Q 
   1990 atgatacctcagaagatgagagaggctggggcgggccctgaggag
        M  I  P  Q  K  M  R  E  A  G  A  G  P  E  E 
   2035 atgctgaaattacgcccaggtggctcagacatgctgcctgctcag
        M  L  K  L  R  P  G  G  S  D  M  L  P  A  Q 
   2080 cagaagatggtgccactgccatttggtgagcacccccagcaggag
        Q  K  M  V  P  L  P  F  G  E  H  P  Q  Q  E 
   2125 tatggcatgggccccagaccattccttcccatgtctcagggtcca
        Y  G  M  G  P  R  P  F  L  P  M  S  Q  G  P 
   2170 ggcagcaacagtggcttgcggaatctcagagaaccaattgggccc
        G  S  N  S  G  L  R  N  L  R  E  P  I  G  P 
   2215 gaccagaggactaacagccggctcagtcatatgccaccactacct
        D  Q  R  T  N  S  R  L  S  H  M  P  P  L  P 
   2260 ctcaacccttccagtaaccccaccagcctcaacacagctcctcca
        L  N  P  S  S  N  P  T  S  L  N  T  A  P  P 
   2305 gttcagcgcggcctggggcggaagcccttggatatatctgtggca
        V  Q  R  G  L  G  R  K  P  L  D  I  S  V  A 
   2350 ggcagccaggtgcattccccaggcattaaccctctgaagtctccc
        G  S  Q  V  H  S  P  G  I  N  P  L  K  S  P 
   2395 acgatgcaccaagtccagtcaccaatgctgggctcgccctcgggg
        T  M  H  Q  V  Q  S  P  M  L  G  S  P  S  G 
   2440 aacctcaagtccccccagactccatcgcagctggcaggcatgctg
        N  L  K  S  P  Q  T  P  S  Q  L  A  G  M  L 
   2485 gcgggcccagctgctgctgcttccattaagtccccccctgttttg
        A  G  P  A  A  A  A  S  I  K  S  P  P  V  L 
   2530 gggtctgctgctgcttcacctgtccacctcaagtctccatcactt
        G  S  A  A  A  S  P  V  H  L  K  S  P  S  L 
   2575 cctgccccgtcacctggatggacctcttctccaaaacctcccctt
        P  A  P  S  P  G  W  T  S  S  P  K  P  P  L 
   2620 cagagtcctgggatccctccaaaccataaagcacccctcaccatg
        Q  S  P  G  I  P  P  N  H  K  A  P  L  T  M 
   2665 gcctccccagccatgctgggaaatgtagagtcgagagaaagagca
        A  S  P  A  M  L  G  N  V  E  S  R  E  R  A 
   2710 catatttctccgtgggacactccttgtattgacggcgtcgggctg
        H  I  S  P  W  D  T  P  C  I  D  G  V  G  L 
   2755 gagagccgcagtcccggctgcagcacctgggagaaggcagaccgt
        E  S  R  S  P  G  C  S  T  W  E  K  A  D  R 
   2800 gtgagggggcctgtggccccagcgtgctgtggcctccgggagtgg
        V  R  G  P  V  A  P  A  C  C  G  L  R  E  W 
   2845 gaagtggaggcaggagccttccttacacttcgccatgagtttcct
        E  V  E  A  G  A  F  L  T  L  R  H  E  F  P 
   2890 gatcgactccagcatcatgattacctaggcttgggtttggatact
        D  R  L  Q  H  H  D  Y  L  G  L  G  L  D  T 
   2935 tctccagtaatgaaggcccctcccaagctggagggtgatgctact
        S  P  V  M  K  A  P  P  K  L  E  G  D  A  T 
   2980 gatggctcctttgccaataagcatggctgccatgtcattggccac
        D  G  S  F  A  N  K  H  G  C  H  V  I  G  H 
   3025 attgatgactacagtgccctaagacagcagattgcggagggcaag
        I  D  D  Y  S  A  L  R  Q  Q  I  A  E  G  K 
   3070 ctgctggtcaaaaagatagtgtctcttgtgagatcagcgtgcagc
        L  L  V  K  K  I  V  S  L  V  R  S  A  C  S 
   3115 ttccctggccttgaagcccaaggcacagaggtaagaggagacgag
        F  P  G  L  E  A  Q  G  T  E  V  R  G  D  E 
   3160 tgtcagtacagcctcctctcccctagccagacgctcagggtcacc
        C  Q  Y  S  L  L  S  P  S  Q  T  L  R  V  T 
   3205 accttcccctctgctcgcccgtcgccattcttccaaccactcgct
        T  F  P  S  A  R  P  S  P  F  F  Q  P  L  A 
   3250 gccaaagattccaccaacagtcacccacaggacaacccaggcctc
        A  K  D  S  T  N  S  H  P  Q  D  N  P  G  L 
   3295 ctttcagcagaggctcccgccccgcagccaccgcgccctctcacc
        L  S  A  E  A  P  A  P  Q  P  P  R  P  L  T 
   3340 cccgcagttctgcccgccgcctctgcccagtag 3372   
        P  A  V  L  P  A  A  S  A  Q  * 


BAG11026.1 B-cell lymphoma 9 protein [synthetic construct]

NP_004317.2 B-cell CLL/lymphoma 9 protein [Homo sapiens]
>sp|O00512.4|BCL9_HUMAN RecName: Full=B-cell CLL/lymphoma 9 protein; Short=B-cell lymphoma 9 protein; Short=Bcl-9; AltName: Full=Protein legless homolog
>emb|CAI15198.1| B-cell CLL/lymphoma 9 [Homo sapiens]
>gb|EAW50932.1| B-cell CLL/lymphoma 9, isoform CRA_a [Homo sapiens]
>gb|EAW50933.1| B-cell CLL/lymphoma 9, isoform CRA_a [Homo sapiens]

XP_513752.2 PREDICTED: b-cell CLL/lymphoma 9 protein [Pan troglodytes]

AAI16452.1 B-cell CLL/lymphoma 9 [Homo sapiens]

CAA73942.1 B-cell CLL/lymphoma 9 [Homo sapiens]