Team:Bielefeld-CeBiTec/Results/toolbox/fusing

Fusing

Short summary

Protein fusion is usually limited by the C‑ or N‑terminus of a protein. The incorporation of non-canonical amino acids that could be fused to each other or to surfaces enables several additional applications. This tool facilitates immobilization of proteins and improved stability of protein polymer networks. Furthermore, non-canonical amino acids could lead to enhanced efficiency of pathways by combining enzymes of one pathway or for any other system where colocalization is beneficial. With the synthesis of Nε-L-cysteinyl-L-lysine and the novel amino acid Nγ-cyanobenzothiazolyl-L-asparagine, we provide the first steps for a new way to fuse proteins with each other and to enhance the stability of protein polymer networks. First attempts show that both amino acids are able to bind specificly to each other under physiological conditions. Additionally, the methods of the synthesis of Nγ-cyanobenzothiazolyl-L-asparagine can be applied to each system containing one free amino group and one free carboxy group.

Synthesis of Nε-L-cysteinyl-L-lysine

We used a modified version of the method of Nguyen et al. (2011) to produce Nε‑L‑cysteinyl‑L‑lysine. To ensure a specific reaction between the amino group of the side chain of lysine (see Figure 1) and the carboxyl group of cysteine (see Figure 1) in a selective manner, so called protecting groups are introduced. Commonly used protecting groups are tert‑butyloxycarbonyl (Boc), methyl ester and trityl (Trt). They bind reversible to the corresponding functionality and can be easily removed by acids and bases after the coupling reaction happened. The first step of the synthesis is a coupling reaction of N‑Boc‑L‑lysine‑O‑methyl ester and N‑Boc‑L‑cysteine-S-Trt. Due to the protected functional groups, only the unprotected amino group of N‑Boc‑L‑lysine‑O‑methyl ester and the unprotected carboxyl group of the N‑Boc‑L‑cysteine‑S‑Trt can react with each other. The result is N‑Boc‑L‑lysine[Nε‑(N‑Boc‑L‑cysteine‑S‑Trt)]‑6‑methyl ester. After removal of ester protection with lithium hydroxide as well as Boc and Trt with trifluoroacetic acid, Nε-L-cysteinyl-L-lysine trifluoroacetic acid salt is obtained. Figure 1 shows the schematic reaction.

Figure 1: Schematic reaction of the synthesis of Nε‑L‑cysteinyl‑L‑lysine fluoroacetatic acid salt (Nguyen et al., 2011).
The unprotected carboxyl group of the cysteine (red) and the unprotected amino group of the lysine (green) are highlighted.

We synthesized Nε‑L‑cysteinyl‑L‑lysine in two batches to ensure that the method of Nguyen et al. (2001) is successful. For the first batch, we used dry dimethylformamide (DMF) as solvent for the coupling reaction as described by Nguyen et al. (2011). Due to low yield compared to Nguyen et al. using DMF, we used tetrahydrofuran (THF) for the second batch.

Coupling reaction of N-Boc-L-lysine-O-methyl ester and N-Boc-L-cysteine-S-Trt

Table 1 shows the used quantity of reactants and solvents for both batches.

Table 1: List of used reactants and solvents for the coupling.

No. Reagent MW / g mol-1 eq. weight / g n / mmol
1 N-Boc-L-lysine-O-methyl ester acetate salt 320.40 1.00 1.00 3.12
2 N-Boc-L-cysteine-S-Trt 463.60 1.50 2.14 4.62
3 DCC 206.33 1.50 0.95 4.62
4 DMAP 122.17 0.16 0.06 0.49
Solvent Volume & mL
5 DMF/THF 10

Figure 2: Result of the TLC analysis after the coupling reaction.
A: N‑Boc‑L‑lysine‑O‑methyl ester; B: N‑Boc‑L‑cysteine‑S‑Trt; C: N‑Boc‑L‑lysine‑O‑methyl ester, N‑Boc‑L‑cysteine‑S‑Trt and the reaction mixture after the coupling reaction; D: the reaction mixture after the coupling reaction.

The thin layer chromatography (TLC) analysis of the reaction mixture shows that after the coupling reaction no N-Boc-L-lysine-O-methyl ester was left (see Figure 2). This indicates that the N-Boc-L-lysine-O-methyl ester completely reacted. The two spots on the top of C and D are the product N-Boc-L-lysine[Nε-(N-Boc-L-cysteine-S-Trt)]-6-methyl ester (lower spot) and a byproduct of the reaction (upper spot).

Figure 3: Proton nuclear magnetic resonance (1H NMR) spectrum for the purified reaction mixture after the coupling reaction.
The hydrogen bonds of the protecting groups are characteristic for the estimated product N‑Boc‑L‑lysine[Nε‑(N‑Boc‑L‑cysteine‑S‑Trt)]‑6-methyl ester and therefore, their signals are highlighted.

The 1H NMR spectrum of the purified reaction mixture of the coupling reaction shows that the hydrogen atoms of all protecting groups are present (see Figure 3). The Tritylphenylmethane (Trt) at 7.2 ppm is part of the N-Boc-L-cysteine-S-Trt and the methyl ester at 3.6 ppm is originating from the N‑Boc‑L‑lysine‑O‑methyl ester. The tert‑Butyloxycarbonyl protecting group is part of both educts. In this reaction, no protecting groups should be split off, so this diagram shows the NMR spectrum for N‑Boc‑L‑lysine[Nε‑(N‑Boc‑L‑cysteine‑S‑Trt)]‑6‑methyl ester.

Removing the methyl ester of the N-Boc-L-lysine[Nε-(N-Boc-L-cysteine-S-Trt)]-6-methyl ester

Table 2 shows the used quantity of reactants and solvents for both batches.

Table 2: List of used reactants and solvents for the reaction to remove methyl ester of the first and the second batch.

First batch Second batch
No. Reagent MW / g mol-1 eq. weight / g n / mmol MW / g mol-1 eq. weight / g n / mmol
1 N-Boc-L-lysine[Nε-(N-Boc-L-cysteine-S-Trt)]-6-methyl ester 704.17 1.00 0.60 0.85 704.17 1.00 0.97 1.42
2 LiOH · H2O 42.06 1.50 0.05 1.28 42.06 1.50 0.09 2.13
Solvent Volume / mL Volume / mL
3 THF:H2O (3:1) 80 100

Figure 4: Result of the TLC analysis after removing the methyl ester.
KC2: the reaction mixture after the coupling reaction; KC3: the reaction mixture after removing the methyl ester.

After removing the methyl ester, the product is more polar than before. The result is that the N‑Boc‑L‑lysine[Nε‑(N‑Boc‑L‑cysteine‑S‑Trt)] is not soluble in the EtOAc:PE solution. The dark spot at the TLC plate for the sample KC3 is the N‑Boc‑L‑lysine[Nε‑(N‑Boc‑L‑cysteine‑S‑Trt)] and the lighter spot is the removed methyl ester (see Figure 4).

Removing the tert-Butyloxycarbonyl protecting group (Boc) and Triphenylmethane (Trt) of the N-Boc-L-lysine[Nε-(N-Boc-L-cysteine-S-Trt)]

Table 3 shows the used quantity of reactants and solvents for both batches.

Table 3: List of used reactants and solvents for the reaction to remove Boc and Trt of the first and the second batch.

First batch Second batch
No. Reagent MW / g mol-1 eq. weight / g n / mmol MW / g mol-1 eq. weight / g n / mmol
1 N-Boc-L-lysine[Nε-(N-Boc-L-cysteine-S-Trt)] 691.67 1.00 0.68 0.98 7691.67 1.00 0.96 1.39
2 Triethylsilane 116.67 4.50 0.52 4.42 116.67 4.50 0.73 6.26
Solvent Volume / mL Volume / mL
3 Trifluoroacetic acid 30 30

Figure 5: 1H NMR spectrum for the purified Nε‑L‑cysteinyl‑L‑lysine trifluoroacetatic acid salt.
All peaks of compounds with hydrogen atoms of the Nε‑L‑cysteinyl‑L‑lysine are highlighted because they are characteristic for this molecule.

The 1H NMR spectrum shows that all estimated hydrogen atoms are present and that the synthesis was successful (see Figure 5).

In the first batch, we got 400 mg of Nε‑L‑cysteinyl‑L‑lysine trifluoroacetic acid salt and in the second batch we obtained 500 mg. This corresponds to 0.84 mmol in the first batch and 1.05 mmol in the second batch. This equals to the half of the yield of Nguyen et al. (2011) with 900 mg and 1.89 mmol.

Synthesis of Nγ-cyanobenzothiazolyl-L-asparagine

The synthesis of the novel amino acid Nγ‑cyanobenzothiazolyl‑L‑asparagine (CBT‑asparagine) was a new challenge for us. Our idea was to couple the amino group of 6‑amino‑2‑cyanobenzothiazole (ACBT) with the carboxy group of the side chain of aspartic acid. Therefore, it is recommendable to use two different protective groups each bound to the N‑ and C‑terminus of the aspartic acid. Figure 6 and 7 show the schematic synthesis of CBT‑asparagine.

Figure 6: Schematic coupling reaction of ACBT and Fmoc‑asparagine‑OAllyl ester giving Fmoc‑CBT‑asparagine‑OAllyl ester as product.
The amino group of ACBT and the carboxy group of the side chain of aspartic acid get linked under water loss.

Figure 7: Schematic deprotection reaction of Fmoc‑CBT‑asparagine‑OAllyl ester.
After the coupling reaction, the deprotection reaction removes the protection groups resulting in the free amino acid or a salt of the amino acid depending of the method.

Due to the high price of 6‑amino‑2‑cyanobenzothiazole (ACBT), we synthesized it on ourselves with 2‑chloro‑6‑nitrobenzothiazole (Cl‑NBT) as starting material following the procedure of (Hauser et al., 2016). Figure 8 shows the schematic synthesis of ACBT.

Figure 8: Schematic synthesis of ACBT. The synthesis contains two steps.
The first step (i) is the nucleophilic substitution of the chlorine atom with a cyano group and the second step (ii) is the reduction of the nitro group to an amino group.

First, we changed the chlorine atom of the Cl‑NBT for a cyano group by nucleophilic substitution resulting in 6‑nitrobenzothiazole‑2‑carbonitrile (NBT‑CN). The second step of the synthesis of ACBT is the reduction of the nitro group of the NCBT to an amino group using iron and acetic acid. The last step is troublesome. Due to the reactivity of the iron and the acetic acid, the reaction mixtures contain a large amount of impurities. These impurities can disturb the coupling reaction between ACBT and aspartic acid. After several attempts to get an ACBT with sufficient purity for the coupling reaction of the amino group of ACBT and the carboxy group of the side chain of aspartic acid we got following results.

Nucleophilic Substitution of the Chlorine Atom with a Cyano Group

The applied reactants and solvents for the nucleophilic substitution of the chlorine atom to the cyano group are listed in table 4.

Table 4: List of used reactants and solvents for the nucleophilic substitution of the chlorine atom to the cyano group.

No. Reagent MW / g mol-1 eq. weight / g n / mmol
1 2-Chloro-6-nitrobenzothiazole 214.63 1.00 5.00 23.29
2 1,4-diazabicyclo[2.2.2]octane 112.17 0.15 0.37 3.33
3 NaCN 49.01 1.05 1.20 24.48
Solvent Volume & mL
4 MeCN 500

Figure 9: The 13C NMR spectrum of the product of the nucleophilic substitution.
All peaks over 80 ppm correspond to the estimated spectrum of NBT‑CN. The peak at 77 ppm shows the carbon atom of the used solvent - deuterated chlorofom.

The 13C NMR spectrum of the pure product shows that the nucleophilic substitution was successful (see Figure 9). The peak at 77 ppm corresponds to the resonance frequency of deuterated chloroform, the chosen solvent for the 13C NMR spectrum. The resonance frequency of the carbon of the cyano group is 112 ppm. The carbon atom at 141 ppm is part of the thiazole residue and the six carbon atoms at 118 ppm, 123 ppm, 126 ppm, 135 ppm, 147 ppm and 155 ppm are originated from the benzene ring. The signal at 112 pm is crucial here. It shows that the chlorine atom was successfully substituted by a cyano group. Additionally, we analyzed the NBT‑CN by proton NMR (see Figure 10).

Figure 10: The 1H NMR spectrum of the product of the nucleophilic substitution.
All peaks over 8 ppm correspond to the estimated spectrum of NBT‑CN.

The three signals over 8 ppm show the three hydrogen atoms of the aromatic ring of NBT-CN. The other signals originate from excessive water, ethyl acetate and acetonitrile.

Reduction of the nitro group to an amino group

The product of the nucleophilic substitution was used for the reduction of the nitro group to an amino group. Therefore, we used iron and acetic acid to create an excess of protons. After oxidation of the iron by the oxygen of the nitro group followed by the reduction of the nitro group to an amino group, we got 1.5 g of ACBT. The proton NMR shows that the nucleophilic substitution was successful (see Figure 10).

Figure 10: The 1H NMR spectrum of the product of the reduction of the nitro group of NBT-CN.
Despite of the impurities, all peaks of the hydrogen atoms of ACBT are present.

Compared to the proton NMR of the NBT-CN (see Figure 9), there is now an additional peak at 5.7 ppm which originates from the amino group. Additionally, you can see the hydrogen atoms of several impurities with resonance frequencies under 5.7 ppm like ethyl acetate, water and byproducts of the reduction with iron and acetic acid which caused problems for further experiments.

Coupling Reaction of ACBT and N-Fmoc-aspartic-acid-OAllyl ester

For this reaction, we used the method of Yuan et al. (2016). They used this method to synthesize (D‑Cys‑Lys‑CBT)2 which in turn was used to build cyclic D-Luciferin nanoparticles. This method enables the coupling of ACBT with a carboxyl group. We needed several attempts to successfully couple ACBT to the carboxy group of the side chain of aspartic acid. Due to the high reactivity of the previous reaction with iron and acetic acid, the crude product of the reduction of the nitro group of NBT-CN to the amino group of ACBT contains different impurities. In the first attempt, we tried to use N‑Boc‑aspartic acid‑OBzl and the crude product as reagent. The following figure shows the proton NMR of the product (Figure 11).

Figure 11: The 1H NMR spectrum of the product of the first attempt of the coupling reaction of ACBT and aspartic acid.
There are no peaks corresponding to the resonance frequencies of the hydrogen bonds of the ACBT. There are only signals of the hydrogen atoms of the protected aspartic acid.

The peaks at 1.2 ppm, 2.0 ppm and 4.0 ppm are originating from excessive ethyl acetate. All other peaks correspond to the resonance frequencies of the N‑Boc‑aspartic acid‑OBzl. Thus, the first attempt of the coupling reaction failed. In the second attempt we purified the crude product of the reduction reaction by column chromatography and used N‑Fmoc‑aspartic acid‑OAllyl ester. After analyzing the product of the second coupling reaction by liquid chromatography‑mass spectroscopy (LC‑MS), we got following LC-MS signal (Figure 12).

Figure 12: Mass spectrum of the product of the second attempt of the coupling reaction of ACBT and aspartic acid.
The peak at the position 553.15 shows that the coupling reaction was successful.

With the mass-to-charge ratio of 553.15 Da e-1, we got a positive signal for the anion of N‑Fmoc‑Nγ‑cyanobenzothiazolyl‑aspartic acid‑OAllyl ester which has a molecular weight of 552.61 g mol‑1.

Removing the Fluorenylmethyloxycarbonyl Group (Fmoc) and the Allyl Ester (OAll)

According to Goodman et al., (2004), we chose morpholine as base to remove the allyl ester protection group. Tetrakis(triphenylphosphine)palladium(0) (Pd(PPh3)4) serves as catalyst. For the removal of the protecting groups Fmoc and allyl ester, the applied reactants and solvents are listed in table 6.

Table 6: List of used reactants and solvents for the deprotection of N‑Fmoc‑Nγ‑cyanobenzothiazolyl‑asparagine‑OAll.

No. Reagent MW / g mol-1 eq. weight / g n / mmol
1 Fmoc-CBT-asparagine-OAll 552.61 1.00 1.40 2.53
2 Tetrakis(triphenylphosphine)palladium(0) 1,155.59 0.10 0.29 0.25
3 Morpholine 87.10 3.00 0.65 7.59
Solvent Volume & mL
4 THF 50
Figure 13 to 16 show the results of the LC-MS analysis of the deprotected CBT‑asparagine. Thereby, Figure 13 and 14 represent the results of the aqueous layer and Figure 15 and 16 the results of the organic layer after the extraction of the reaction mixture of the deprotection reaction.

Figure 13: Mass spectrum of the fraction with an elution time of 2.33 min of the aqueous layer after the extraction of the reaction mixture of the deprotection reaction.
The peak for 379.11 Da e-1 corresponds to the morpholine salt of CBT‑asparagine which has a molecular weight of 377.42 g mol-1.

Figure 14: Mass spectrum of the fraction with an elution time of 1.33 min of the aqueous layer after the extraction of the reaction mixture of the deprotection reaction.
The peak for 88.08 Da e-1 corresponds to excessive morpholine which has a molecular weight of 87.12 g mol-1.

Figure 13 shows that the deprotection reaction was successful resulting in the morpholine salt of CBT-asparagin. Due to the excess of morpholine, there is still some morpholine left (see Figure 14).

Figure 15: Mass spectrum of the fraction with an elution time of 2.40 min of the organic layer after the extraction of the reaction mixture of the deprotection reaction.
The peak for 291.05 Da e-1 corresponds to the free form of CBT‑asparagine which has a molecular weight of 290.30 g mol-1.

Figure 16: Mass spectrum of the fraction with an elution time of 1.37 min of the organic layer after the extraction of the reaction mixture of the deprotection reaction.
Compared to Figure 14, there is no peak corresponding to the morpholine.

Figure 15 shows that not only the morpholine salt was obtained, but also the free amino acid. Compared to Figure 14, Figure 16 shows that the excessive morpholine is only present in the aqueous layer. Usually the acidity of morpholine with a pKa of 8.36 is not high enough to remove the protecting group Fmoc. But with the 3x equivalent of morpholine to the Fmoc- and OAll-protected amino acid and using Pd(PPh3)4, it is possible to remove even Fmoc. This results in the free variant or the morpholine salt of the amino acid. In this case we got both the free amino acid and the morpholine salt of CBT‑asparagine.
Additionally, we analyzed both layers by 1H NMR. The spectra are shown in Figure 17 and 18.

Figure 17: 1H NMR spectrum of the aqueous layer after the extraction of the reaction mixture of the deprotection reaction.
Despite of the impurities, the estimated spectrum of the unprotected CBT-asparagine is shown.

Figure 18: 1H NMR spectrum of the organic layer after the extraction of the reaction mixture of the deprotection reaction.
Despite of the impurities, the estimated spectrum of the unprotected CBT-asparagine is shown.

The NMR spectra are similar to each other. Both contain the estimated characteristics. Between 2 and 3 ppm are signals of the α carbon atom of the amino acid. The signals between 7 and 9 ppm originate from the hydrogen atoms of the primary amin, the benzene ring of the CBT side chain and some Fmoc-derivatives which are left as impurities. The peak at about 11 ppm corresponds to the amin of the CBT side chain. Both NMR spectra show that the deprotection was successful.
To test the solubility of both amino acids for further experiments, we dissolved 2.9 mg of the free form and 3.7 mg of the morpholine salt of the CBT‑asparagine each in 1 mL ammonium bicarbonate with a pH of 7.4 to get a 10 mM solution of each. For a better solubility, 15 µL of 10 M NaOH were added. The following figure shows both samples after vortexing (Figure 19).

Figure 19: Comparison of the solubility of the free form and the morpholine salt of CBT‑asparagine.
Both materials were dissolved in 1 mL ammonium bicarbonate with 15 µL of 10 M NaOH. The free form of CBT-asparagin is less soluble in ammonium bicarbonate than the morpholine salt.

Due to the neutral charge and the hydrophobic side chain of the free form of the CBT‑asparagine and compared to the morpholine salt of the CBT‑asparagine, the free form possesses a lower solubility compared to the morpholine salt. Therefore, we used the morpholine salt for further experiments.

Specific Binding of CL and CBT-asparagine

To test the condensation reaction between the 1,2-aminothiol group of CL and the cyano group of the CBT‑asparagine, we dissolved 4.70 mg of CL trifluoroacetic acid salt and 3.77 mg of CBT‑asparagine morpholine salt in two separate tubes each with 5 mL of a 20 mM ammonium bicarbonate solution with a pH of 8 to get a 2 mM solution of each amino acid. A few drops of 10 M NaOH were added to the solution with the CBT‑asparagine. Both solutions were mixed to get a reaction mixture with a concentration of 1 mM of each amino acid. For 40 minutes, every 5 minutes a sample was analyzed by LC-MS. Already after 5 minutes, we got a signal at 349 which could originate from the ligated product. After 40 minutes, following mass spectrum was obtained for the ligation reaction (see Figure 20).

Figure 20: Mass spectrum of the fraction with an elution time of 2.30 min of the ligation reaction.
The peak at 349 should correspond to the ligated product of CL and CBT‑asparagine morpholine salt which has a molecular weight of 611.73 g mol-1.

Considering the exact mass of the ligated form of CL and CBT-asparagin of 522.14 Da and a 2x positive charge, we get a mass-to-charge ratio of 261.07 Da e-1. Adding the exact mass of the morpholine anion of 88.08 Da and in consideration of the 1x positive charge, the peak for 349 Da e-1 should correspond to the estimated product of the ligation reaction. Taking into account that there is a signal for a substance with a mass-to-charge ratio of 349 Da e-1 in the aqueous layer after the extraction of the reaction mixture of the deprotection reaction (see Figure 21) prior to the ligation reaction, it is not certain that the peak for 349 Da e-1 seen in Figure 20 originates from the ligated amino acids.

Figure 21: Mass spectrum of the fraction with an elution time of 2.18 min of the aqueous layer after the extraction of the reaction mixture of the deprotection reaction.
Even before the ligation reaction, there is a peak for 349 Da e-1.

The chromatograms of the aqueous layer after the extraction of the reaction mixture of the deprotection reaction and the reaction mixture of the ligation reaction (see Figure 22) show that there is a difference between those signals.

Figure 22: Comparison of the chromatogram of the aqueous layer after the extraction of the reaction mixture of the deprotection reaction (red) and the reaction mixture of the ligation reaction (blue).
The peak of the substance with a mass-to-charge ratio of 349 of the red chromatogram is at 2.18 min and the one of the blue chromatogram is at 2.30 min.

Due to the dilution by mixing the 2 mM solutions of both amino acids, the peaks over 8 min of the blue chromatogram are lower than the ones of the red chromatogram. On the other hand, the peak corresponding to the substance with a mass-to-charge ratio of 349 Da e-1 of the blue chromatogram is higher in relation to the peaks over 8 min compared with the ratio between the peak corresponding to the substance with a mass-to-charge ratio of 349 Da e-1 and the peaks over 8 min of the red chromatogram. Additionally, the peaks of the substance with a mass-to-charge ratio of 349 Da e-1 of the red chromatogram (elution time of 2.18 min) and of the blue chromatogram are shifted to each other (elution time of 2.30 min). These aspects show that the substances with a mass-to-charge ratio of both samples should be different, thus showing that the ligation between CL and CBT-asparagin was successful.

References

Goodman, M. Toniolo, C., and Moroder, L. (2004). Methods of Organic Chemistry: Synthesis of Peptide and Peptidomimetics 4th ed. (Thieme).
Hauser, J.R., Beard, H.A., Bayana, M.E., Jolley, K.E., Warriner, S.L., and Bon, R.S. (2016). Economical and scalable synthesis of 6-amino-2-cyanobenzothiazole. Beilstein J. Org. Chem. 12: 2019–2025.
Nguyen, D.P., Elliott, T., Holt, M., Muir, T.W., and Chin, J.W. (2011). Genetically Encoded 1,2‑Aminothiols Facilitate Rapid and Site‑Specific Protein Labeling via a Bio‑orthogonal Cyanobenzothiazole Condensation. J. Am. Chem. Soc. 133: 11418–11421.
Yuan, Y., Wang, F., Tang, W., Ding, Z., Wang, L., Liang, L., Zheng, Z., Zhang, H., and Liang, G. (2016). Intracellular Self-Assembly of Cyclic d -Luciferin Nanoparticles for Persistent Bioluminescence Imaging of Fatty Acid Amide Hydrolase. ACS Nano 10: 7147–7153.