Amber TUTORIAL B5: Simulating the Green Fluorescent Protein

Section 1: Preparing the PDB file

1EMA是本次教程所用的pdb,能夠在PDB數據庫下載。數據庫

pdb4amber -i 1EMA.pdb -o gfp.pdb --dry --reduce

pdb4amber命令用於amber輸入pdb格式文件的準備。 --dry會刪除晶體結構中的水分子(WATER),--reduce會對pdb加氫(H)。命令執行完成後須要對產生的gfp.pdb進行手工微調,1) 須要把文件中全部MSE更換爲MET;2) 把全部SE 原子(Se)換爲硫(SD)原子; 3)在pdb文件最後一列中,把SE元素,換乘S元素。優化

pdb4amber analyses PDB files and cleans them for further usage, especially with the LeaP programs of Amber. It does NOT use any information in the original PDB file other than that contained in ATOM and HETATM records. The final output files are stripped of everything not directly related to ATOM or HETATM records.ui

Section 2: Computing partial charges and atom types for CRO

gfp.pdb文件中的CRO殘基爲非標準氨基酸殘基,不在標準amber殘基庫中,因此須要對此非標準氨基酸進行加電(deriving charges)和決定原子類型(determining the atom types of the CRO residue)操做;this

對於非標準殘基CRO,咱們使用PDB數據庫中的CIF文件對其進行定義,CRO.cif;atom

咱們使用antechamber程序讀取CRO.cif (ccif)並對其進行加電(-c bcc)及分配原子類型 (-at amber)操做,(For more general organic molecules, it is usually better to use gaff atom types.)spa

運行下面程序前,咱們須要先將CRO.cif文件中第五行中的LINKING字符刪掉,否則會報錯。rest

antechamber -fi ccif -i CRO.cif -bk CRO -fo ac -o cro.ac -c bcc -at amber

antechamber: This is the most important program in the package. It can perform many file conversions, and can also assign atomic charges and atom types. As required by the input, antechamber executes the following programs: sqm (or,alternatively, mopac or divcon), atomtype, am1bcc, bondtype, espgen, respgen and prepgen. It typically produces many intermediate files; these may be recognized by their names, in which all letters are upper-case.code

Running Errors:orm

1. "Residue CRO has a type of LINKING.  Quitting" 執行上述命令時,報錯LINKING:blog

解決方法有兩種:1) 刪除文件CRO.cif中的LINKING字符。

        2) 使用「//」註釋掉$AMBERHOME/AmberTools/src/antechamber 下的mmcif.c文件的第68行,"// exit(1);",而後make install,從新安裝antechamber,便可解決問題。

2. 使用diff命令對比程序運行結果會和對照結果文件不一樣,涉及到第一個原子NT和N,手動把NT改成N。

Section 3: Preparing the residue library and force field parameters for use with LEaP

PDB數據庫中下載的CRO.cif在通過antechamber處理後,變成一個擁有完整的H完整的電荷的分子,可是咱們的gpf.pdb文件中的的CRO是以共價鍵方式與周圍的氨基酸鏈接的,因此爲了使通過加氫加電的CRO分子鏈接(CRO分子的NC端鏈接到pdb中)到gpf.pdb文件中,咱們須要對CRO分子進行處理,把CRO分子的NC端的H去掉,並和pdb分子中相應的位置鏈接,咱們使用prepgen程序處理:

prepgen -i cro.ac -o cro.prepin -m cro.mc -rn CRO

 Prepgen: Prepgen generates the prep input file from an ac file. By default, the program generates a mainchain itself. However, you may also specify the main-chain atoms in the main chain file. From this file, you can also specify which atoms will be deleted, and whether to do charge correction or not. In order to generate the amino-acid-like residue (this kind of residue has one head atom and one tail atom to be connected to other residues), you need a main chain file. Sample main chain files are in $AMBERHOME/dat/antechamber.

使用prepgen程序及包含NC端原子,主鏈側鏈信息的主鏈(mc)文件,對cro.ac進行處理,獲得處理後的cro.prepin文件,至此,咱們便獲得了非標準殘基CRO的殘基庫及電荷信息;

接下來,咱們使用parmchk2程序檢查CRO分子(cro.prepin)的共價鍵(bonds, angles, and dihedrals)參數化狀況,

parmchk2 -i cro.prepin -f prepi -o frcmod.cro -a Y \
         -p $AMBERHOME/dat/leap/parm/parm10.dat

parmchk2: The parmchk2 program figures out what parameters will be needed and checks to see if they are in the standard files. If not, it tries to make educated guesses, and puts these new parameters into a file we are calling "frcmod.cro" here. -p flag specifies the parm10.dat file because it is the main parameter database for the force field we plan to use, ff14SB.

frcmod.cro文件中,有的行會標記「ATTN, need revision「,這種標記意味着,parmchk2程序不能在parm10數據庫中找到類似的參數,咱們把這些標記的行在frcmod.cro文件中刪除,而後再在gaff.dat數據庫中去尋找合適的參數。

grep -v "ATTN" frcmod.cro > frcmod1.cro # Strip out ATTN lines
parmchk2 -i cro.prepin -f prepi -o frcmod2.cro

至此,咱們獲得兩個frcmod文件,即frcmod1.cro(parm10.dat,刪除了ATTN)和frcmod2.cro(gaff.dat),用於下面的步驟。

Section 4: Creating the topology and coordinate files for simulation

咱們使用以前步驟中的文件,準備拓撲文件及座標文件。咱們使用ff14SB力場,使用隱式溶劑模型(igb=8,並設置PBRadii默認值爲mbondi3);首先咱們加載cro.prepin文件,而後對於參數文件,咱們先load frcmod2.cro再load frcmod1.cro,以確保全部gaff參數被parm10參數替換,而後再導入gfp.pdb結構文件,輸出gfp.parm7和gfp.rst7文件。

$$$ tleap.in

source leaprc.protein.ff14SB
set default PBRadii mbondi3
loadAmberPrep cro.prepin
loadAmberParams frcmod2.cro
loadAmberParams frcmod1.cro
x = loadPDB gfp.pdb
saveAmberParm x gfp.parm7 gfp.rst7
quit
tleap -f tleap.in

Section 5: Simulations; minimization, heating, equilibration, and production

Minimization

以初始座標爲起點,對整個體系優化,排除一些不利的構像

$$$ min.in

 simple generalized Born minimization script
 &cntrl
   imin=1, ntb=0, maxcyc=100, ntpr=10, cut=1000., igb=8, 
 /
sander -O -i min.in -p gfp.parm7 -c gfp.rst7 -o min1.out -r min1.rst7

Heating

以minimization的rst文件爲起點,對整個體系加熱,200ps加熱時間,從10K到300K

$$$ heat.in


Implicit solvent initial heating mdin
 &cntrl
   imin=0, irest=0, ntx=1,
   ntpr=1000, ntwx=1000, nstlim=100000,
   dt=0.002, ntt=3, tempi=10,
   temp0=300, gamma_ln=1.0, ig=-1,
   ntp=0, ntc=2, ntf=2, cut=1000,
   ntb=0, igb=8, ioutfm=1, nmropt=1,
 /
 &wt
   TYPE='TEMP0', ISTEP1=1, ISTEP2=100000,
   VALUE1=10.0, VALUE2=300.0,
 /
 &wt TYPE='END' /
sander -O -i heat.in -p gfp.parm7 -c min1.rst7 -o heat.mdout \
       -x heat.nc -r heat.rst7

Production

以上一步座標爲起點,進行MD run 100ns

$$$ md.in

Implicit solvent molecular dynamics
&cntrl imin=0, irest=1, ntx=5, ntpr=1000, ntwx=1000, nstlim=500000, dt=0.002, ntt=3, tempi=300, temp0=300, gamma_ln=1.0, ig=-1, ntp=0, ntc=2, ntf=2, cut=1000, ntb=0, igb=8, ioutfm=1, /
sander -O -i md.in -p gfp.parm7 -c heat.rst7 -o md1.mdout \
       -x md1.nc -r md1.rst7
相關文章
相關標籤/搜索