This is manual for command line programs
See Manual for GUI-based program
CpGPNP Manual Download
- Install perl and R package on your computer
Perl: https://www.perl.org/get.html
R package: https://www.r-project.org/
A. CpG Island predition
1. CpGPredictor.pl
[Usage]
> perl CpGPredictor.pl -f INPUTFILE[FASTA] (-l INT -w INT -m STRING -g INT -o INT) -s OUTPUTFILE
[Parameter]
-f[--file] : INPUT file (FASTA format)
-l[--searchlength] : Length of CpG island, [default 500]
-w[--slide] : moving window, [default 1]
-m[--motif] : motif which user searches
-g[--GC] : Threshold of GC contents percent, [default 55]
-o[--OBEX] : Threshold of Observed /Expected CpG Ratio, [default 0.65]
-fa : make FASTA file, [y or yes]
-s[--save] : OUTPUT file (.txt)
[Example]
For Gardiner-Garden and Frommer algorithm (1987),
> perl CpGPredictor.pl -f test.fa -l 200 -g 50 -o 0.6 -s test_result
* If you want to get a fasta file, type 'y' or 'yes' with the [-fa] option.
[Result]
.rawdata : all detected CpG islands results (1st result)
.groups : the result grouped from redundant CpG islands of .rawdata (2nd result)
.contig : CpG island results merged between the overlapped groups from .groups (Final result)
.summary : result and parameter summary file
.graph : Files to make graphs using R
.contig result example:
No. |
START |
LAST |
Length |
GC_Contents |
obs/exp_CpG_ratio |
SEQEUNCE |
1 | 10327 | 11348 | 1022 | 70.15 | 0.95 | taacccctaaccctaaccctaaccctaccctaaccctaaccctaaccctaacc |
2 | 16469 | 16668 | 200 | 51.5 | 0.60 | ACCAAGTAGAACAAGATATTTGAAATGGAAACTATTCAAAAAATTGAGAA |
3 | 28625 | 29807 | 1183 | 70.07 | 0.84 | ATTCTTTTTAAGTGACAAAACTTGTACATGTGTATCGCTC |
4 | 51434 | 51995 | 562 | 54.80 | 0.86 | GAATCACACGTATTGGAAAACCAGCGGAAGAGTAAGTCT |
5 | 121169 | 121614 | 446 | 51.34 | 0.64 | GTGCAAGTAAAGAGCCTTACTGCTGATGAGGTTTGAGG |
2. graphMake.pl
[Usage]
>perl graphMake.pl test.graph
[Result]
.pdf : graph visualization pdf file
B. Primer design
1. standardPCR.pl
[Usage]
>perl standardPCR.pl
[Parameter]
- FASTA file : fasta sequence file
- START position : start of target region
- END position : end of target region
- PRIMER LENGTH (minimum, maximum) : set minimum/maximum primer length
- Tm threshold
[Result]
.stdPrimer
.stdPrimer result example:
Category | Forward/Reverse | FirstPosition | LastPosition | 5'-primer-3' | Tm | GC(%) | PrimerLength | 5'Position | 3'Position | hair-pin_structure | product_size |
primer 1 | Forward | 150 | 500 | CCTCTCAGCCTTTGAAAGAAA | 50.24 | 42.85 | 21 | 129 | 150 | ...<.............>... | 434 |
Reverse | 150 | 500 | AACAGGAGGAGCAGAGAGCGAAGCG | 62.21 | 60 | 25 | 609 | 584 | ..<........<.>........>.. |
primer 2 | Forward | 150 | 500 | AGCCTTTGAAAGAAAGAAAGGG | 52.01 | 40.90 | 22 | 135 | 157 | ..<................>.. | 427 |
Reverse | 150 | 500 | AACAGGAGGAGCAGAGAGCGAAGCG | 62.21 | 60 | 25 | 609 | 584 | ..<........<.>........>.. |
primer 3 | Forward | 150 | 500 | AGCCTTTGAAAGAAAGAAAGGG | 52.01 | 40.90 | 22 | 135 | 157 | ..<................>.. | 427 |
Reverse | 150 | 500 | AACAGGAGGAGCAGAGAGCGAAGCG | 62.21 | 60 | 25 | 609 | 584 | ..<........<.>........>.. |
2. standardPCRmulti.pl
h4>
[Usage]
>perl standardPCRmulti.pl -p [POSITIONFILE] (-minlen [int] -maxlen [int] -mintm[int] -maxtm [int] -opt [int]) -r [FASTAFILE] -o [OUTPUTFILE]
[Parameter]
-p[--position] : the POSITION file(.groups, .contigs) resulted from CpGPredictor-2.0.pl
-minlen[--minlength] : minimum primer length, [default 20]
-maxlen[--maxlength] : maximum primer length, [default 30]
-mintm[--mintm] : the minimum Tm, [default 50]
-maxtm[--maxtm] : the maximum Tm, [default 65]
-opt[--optTm] : the optimum Tm, [default 55]
-r[--reference] : the REFERECE FASTA file
-o[--output] : the OUTPUT file
* The POSITION file can use the result file(.groups or .contig) obtained from CpGPrediction.pl, or user can manually make the POSITION file as follows
start end
54 523
39 540
46 1707
3. bisulfitePCR
[Usage]
>perl bisulfitePCR.pl
[Parameter]
- FASTA file : fasta sequence file
- START position : start of target region
- END position : end of target region
- PRODUCT SIZE : target size. It determines the direction of the primer design
- PRIMER LENGTH (minimum, maximum) : set minimum/maximum primer length
- Tm threshold
[Result]
.bisPrimer
.bisPrimer result example:
Category | Forward/Reverse | Original_Sequence | 5'-Primer-3' | Tm | GC(%) | PrimerLength | 5'Position | 3'Position | Enthalpy | Energy | Salt_concent(M) | hair-pin_structure | product_size |
primer 1 | Forward | CGAGAAAGAAGGTGAGAAAGACAGAGC | YGAGAAAGAAGGTGAGAAAGATAGAGT | 55.92 | 40.74 | 26 | 68 | 95 | 205.6 | 37.15 | 0.05 | .....<......<.>......>..... | 244 |
Reverse | CCCCACGGGAACCGCCCGTGC | ACACRAACRATTCCCRTAAAA | 54.49 | 42.85 | 20 | 360 | 339 | 178.5 | 32.7 | 0.05 | .........<.>......... |
primer 2 | Forward | CGAGAAAGAAGGTGAGAAAGACAGAGC | YGAGAAAGAAGGTGAGAAAGATAGAGT | 55.92 | 40.74 | 26 | 68 | 95 | 205.6 | 37.15 | 0.05 | .....<......<.>......>..... | 244 |
Reverse | CCCCACGGGAACCGCCCGTGC | ACACRAACRATTCCCRTAAAA | 54.49 | 42.85 | 20 | 360 | 339 | 178.5 | 32.7 | 0.05 | .........<.>......... |
primer 3 | Forward | CGAGAAAGAAGGTGAGAAAGACAGAGC | YGAGAAAGAAGGTGAGAAAGATAGAGT | 55.92 | 40.74 | 26 | 68 | 95 | 205.6 | 37.15 | 0.05 | .....<......<.>......>..... | 130 |
Reverse | CGCCGTAAGGTGCCGCAGTCCCGAATG | CATTCRAAACTACRACACCTTACRACR | 59.24 | 48.14 | 26 | 252 | 225 | 215.2 | 40.25 | 0.05 | ..<...<<..<.....>..>>...>.. |
4. bisulfitePCRmulti
[Usage]
>perl bisulfitePCRmulti -p [POSITIONFILE] (-minlen [int] -maxlen [int] -mintm[int] -maxtm [int] -t [int]) -r [FASTAFILE] -o [OUTPUTFILE]
[Parameter]
-p[--position] : the POSITION file(.groups, .contigs) resulted from CpGPredictor-2.0.pl
-minlen[--minlength] : minimum primer length, [default 20]
-maxlen[--maxlength] : maximum primer length, [default 30]
-mintm[--mintm] : the minimum Tm, [default 50]
-maxtm[--maxtm] : the maximum Tm, [default 65]
-t[--threshold] : the threshold of product size, [default 300]
-r[--reference] : the REFERECE FASTA file
-o[--output] : the OUTPUT file
* The POSITION file is the same as that used in standardPCR-2.0.pl
5. msp.pl
[Usage]
>perl msp.pl -f [FASTAFILE] -s [int] -e [int] -a [int] -o OUTPUTFILE
[Parameter]
-f[--fasta] : Input your fasta file
-s[--start] : Start Position where you want to design MSP in your FASTA file
-e[--end] : End Position where you want to design MSP in your FASTA file
-a[--amplicon] : amplicon size (default 100)
-o[--output] : the output file
* If you do not specify start and end position, map to the entire fasta file.
[Result]
The MSP program produces two results as follows.
1) .CpGMAP
2) .msp.result
1) .CpGMAP
- CpGMAP is a file that maps CG dimer positions
Map Of CpG site:
GTAGGCGAGTCGGACGTCGTTCGTAGTATCGGAGAGGGCGTATTGTAAAGGCGGGTAGTAGATCGTGGAGAGTTCGGGAG
CGGAGTTGGATATCGTTTCGGAGGGAAGAAATGAGGTAGCGGCGGTTTTCGGATTCGGTTATGTTCGTTTTTTGTTTTCG
GAGTTTAGCGTCGTTTCGGTTAGGTTAGTTCGGATATTGAGCGGGTCGAGCGCGAGTTTTCGGCGTTCGGCGGAGCGAAG
ATGTAGTGAGTTTTCGCGGGATTGTTGCGCGGGGTTCGTCGCGGTTAGTCGGATTTAGTATTCGATCGTATTTTGGGCGA
GTTGTTGATTTGAGATTAGTTTAAACGGGGGGTTTTTTATTTTTAGTATTTTTCGGAGGTGGGGAGTATCGGTTTTTAGG
TATATTCGTTGTAGAGTTTTCGCGGGTTTTGGTTTTAGTTTTGAGGGTTGTGTGTGTTAGGGGATTTATTTTACGTTCGT
|
Map of CpG :
.....CG...CG..CG.CG..CG......CG.......CG...........CG..........CG.........CG....
CG...........CG...CG...................CG.CG.....CG....CG........CG...........CG
........CG.CG...CG............CG.........CG...CG..CGCG......CG.CG..CG.CG...CG...
..............CGCG.........CGCG.....CG.CGCG......CG...........CG..CG.........CG.
.........................CG..........................CG..............CG.........
......CG............CGCG.................................................CG..CG.
|
Position of CpG :
.....6....11...15..18...22.......30........39............52...........64........
..75.....81............94....99....................120..123......130.....136....
.....146............159.........169..172....177.............191..........202....
207...211.213.......221..224...228..231....236..................255.257.........
.268.270......277..280.282.......290............303...307..........318..........
.................346...........................374...............390............
....407.............421.423.......................................
|
2) .msp.result
- The msp.result file shows the M and U primer set.
CpG_position | sense/antisense | M/U Primer | 5'-primer_sequence-3' | Tm | 5'-end position | 3'-end position |
30 | forward | M_Primer | TCGGACGTCGTTCGTAGTATC | 59.21 | 8 | 29 |
forward | U_Primer | GGTGAGTTGGATGTTGTTTGTAGTATT | 54.18 | 2 | 29 |
reverse | Universal(M&U) | RCCTAARCCAATACAARCAA | 53.02 | 150 | 130 |
39 | forward | M_Primer | GTTCGTAGTATCGGAGAGGGC | 57.34 | 17 | 38 |
forward | U_Primer | GTTGTTTGTAGTATTGGAGAGGGT | 53.25 | 14 | 38 |
reverse | Universal(M&U) | AATACAARCAAAAAACAAAARCC | 53.09 | 162 | 139 |
1. positionFinder.pl
[Usage]
>perl positionFinder.pl
[Parameter]
- FASTA file : fasta sequence file
- starting motif : Position of Starting motif
- end motif : Position of End motif