Amino acid dipeptide frequency for Pseudomonas phage PRR1 (Bacteriophage PRR1)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and all underrepresented ones in blue, in the table.
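The counting described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the database's actual pipeline: it assumes that dipeptides are counted as overlapping adjacent pairs within each protein, that any residue outside the 20 standard one-letter codes is pooled as X (Xaa), and that the reference value is the uniform expectation of 1000/400 = 2.5‰. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard one-letter codes
EXPECTED = 1000.0 / 400            # 2.5 per mille under the uniform assumption


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        for first, second in zip(seq, seq[1:]):
            # Assumption: non-standard or unknown residues are pooled as X.
            first = first if first in STANDARD else "X"
            second = second if second in STANDARD else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def label(freq, expected=EXPECTED):
    """Classify a frequency relative to the uniform expectation."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"


# Hypothetical toy sequences, not taken from the PRR1 proteome.
toy = ["MKLLA", "MKLV"]
freqs = dipeptide_per_mille(toy)
print(round(freqs["MK"], 3), label(freqs["MK"]))
```

The returned dictionary always has 441 keys, so dipeptides that never occur appear with a frequency of 0.0, as in the table.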

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 4.378‰ corresponds to 0.004378.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.378 ± 1.978
AlaCys: 0.0 ± 0.0
AlaAsp: 3.503 ± 2.371
AlaGlu: 2.627 ± 1.159
AlaPhe: 0.876 ± 0.754
AlaGly: 3.503 ± 1.516
AlaHis: 0.876 ± 0.61
AlaIle: 4.378 ± 1.5
AlaLys: 2.627 ± 0.867
AlaLeu: 6.13 ± 2.121
AlaMet: 0.0 ± 0.0
AlaAsn: 2.627 ± 1.831
AlaPro: 0.0 ± 0.0
AlaGln: 0.876 ± 1.418
AlaArg: 3.503 ± 0.892
AlaSer: 7.881 ± 2.668
AlaThr: 7.005 ± 1.549
AlaVal: 3.503 ± 1.88
AlaTrp: 0.0 ± 0.0
AlaTyr: 0.876 ± 0.61
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 2.627 ± 0.867
CysGlu: 0.0 ± 0.0
CysPhe: 0.876 ± 0.61
CysGly: 1.751 ± 1.22
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 2.627 ± 1.867
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 0.876 ± 0.61
CysPro: 0.876 ± 0.61
CysGln: 0.0 ± 0.0
CysArg: 0.0 ± 0.0
CysSer: 0.0 ± 0.0
CysThr: 0.0 ± 0.0
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.503 ± 2.371
AspCys: 0.0 ± 0.0
AspAsp: 2.627 ± 1.831
AspGlu: 0.0 ± 0.0
AspPhe: 1.751 ± 1.356
AspGly: 2.627 ± 1.831
AspHis: 0.876 ± 1.418
AspIle: 4.378 ± 0.676
AspLys: 2.627 ± 1.508
AspLeu: 7.881 ± 2.601
AspMet: 1.751 ± 0.537
AspAsn: 0.876 ± 1.418
AspPro: 3.503 ± 1.4
AspGln: 1.751 ± 1.22
AspArg: 1.751 ± 1.33
AspSer: 3.503 ± 1.516
AspThr: 1.751 ± 1.33
AspVal: 6.13 ± 1.8
AspTrp: 3.503 ± 1.88
AspTyr: 3.503 ± 0.892
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.627 ± 1.675
GluCys: 0.0 ± 0.0
GluAsp: 1.751 ± 0.537
GluGlu: 0.0 ± 0.0
GluPhe: 0.0 ± 0.0
GluGly: 3.503 ± 1.074
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 2.627 ± 0.867
GluLeu: 6.13 ± 2.248
GluMet: 1.751 ± 1.509
GluAsn: 2.627 ± 2.263
GluPro: 2.627 ± 1.831
GluGln: 1.751 ± 0.537
GluArg: 3.503 ± 1.877
GluSer: 7.881 ± 2.557
GluThr: 0.876 ± 0.61
GluVal: 0.876 ± 1.418
GluTrp: 2.627 ± 1.831
GluTyr: 2.627 ± 1.159
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.627 ± 0.867
PheCys: 0.0 ± 0.0
PheAsp: 2.627 ± 1.508
PheGlu: 2.627 ± 1.159
PhePhe: 0.0 ± 0.0
PheGly: 1.751 ± 1.22
PheHis: 0.0 ± 0.0
PheIle: 4.378 ± 1.307
PheLys: 2.627 ± 1.159
PheLeu: 4.378 ± 1.978
PheMet: 0.876 ± 0.754
PheAsn: 0.0 ± 0.0
PhePro: 0.876 ± 0.754
PheGln: 0.0 ± 0.0
PheArg: 3.503 ± 1.4
PheSer: 6.13 ± 2.149
PheThr: 4.378 ± 0.676
PheVal: 3.503 ± 3.818
PheTrp: 1.751 ± 1.22
PheTyr: 0.876 ± 0.61
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 1.751 ± 1.22
GlyCys: 0.876 ± 0.61
GlyAsp: 6.13 ± 2.248
GlyGlu: 2.627 ± 2.681
GlyPhe: 5.254 ± 1.61
GlyGly: 3.503 ± 1.4
GlyHis: 0.876 ± 0.754
GlyIle: 5.254 ± 1.653
GlyLys: 4.378 ± 1.307
GlyLeu: 7.005 ± 2.989
GlyMet: 0.876 ± 1.418
GlyAsn: 4.378 ± 1.307
GlyPro: 1.751 ± 1.22
GlyGln: 1.751 ± 1.985
GlyArg: 5.254 ± 0.632
GlySer: 3.503 ± 1.4
GlyThr: 3.503 ± 3.018
GlyVal: 6.13 ± 1.959
GlyTrp: 0.876 ± 0.754
GlyTyr: 4.378 ± 1.307
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.876 ± 0.61
HisCys: 0.876 ± 0.61
HisAsp: 0.876 ± 1.418
HisGlu: 0.876 ± 0.61
HisPhe: 0.0 ± 0.0
HisGly: 0.876 ± 0.754
HisHis: 0.0 ± 0.0
HisIle: 0.876 ± 0.61
HisLys: 0.876 ± 0.61
HisLeu: 1.751 ± 1.985
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.876 ± 0.754
HisGln: 0.876 ± 0.754
HisArg: 0.876 ± 0.61
HisSer: 0.0 ± 0.0
HisThr: 0.876 ± 1.418
HisVal: 0.0 ± 0.0
HisTrp: 1.751 ± 1.22
HisTyr: 1.751 ± 1.509
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.627 ± 2.592
IleCys: 0.876 ± 0.61
IleAsp: 7.881 ± 1.099
IleGlu: 0.876 ± 0.61
IlePhe: 0.876 ± 0.61
IleGly: 2.627 ± 1.508
IleHis: 0.876 ± 0.61
IleIle: 1.751 ± 0.537
IleLys: 0.876 ± 0.754
IleLeu: 2.627 ± 3.866
IleMet: 0.0 ± 0.0
IleAsn: 4.378 ± 1.307
IlePro: 1.751 ± 0.537
IleGln: 0.0 ± 0.0
IleArg: 7.005 ± 1.523
IleSer: 8.757 ± 1.155
IleThr: 1.751 ± 1.509
IleVal: 3.503 ± 2.661
IleTrp: 0.876 ± 0.61
IleTyr: 2.627 ± 2.28
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.751 ± 1.33
LysCys: 0.0 ± 0.0
LysAsp: 0.876 ± 1.418
LysGlu: 4.378 ± 2.349
LysPhe: 2.627 ± 0.867
LysGly: 5.254 ± 2.317
LysHis: 0.0 ± 0.0
LysIle: 0.876 ± 0.754
LysLys: 0.876 ± 0.61
LysLeu: 6.13 ± 2.121
LysMet: 2.627 ± 1.508
LysAsn: 0.876 ± 0.754
LysPro: 4.378 ± 1.978
LysGln: 0.0 ± 0.0
LysArg: 5.254 ± 2.317
LysSer: 4.378 ± 1.18
LysThr: 1.751 ± 1.356
LysVal: 6.13 ± 3.41
LysTrp: 0.0 ± 0.0
LysTyr: 0.876 ± 0.754
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.005 ± 5.1
LeuCys: 0.876 ± 0.61
LeuAsp: 3.503 ± 1.4
LeuGlu: 4.378 ± 1.307
LeuPhe: 2.627 ± 2.28
LeuGly: 4.378 ± 3.051
LeuHis: 1.751 ± 0.537
LeuIle: 8.757 ± 2.613
LeuLys: 4.378 ± 3.99
LeuLeu: 11.384 ± 2.807
LeuMet: 1.751 ± 0.633
LeuAsn: 6.13 ± 2.149
LeuPro: 4.378 ± 1.641
LeuGln: 3.503 ± 2.661
LeuArg: 13.135 ± 3.438
LeuSer: 5.254 ± 1.376
LeuThr: 6.13 ± 3.118
LeuVal: 6.13 ± 3.335
LeuTrp: 0.876 ± 1.954
LeuTyr: 3.503 ± 1.074
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.627 ± 0.972
MetCys: 0.876 ± 1.954
MetAsp: 0.0 ± 0.0
MetGlu: 0.876 ± 0.754
MetPhe: 1.751 ± 1.22
MetGly: 1.751 ± 1.22
MetHis: 0.0 ± 0.0
MetIle: 1.751 ± 1.33
MetLys: 1.751 ± 0.537
MetLeu: 1.751 ± 1.356
MetMet: 0.0 ± 0.0
MetAsn: 0.0 ± 0.0
MetPro: 0.876 ± 0.754
MetGln: 1.751 ± 0.537
MetArg: 0.876 ± 0.754
MetSer: 0.876 ± 0.61
MetThr: 2.627 ± 2.263
MetVal: 0.876 ± 0.754
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.876 ± 0.754
AsnCys: 0.0 ± 0.0
AsnAsp: 2.627 ± 0.972
AsnGlu: 0.876 ± 0.754
AsnPhe: 4.378 ± 0.676
AsnGly: 5.254 ± 2.201
AsnHis: 1.751 ± 1.811
AsnIle: 1.751 ± 1.985
AsnLys: 0.876 ± 0.754
AsnLeu: 7.881 ± 1.749
AsnMet: 1.751 ± 0.537
AsnAsn: 1.751 ± 1.356
AsnPro: 5.254 ± 1.734
AsnGln: 1.751 ± 1.22
AsnArg: 1.751 ± 0.537
AsnSer: 0.876 ± 0.754
AsnThr: 0.876 ± 0.61
AsnVal: 4.378 ± 1.18
AsnTrp: 0.876 ± 0.754
AsnTyr: 0.876 ± 0.754
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.751 ± 1.22
ProCys: 0.0 ± 0.0
ProAsp: 4.378 ± 1.978
ProGlu: 1.751 ± 0.537
ProPhe: 5.254 ± 2.317
ProGly: 2.627 ± 0.867
ProHis: 0.0 ± 0.0
ProIle: 1.751 ± 1.33
ProLys: 1.751 ± 1.22
ProLeu: 6.13 ± 2.149
ProMet: 1.751 ± 1.515
ProAsn: 2.627 ± 0.972
ProPro: 0.876 ± 0.61
ProGln: 0.0 ± 0.0
ProArg: 3.503 ± 0.892
ProSer: 6.13 ± 3.171
ProThr: 2.627 ± 1.831
ProVal: 3.503 ± 2.712
ProTrp: 0.876 ± 0.754
ProTyr: 2.627 ± 1.159
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.876 ± 0.754
GlnCys: 0.876 ± 0.61
GlnAsp: 0.0 ± 0.0
GlnGlu: 0.876 ± 0.61
GlnPhe: 0.0 ± 0.0
GlnGly: 0.876 ± 1.418
GlnHis: 0.876 ± 0.754
GlnIle: 0.0 ± 0.0
GlnLys: 1.751 ± 0.537
GlnLeu: 3.503 ± 2.512
GlnMet: 0.0 ± 0.0
GlnAsn: 0.876 ± 1.418
GlnPro: 0.876 ± 0.754
GlnGln: 0.0 ± 0.0
GlnArg: 1.751 ± 0.537
GlnSer: 3.503 ± 1.877
GlnThr: 2.627 ± 1.675
GlnVal: 0.876 ± 0.754
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.876 ± 0.754
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.378 ± 1.307
ArgCys: 0.0 ± 0.0
ArgAsp: 6.13 ± 1.764
ArgGlu: 4.378 ± 1.18
ArgPhe: 2.627 ± 1.508
ArgGly: 7.005 ± 2.789
ArgHis: 0.876 ± 0.61
ArgIle: 0.0 ± 0.0
ArgLys: 3.503 ± 1.122
ArgLeu: 6.13 ± 1.8
ArgMet: 2.627 ± 1.713
ArgAsn: 7.005 ± 1.549
ArgPro: 1.751 ± 0.537
ArgGln: 1.751 ± 1.509
ArgArg: 5.254 ± 2.571
ArgSer: 8.757 ± 2.046
ArgThr: 5.254 ± 1.12
ArgVal: 4.378 ± 1.307
ArgTrp: 2.627 ± 2.263
ArgTyr: 4.378 ± 2.349
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.13 ± 2.248
SerCys: 1.751 ± 1.22
SerAsp: 3.503 ± 1.4
SerGlu: 2.627 ± 1.159
SerPhe: 5.254 ± 1.734
SerGly: 6.13 ± 3.031
SerHis: 0.0 ± 0.0
SerIle: 3.503 ± 1.516
SerLys: 4.378 ± 1.611
SerLeu: 7.881 ± 3.35
SerMet: 0.876 ± 0.754
SerAsn: 4.378 ± 0.676
SerPro: 4.378 ± 1.611
SerGln: 1.751 ± 1.356
SerArg: 9.632 ± 1.484
SerSer: 7.881 ± 4.498
SerThr: 8.757 ± 4.149
SerVal: 7.881 ± 1.595
SerTrp: 0.876 ± 0.61
SerTyr: 3.503 ± 2.333
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.627 ± 0.867
ThrCys: 0.0 ± 0.0
ThrAsp: 0.876 ± 0.61
ThrGlu: 5.254 ± 2.928
ThrPhe: 2.627 ± 0.972
ThrGly: 6.13 ± 1.786
ThrHis: 1.751 ± 1.509
ThrIle: 6.13 ± 4.612
ThrLys: 6.13 ± 1.985
ThrLeu: 2.627 ± 1.508
ThrMet: 1.751 ± 1.22
ThrAsn: 2.627 ± 1.657
ThrPro: 4.378 ± 2.205
ThrGln: 0.876 ± 0.754
ThrArg: 4.378 ± 2.62
ThrSer: 6.13 ± 1.959
ThrThr: 2.627 ± 1.675
ThrVal: 4.378 ± 0.676
ThrTrp: 1.751 ± 0.537
ThrTyr: 1.751 ± 1.509
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.503 ± 3.018
ValCys: 0.0 ± 0.0
ValAsp: 1.751 ± 1.985
ValGlu: 3.503 ± 1.122
ValPhe: 3.503 ± 1.4
ValGly: 8.757 ± 4.377
ValHis: 1.751 ± 1.356
ValIle: 5.254 ± 1.828
ValLys: 1.751 ± 0.537
ValLeu: 3.503 ± 1.122
ValMet: 0.876 ± 0.61
ValAsn: 3.503 ± 2.371
ValPro: 7.881 ± 2.916
ValGln: 2.627 ± 0.972
ValArg: 5.254 ± 1.12
ValSer: 4.378 ± 1.5
ValThr: 7.881 ± 3.043
ValVal: 7.005 ± 6.416
ValTrp: 0.876 ± 0.754
ValTyr: 1.751 ± 1.33
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 3.503 ± 1.4
TrpCys: 0.876 ± 0.754
TrpAsp: 0.0 ± 0.0
TrpGlu: 2.627 ± 1.159
TrpPhe: 0.876 ± 0.61
TrpGly: 0.0 ± 0.0
TrpHis: 1.751 ± 1.22
TrpIle: 0.0 ± 0.0
TrpLys: 1.751 ± 1.22
TrpLeu: 3.503 ± 1.831
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.876 ± 0.754
TrpGln: 0.0 ± 0.0
TrpArg: 0.876 ± 0.754
TrpSer: 0.876 ± 0.61
TrpThr: 1.751 ± 0.537
TrpVal: 1.751 ± 1.509
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.751 ± 1.509
TyrCys: 1.751 ± 1.22
TyrAsp: 2.627 ± 0.972
TyrGlu: 2.627 ± 1.159
TyrPhe: 1.751 ± 1.22
TyrGly: 1.751 ± 1.22
TyrHis: 0.876 ± 0.61
TyrIle: 1.751 ± 0.537
TyrLys: 0.876 ± 1.418
TyrLeu: 3.503 ± 2.106
TyrMet: 0.876 ± 1.2
TyrAsn: 0.876 ± 0.754
TyrPro: 1.751 ± 0.537
TyrGln: 0.0 ± 0.0
TyrArg: 2.627 ± 2.263
TyrSer: 4.378 ± 2.257
TyrThr: 1.751 ± 0.537
TyrVal: 4.378 ± 2.855
TyrTrp: 0.876 ± 0.61
TyrTyr: 1.751 ± 1.22
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1143 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
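A protein-level bootstrap of this kind could look roughly like the sketch below. It is a guess at the general idea rather than the procedure actually used: it assumes that whole proteins are resampled with replacement 100 times, that the dipeptide frequency is recomputed on each resample, and that the reported error is the standard deviation of those estimates. The helper names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_freq(proteins, pair):
    """Frequency of one dipeptide, in per mille, over the given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_freq(sample, pair))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, not the PRR1 sequences.
toy = ["MKLLA", "MKLV", "AKLLM", "MLLKV"]
print(round(bootstrap_error(toy, "LL"), 3))
```

With only a handful of proteins, as here, resamples differ a lot from one another, which is consistent with errors that are large relative to the frequencies themselves.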

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski