Amino acid dipeptide frequency for Chimpanzee stool associated circular ssDNA virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
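As an illustration of what a dipeptide frequency is, the following is a minimal Python sketch, not the code used by Proteome-pI. It assumes overlapping pairs are counted within each protein, that counts are pooled over all proteins, and that any non-standard residue is mapped to X (Xaa). The function name dipeptide_per_mille is hypothetical.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return pooled dipeptide frequencies in per mille.

    Assumption: overlapping pairs are counted inside each protein
    (pairs never span two proteins) and anything outside the 20
    standard residues is treated as X.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Two toy sequences, not taken from the proteome described here.
    for pair, freq in sorted(dipeptide_per_mille(["AAG", "GA"]).items()):
        print(pair, round(freq, 3))
```

The database may normalize differently (for example, by a different total), so values from this sketch are not guaranteed to match the table exactly.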

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred in proteins uniformly at random, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
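The expected value under the uniform assumption is simple arithmetic: 1/400 = 0.25% = 2.5‰ (or 1/441, about 2.27‰, when Xaa is included). The sketch below shows that calculation and a comparison against it. The labels "over" and "under" and the strict comparison without any tolerance are assumptions for illustration; the page does not state the exact rule used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_per_mille, alphabet_size=20):
    """Compare an observed per-mille frequency with the uniform expectation.

    Assumption: a plain greater-than / less-than test, no tolerance band.
    """
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"    # would correspond to red in the table
    if freq_per_mille < expected:
        return "under"   # would correspond to blue in the table
    return "expected"


if __name__ == "__main__":
    print(expected_per_mille())   # 2.5
    print(classify(11.638))       # over  (ArgArg value from the table)
    print(classify(0.895))        # under (AlaAla value from the table)
```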

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.895 ± 0.647
AlaCys: 0.895 ± 1.149
AlaAsp: 3.581 ± 1.471
AlaGlu: 0.895 ± 0.682
AlaPhe: 0.895 ± 0.647
AlaGly: 6.267 ± 2.673
AlaHis: 0.0 ± 0.0
AlaIle: 5.372 ± 1.289
AlaLys: 0.895 ± 0.647
AlaLeu: 8.953 ± 2.523
AlaMet: 4.476 ± 3.259
AlaAsn: 3.581 ± 1.933
AlaPro: 3.581 ± 2.266
AlaGln: 1.791 ± 0.967
AlaArg: 5.372 ± 1.579
AlaSer: 4.476 ± 2.164
AlaThr: 2.686 ± 1.106
AlaVal: 8.057 ± 2.99
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.686 ± 1.538
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.895 ± 1.149
CysCys: 0.895 ± 1.082
CysAsp: 0.895 ± 1.149
CysGlu: 0.0 ± 0.0
CysPhe: 0.895 ± 1.082
CysGly: 0.895 ± 0.682
CysHis: 0.0 ± 0.0
CysIle: 2.686 ± 2.337
CysLys: 0.0 ± 0.0
CysLeu: 2.686 ± 2.346
CysMet: 0.0 ± 0.0
CysAsn: 0.895 ± 0.682
CysPro: 2.686 ± 1.499
CysGln: 0.895 ± 1.0
CysArg: 1.791 ± 2.164
CysSer: 3.581 ± 1.226
CysThr: 1.791 ± 1.07
CysVal: 0.895 ± 1.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 1.791 ± 1.294
AspCys: 0.0 ± 0.0
AspAsp: 0.0 ± 0.0
AspGlu: 2.686 ± 0.907
AspPhe: 0.895 ± 0.647
AspGly: 1.791 ± 0.967
AspHis: 0.0 ± 0.0
AspIle: 4.476 ± 2.014
AspLys: 3.581 ± 1.368
AspLeu: 2.686 ± 1.083
AspMet: 0.895 ± 1.149
AspAsn: 0.0 ± 0.0
AspPro: 2.686 ± 1.538
AspGln: 2.686 ± 1.65
AspArg: 5.372 ± 2.775
AspSer: 5.372 ± 2.9
AspThr: 5.372 ± 3.046
AspVal: 2.686 ± 1.552
AspTrp: 0.895 ± 0.682
AspTyr: 1.791 ± 0.743
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.791 ± 1.174
GluCys: 0.895 ± 0.682
GluAsp: 2.686 ± 1.215
GluGlu: 1.791 ± 1.363
GluPhe: 0.895 ± 0.647
GluGly: 6.267 ± 3.061
GluHis: 1.791 ± 1.17
GluIle: 3.581 ± 1.628
GluLys: 0.0 ± 0.0
GluLeu: 1.791 ± 1.363
GluMet: 1.791 ± 1.596
GluAsn: 1.791 ± 0.967
GluPro: 0.895 ± 0.798
GluGln: 0.0 ± 0.0
GluArg: 1.791 ± 1.17
GluSer: 0.895 ± 1.082
GluThr: 3.581 ± 2.069
GluVal: 1.791 ± 0.743
GluTrp: 0.0 ± 0.0
GluTyr: 0.895 ± 0.682
GluXaa: 0.0 ± 0.0
Phe
PheAla: 0.895 ± 0.647
PheCys: 0.0 ± 0.0
PheAsp: 0.0 ± 0.0
PheGlu: 1.791 ± 1.22
PhePhe: 0.0 ± 0.0
PheGly: 3.581 ± 1.486
PheHis: 0.0 ± 0.0
PheIle: 0.895 ± 0.647
PheLys: 1.791 ± 1.174
PheLeu: 0.0 ± 0.0
PheMet: 0.0 ± 0.0
PheAsn: 0.895 ± 1.149
PhePro: 0.0 ± 0.0
PheGln: 0.895 ± 0.647
PheArg: 3.581 ± 2.012
PheSer: 2.686 ± 1.399
PheThr: 0.895 ± 0.682
PheVal: 3.581 ± 2.311
PheTrp: 0.0 ± 0.0
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 6.267 ± 2.09
GlyCys: 0.0 ± 0.0
GlyAsp: 2.686 ± 1.552
GlyGlu: 1.791 ± 0.743
GlyPhe: 0.895 ± 1.149
GlyGly: 1.791 ± 1.363
GlyHis: 2.686 ± 1.416
GlyIle: 3.581 ± 1.468
GlyLys: 8.057 ± 2.334
GlyLeu: 8.057 ± 3.421
GlyMet: 2.686 ± 1.106
GlyAsn: 1.791 ± 0.925
GlyPro: 7.162 ± 2.77
GlyGln: 4.476 ± 2.277
GlyArg: 5.372 ± 3.86
GlySer: 5.372 ± 1.73
GlyThr: 1.791 ± 0.743
GlyVal: 2.686 ± 1.15
GlyTrp: 1.791 ± 0.743
GlyTyr: 3.581 ± 1.538
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.791 ± 1.596
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 3.581 ± 2.337
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 1.791 ± 0.925
HisLeu: 4.476 ± 2.033
HisMet: 0.0 ± 0.0
HisAsn: 0.0 ± 0.0
HisPro: 0.895 ± 1.0
HisGln: 0.895 ± 0.798
HisArg: 2.686 ± 2.394
HisSer: 2.686 ± 1.736
HisThr: 4.476 ± 2.111
HisVal: 0.0 ± 0.0
HisTrp: 0.895 ± 0.682
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 2.686 ± 1.552
IleCys: 2.686 ± 3.246
IleAsp: 1.791 ± 0.743
IleGlu: 1.791 ± 1.063
IlePhe: 0.895 ± 0.682
IleGly: 4.476 ± 2.414
IleHis: 4.476 ± 1.628
IleIle: 0.0 ± 0.0
IleLys: 0.895 ± 0.682
IleLeu: 2.686 ± 1.353
IleMet: 3.581 ± 1.438
IleAsn: 0.0 ± 0.0
IlePro: 5.372 ± 2.362
IleGln: 3.581 ± 1.432
IleArg: 4.476 ± 2.111
IleSer: 5.372 ± 3.189
IleThr: 1.791 ± 0.967
IleVal: 6.267 ± 1.931
IleTrp: 1.791 ± 0.925
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.476 ± 1.325
LysCys: 0.895 ± 1.0
LysAsp: 0.895 ± 0.647
LysGlu: 0.895 ± 0.682
LysPhe: 0.895 ± 1.149
LysGly: 2.686 ± 1.271
LysHis: 3.581 ± 3.192
LysIle: 1.791 ± 1.063
LysLys: 0.895 ± 1.149
LysLeu: 4.476 ± 1.419
LysMet: 1.791 ± 1.174
LysAsn: 0.895 ± 1.149
LysPro: 1.791 ± 0.743
LysGln: 0.0 ± 0.0
LysArg: 0.0 ± 0.0
LysSer: 3.581 ± 1.538
LysThr: 0.0 ± 0.0
LysVal: 4.476 ± 2.185
LysTrp: 1.791 ± 1.363
LysTyr: 1.791 ± 1.363
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.476 ± 1.855
LeuCys: 2.686 ± 1.35
LeuAsp: 4.476 ± 1.714
LeuGlu: 0.895 ± 0.682
LeuPhe: 3.581 ± 1.352
LeuGly: 4.476 ± 1.857
LeuHis: 1.791 ± 1.17
LeuIle: 2.686 ± 2.045
LeuLys: 0.895 ± 0.647
LeuLeu: 5.372 ± 4.106
LeuMet: 2.686 ± 1.448
LeuAsn: 3.581 ± 1.569
LeuPro: 7.162 ± 3.673
LeuGln: 3.581 ± 1.352
LeuArg: 7.162 ± 1.727
LeuSer: 7.162 ± 4.171
LeuThr: 4.476 ± 2.032
LeuVal: 6.267 ± 1.731
LeuTrp: 0.895 ± 1.0
LeuTyr: 4.476 ± 1.459
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.581 ± 2.086
MetCys: 0.0 ± 0.0
MetAsp: 1.791 ± 1.363
MetGlu: 2.686 ± 1.65
MetPhe: 2.686 ± 2.154
MetGly: 4.476 ± 1.628
MetHis: 0.895 ± 0.682
MetIle: 0.895 ± 0.682
MetLys: 1.791 ± 1.294
MetLeu: 2.686 ± 1.467
MetMet: 2.686 ± 2.204
MetAsn: 0.895 ± 0.647
MetPro: 3.581 ± 1.899
MetGln: 1.791 ± 0.925
MetArg: 2.686 ± 1.064
MetSer: 1.791 ± 1.118
MetThr: 4.476 ± 1.33
MetVal: 1.791 ± 1.294
MetTrp: 0.895 ± 1.149
MetTyr: 1.791 ± 1.118
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 0.895 ± 0.682
AsnCys: 0.0 ± 0.0
AsnAsp: 2.686 ± 1.271
AsnGlu: 0.895 ± 0.798
AsnPhe: 1.791 ± 1.203
AsnGly: 2.686 ± 1.438
AsnHis: 0.895 ± 0.798
AsnIle: 3.581 ± 2.014
AsnLys: 1.791 ± 1.17
AsnLeu: 1.791 ± 1.596
AsnMet: 3.581 ± 1.334
AsnAsn: 0.0 ± 0.0
AsnPro: 0.895 ± 1.149
AsnGln: 1.791 ± 1.063
AsnArg: 3.581 ± 1.471
AsnSer: 0.895 ± 0.647
AsnThr: 2.686 ± 1.024
AsnVal: 2.686 ± 1.215
AsnTrp: 0.895 ± 0.647
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.372 ± 1.73
ProCys: 0.895 ± 1.082
ProAsp: 1.791 ± 1.294
ProGlu: 2.686 ± 1.15
ProPhe: 0.895 ± 1.082
ProGly: 2.686 ± 2.394
ProHis: 0.895 ± 0.798
ProIle: 2.686 ± 1.063
ProLys: 1.791 ± 0.925
ProLeu: 7.162 ± 2.591
ProMet: 2.686 ± 1.517
ProAsn: 4.476 ± 2.06
ProPro: 4.476 ± 1.102
ProGln: 3.581 ± 3.38
ProArg: 5.372 ± 2.26
ProSer: 5.372 ± 4.129
ProThr: 3.581 ± 1.933
ProVal: 3.581 ± 1.226
ProTrp: 0.895 ± 1.082
ProTyr: 2.686 ± 1.024
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.476 ± 1.714
GlnCys: 2.686 ± 1.224
GlnAsp: 1.791 ± 1.174
GlnGlu: 0.895 ± 0.682
GlnPhe: 0.895 ± 0.647
GlnGly: 1.791 ± 1.596
GlnHis: 0.895 ± 0.798
GlnIle: 5.372 ± 2.205
GlnLys: 0.0 ± 0.0
GlnLeu: 0.0 ± 0.0
GlnMet: 1.791 ± 0.967
GlnAsn: 0.895 ± 0.647
GlnPro: 0.895 ± 0.647
GlnGln: 0.895 ± 0.682
GlnArg: 5.372 ± 2.172
GlnSer: 3.581 ± 1.203
GlnThr: 2.686 ± 1.083
GlnVal: 1.791 ± 1.174
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.895 ± 0.647
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 7.162 ± 5.473
ArgCys: 2.686 ± 1.467
ArgAsp: 3.581 ± 2.337
ArgGlu: 2.686 ± 1.416
ArgPhe: 0.895 ± 0.682
ArgGly: 3.581 ± 1.604
ArgHis: 1.791 ± 1.596
ArgIle: 3.581 ± 1.858
ArgLys: 3.581 ± 1.636
ArgLeu: 6.267 ± 2.823
ArgMet: 3.581 ± 2.205
ArgAsn: 0.895 ± 1.082
ArgPro: 7.162 ± 1.735
ArgGln: 4.476 ± 3.11
ArgArg: 11.638 ± 5.564
ArgSer: 9.848 ± 2.81
ArgThr: 6.267 ± 2.42
ArgVal: 1.791 ± 0.925
ArgTrp: 2.686 ± 2.045
ArgTyr: 2.686 ± 1.35
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.581 ± 1.39
SerCys: 2.686 ± 2.346
SerAsp: 4.476 ± 1.746
SerGlu: 4.476 ± 1.867
SerPhe: 0.895 ± 1.082
SerGly: 7.162 ± 3.221
SerHis: 1.791 ± 1.17
SerIle: 4.476 ± 1.435
SerLys: 1.791 ± 0.967
SerLeu: 3.581 ± 1.025
SerMet: 3.581 ± 2.106
SerAsn: 1.791 ± 0.967
SerPro: 3.581 ± 2.015
SerGln: 0.0 ± 0.0
SerArg: 8.057 ± 6.876
SerSer: 8.057 ± 4.341
SerThr: 9.848 ± 3.187
SerVal: 7.162 ± 2.667
SerTrp: 4.476 ± 1.853
SerTyr: 4.476 ± 3.376
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 8.953 ± 2.215
ThrCys: 0.895 ± 1.0
ThrAsp: 2.686 ± 1.434
ThrGlu: 2.686 ± 2.045
ThrPhe: 0.895 ± 0.647
ThrGly: 5.372 ± 2.347
ThrHis: 1.791 ± 1.063
ThrIle: 1.791 ± 1.07
ThrLys: 5.372 ± 1.798
ThrLeu: 3.581 ± 1.688
ThrMet: 3.581 ± 1.352
ThrAsn: 5.372 ± 1.878
ThrPro: 6.267 ± 2.497
ThrGln: 1.791 ± 1.203
ThrArg: 1.791 ± 1.17
ThrSer: 5.372 ± 1.791
ThrThr: 3.581 ± 1.882
ThrVal: 3.581 ± 1.8
ThrTrp: 0.895 ± 0.647
ThrTyr: 1.791 ± 1.174
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.372 ± 1.587
ValCys: 2.686 ± 2.445
ValAsp: 2.686 ± 1.215
ValGlu: 4.476 ± 1.146
ValPhe: 1.791 ± 0.743
ValGly: 3.581 ± 1.562
ValHis: 0.895 ± 0.798
ValIle: 4.476 ± 2.014
ValLys: 0.895 ± 1.149
ValLeu: 8.057 ± 1.724
ValMet: 0.895 ± 0.647
ValAsn: 3.581 ± 1.333
ValPro: 2.686 ± 1.941
ValGln: 3.581 ± 1.882
ValArg: 6.267 ± 2.005
ValSer: 5.372 ± 2.507
ValThr: 1.791 ± 1.203
ValVal: 3.581 ± 1.333
ValTrp: 0.895 ± 0.682
ValTyr: 1.791 ± 1.203
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.895 ± 0.682
TrpCys: 0.895 ± 0.647
TrpAsp: 0.895 ± 0.682
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.895 ± 0.682
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.895 ± 1.0
TrpLys: 0.895 ± 0.682
TrpLeu: 1.791 ± 1.363
TrpMet: 0.895 ± 0.798
TrpAsn: 0.895 ± 0.682
TrpPro: 0.0 ± 0.0
TrpGln: 0.895 ± 0.682
TrpArg: 0.895 ± 0.647
TrpSer: 2.686 ± 1.467
TrpThr: 3.581 ± 1.333
TrpVal: 1.791 ± 1.363
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.791 ± 0.743
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.791 ± 1.449
TyrCys: 0.0 ± 0.0
TyrAsp: 4.476 ± 1.102
TyrGlu: 0.895 ± 0.682
TyrPhe: 0.0 ± 0.0
TyrGly: 5.372 ± 1.808
TyrHis: 0.0 ± 0.0
TyrIle: 1.791 ± 0.743
TyrLys: 0.895 ± 1.149
TyrLeu: 2.686 ± 1.48
TyrMet: 1.791 ± 0.743
TyrAsn: 1.791 ± 2.298
TyrPro: 1.791 ± 1.22
TyrGln: 0.0 ± 0.0
TyrArg: 3.581 ± 2.086
TyrSer: 1.791 ± 1.118
TyrThr: 2.686 ± 1.271
TyrVal: 0.895 ± 0.647
TyrTrp: 0.895 ± 0.682
TyrTyr: 3.581 ± 1.343
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1118 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
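Protein-level bootstrapping can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the recomputed values is reported. This is a hedged illustration only. The helper names, the fixed seed, and the use of the population standard deviation as the "±" value are assumptions; the source states only that bootstrapping was done 100 times at the protein level.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins):
    """Pooled dipeptide frequencies in per mille (overlapping pairs)."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide frequency over protein-level resamples.

    Assumption: the error is the population standard deviation of the
    per-mille frequency across n_rounds resamples of the protein list.
    """
    rng = random.Random(seed)
    values = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        values.append(per_mille(sample).get(pair, 0.0))
    return statistics.pstdev(values)


if __name__ == "__main__":
    # Toy sequences, not taken from the proteome described here.
    toy = ["MARRS", "MSTARR", "GLVAL", "RRSTA"]
    print(round(bootstrap_error(toy, "RR"), 3))
```

With only 6 proteins, resampling at the protein level can produce large errors relative to the mean, which is consistent with the wide intervals seen for many dipeptides in the table.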

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski