Amino acid dipeptide frequency for Pacific flying fox faeces associated gemycircularvirus-6

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
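As an illustration of what a dipeptide frequency is, the following minimal sketch counts overlapping pairs of adjacent residues and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter sequence encoding, and the choice to never count a pair that spans two proteins are assumptions made for this example; the page does not state how the database itself normalises the counts.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Hypothetical helper: sequences are one-letter strings, and pairs are
    counted with overlap inside each protein only (assumption).
    """
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Scale each count to parts per thousand of all observed dipeptides.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not data from this proteome.
freqs = dipeptide_frequencies(["MARRA", "GGK"])
```

With the toy input, six dipeptides are counted in total (MA, AR, RR, RA, GG, GK), so each one receives the same share and the values sum to 1000 per mille.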

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those which are underrepresented are marked in blue in the table.
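The expected value follows from simple arithmetic: 1/400 = 0.25% = 2.5‰ (or 1/441, about 2.27‰, when Xaa is included). The sketch below computes this baseline and labels an observed value as over- or underrepresented. The function names and the plain greater-than or less-than comparison are assumptions for illustration; the page does not state the exact rule used for colouring.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = expected_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"   # would be shown in red
    if observed_per_mille < expected:
        return "under"  # would be shown in blue
    return "expected"


# Two values taken from the table: ArgArg (18.391) and AlaMet (1.149).
labels = [classify(18.391), classify(1.149)]
```

Under this rule ArgArg (18.391‰) counts as overrepresented and AlaMet (1.149‰) as underrepresented.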

All values are given in per mille (‰); multiply by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.046AlaAla: 8.046 ± 0.984
2.299AlaCys: 2.299 ± 1.034
2.299AlaAsp: 2.299 ± 1.034
0.0AlaGlu: 0.0 ± 0.0
0.0AlaPhe: 0.0 ± 0.0
6.897AlaGly: 6.897 ± 1.146
0.0AlaHis: 0.0 ± 0.0
2.299AlaIle: 2.299 ± 1.695
8.046AlaLys: 8.046 ± 2.307
2.299AlaLeu: 2.299 ± 1.034
1.149AlaMet: 1.149 ± 0.87
2.299AlaAsn: 2.299 ± 0.739
2.299AlaPro: 2.299 ± 1.034
0.0AlaGln: 0.0 ± 0.0
11.494AlaArg: 11.494 ± 0.934
4.598AlaSer: 4.598 ± 2.178
9.195AlaThr: 9.195 ± 2.862
4.598AlaVal: 4.598 ± 3.48
2.299AlaTrp: 2.299 ± 1.034
3.448AlaTyr: 3.448 ± 0.373
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
3.448CysCys: 3.448 ± 0.373
2.299CysAsp: 2.299 ± 1.034
0.0CysGlu: 0.0 ± 0.0
3.448CysPhe: 3.448 ± 0.373
1.149CysGly: 1.149 ± 0.848
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
3.448CysAsn: 3.448 ± 1.533
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
3.448CysSer: 3.448 ± 1.968
1.149CysThr: 1.149 ± 0.87
1.149CysVal: 1.149 ± 0.848
0.0CysTrp: 0.0 ± 0.0
1.149CysTyr: 1.149 ± 0.87
0.0CysXaa: 0.0 ± 0.0
Asp
9.195AspAla: 9.195 ± 4.137
0.0AspCys: 0.0 ± 0.0
10.345AspAsp: 10.345 ± 0.717
5.747AspGlu: 5.747 ± 0.381
0.0AspPhe: 0.0 ± 0.0
12.644AspGly: 12.644 ± 4.367
1.149AspHis: 1.149 ± 0.848
1.149AspIle: 1.149 ± 0.87
2.299AspLys: 2.299 ± 0.739
1.149AspLeu: 1.149 ± 0.848
1.149AspMet: 1.149 ± 0.848
3.448AspAsn: 3.448 ± 0.373
3.448AspPro: 3.448 ± 1.533
1.149AspGln: 1.149 ± 0.87
3.448AspArg: 3.448 ± 0.373
1.149AspSer: 1.149 ± 0.848
3.448AspThr: 3.448 ± 0.373
1.149AspVal: 1.149 ± 0.87
8.046AspTrp: 8.046 ± 2.219
6.897AspTyr: 6.897 ± 0.747
0.0AspXaa: 0.0 ± 0.0
Glu
1.149GluAla: 1.149 ± 0.87
1.149GluCys: 1.149 ± 0.87
2.299GluAsp: 2.299 ± 1.034
3.448GluGlu: 3.448 ± 1.533
5.747GluPhe: 5.747 ± 2.474
3.448GluGly: 3.448 ± 0.373
0.0GluHis: 0.0 ± 0.0
3.448GluIle: 3.448 ± 1.533
2.299GluLys: 2.299 ± 1.034
4.598GluLeu: 4.598 ± 2.069
1.149GluMet: 1.149 ± 0.836
1.149GluAsn: 1.149 ± 0.848
1.149GluPro: 1.149 ± 0.848
2.299GluGln: 2.299 ± 1.034
6.897GluArg: 6.897 ± 0.747
0.0GluSer: 0.0 ± 0.0
2.299GluThr: 2.299 ± 1.74
1.149GluVal: 1.149 ± 0.848
3.448GluTrp: 3.448 ± 1.533
1.149GluTyr: 1.149 ± 0.848
0.0GluXaa: 0.0 ± 0.0
Phe
2.299PheAla: 2.299 ± 1.74
0.0PheCys: 0.0 ± 0.0
8.046PheAsp: 8.046 ± 3.469
5.747PheGlu: 5.747 ± 2.474
2.299PhePhe: 2.299 ± 1.034
1.149PheGly: 1.149 ± 0.848
0.0PheHis: 0.0 ± 0.0
1.149PheIle: 1.149 ± 0.848
2.299PheLys: 2.299 ± 1.034
0.0PheLeu: 0.0 ± 0.0
3.448PheMet: 3.448 ± 1.533
2.299PheAsn: 2.299 ± 1.74
2.299PhePro: 2.299 ± 1.107
0.0PheGln: 0.0 ± 0.0
1.149PheArg: 1.149 ± 0.848
8.046PheSer: 8.046 ± 2.219
4.598PheThr: 4.598 ± 3.48
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
1.149PheTyr: 1.149 ± 0.848
0.0PheXaa: 0.0 ± 0.0
Gly
2.299GlyAla: 2.299 ± 0.739
4.598GlyCys: 4.598 ± 2.069
6.897GlyAsp: 6.897 ± 0.747
4.598GlyGlu: 4.598 ± 2.069
2.299GlyPhe: 2.299 ± 1.034
9.195GlyGly: 9.195 ± 0.15
3.448GlyHis: 3.448 ± 0.373
0.0GlyIle: 0.0 ± 0.0
6.897GlyLys: 6.897 ± 2.346
4.598GlyLeu: 4.598 ± 2.106
3.448GlyMet: 3.448 ± 0.373
2.299GlyAsn: 2.299 ± 1.74
0.0GlyPro: 0.0 ± 0.0
4.598GlyGln: 4.598 ± 0.7
6.897GlyArg: 6.897 ± 0.747
5.747GlySer: 5.747 ± 0.381
6.897GlyThr: 6.897 ± 0.747
3.448GlyVal: 3.448 ± 2.61
3.448GlyTrp: 3.448 ± 0.373
1.149GlyTyr: 1.149 ± 0.848
0.0GlyXaa: 0.0 ± 0.0
His
2.299HisAla: 2.299 ± 1.034
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
1.149HisGlu: 1.149 ± 0.87
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
2.299HisIle: 2.299 ± 1.034
0.0HisLys: 0.0 ± 0.0
0.0HisLeu: 0.0 ± 0.0
0.0HisMet: 0.0 ± 0.0
1.149HisAsn: 1.149 ± 0.848
1.149HisPro: 1.149 ± 0.87
2.299HisGln: 2.299 ± 1.034
2.299HisArg: 2.299 ± 1.74
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
2.299HisVal: 2.299 ± 1.034
0.0HisTrp: 0.0 ± 0.0
3.448HisTyr: 3.448 ± 1.533
0.0HisXaa: 0.0 ± 0.0
Ile
1.149IleAla: 1.149 ± 0.87
2.299IleCys: 2.299 ± 0.739
0.0IleAsp: 0.0 ± 0.0
1.149IleGlu: 1.149 ± 0.848
3.448IlePhe: 3.448 ± 2.543
8.046IleGly: 8.046 ± 3.469
2.299IleHis: 2.299 ± 1.034
5.747IleIle: 5.747 ± 1.289
0.0IleLys: 0.0 ± 0.0
2.299IleLeu: 2.299 ± 1.034
0.0IleMet: 0.0 ± 0.0
1.149IleAsn: 1.149 ± 0.87
2.299IlePro: 2.299 ± 1.034
2.299IleGln: 2.299 ± 1.74
3.448IleArg: 3.448 ± 0.373
2.299IleSer: 2.299 ± 1.034
1.149IleThr: 1.149 ± 0.87
1.149IleVal: 1.149 ± 0.87
0.0IleTrp: 0.0 ± 0.0
1.149IleTyr: 1.149 ± 0.87
0.0IleXaa: 0.0 ± 0.0
Lys
0.0LysAla: 0.0 ± 0.0
0.0LysCys: 0.0 ± 0.0
2.299LysAsp: 2.299 ± 0.739
3.448LysGlu: 3.448 ± 0.373
0.0LysPhe: 0.0 ± 0.0
5.747LysGly: 5.747 ± 1.509
2.299LysHis: 2.299 ± 1.034
4.598LysIle: 4.598 ± 0.7
1.149LysLys: 1.149 ± 0.87
2.299LysLeu: 2.299 ± 1.034
0.0LysMet: 0.0 ± 0.0
0.0LysAsn: 0.0 ± 0.0
5.747LysPro: 5.747 ± 2.474
1.149LysGln: 1.149 ± 0.87
2.299LysArg: 2.299 ± 0.739
0.0LysSer: 0.0 ± 0.0
3.448LysThr: 3.448 ± 1.331
1.149LysVal: 1.149 ± 0.87
1.149LysTrp: 1.149 ± 0.848
6.897LysTyr: 6.897 ± 1.605
0.0LysXaa: 0.0 ± 0.0
Leu
9.195LeuAla: 9.195 ± 2.608
0.0LeuCys: 0.0 ± 0.0
1.149LeuAsp: 1.149 ± 0.87
5.747LeuGlu: 5.747 ± 2.474
1.149LeuPhe: 1.149 ± 0.848
5.747LeuGly: 5.747 ± 3.037
2.299LeuHis: 2.299 ± 1.034
2.299LeuIle: 2.299 ± 1.034
3.448LeuLys: 3.448 ± 2.61
0.0LeuLeu: 0.0 ± 0.0
0.0LeuMet: 0.0 ± 0.0
1.149LeuAsn: 1.149 ± 0.87
2.299LeuPro: 2.299 ± 1.107
0.0LeuGln: 0.0 ± 0.0
3.448LeuArg: 3.448 ± 1.533
0.0LeuSer: 0.0 ± 0.0
8.046LeuThr: 8.046 ± 0.984
4.598LeuVal: 4.598 ± 2.178
1.149LeuTrp: 1.149 ± 0.848
1.149LeuTyr: 1.149 ± 0.87
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
3.448MetCys: 3.448 ± 1.968
1.149MetAsp: 1.149 ± 0.848
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.149MetGly: 1.149 ± 0.87
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.0MetLys: 0.0 ± 0.0
3.448MetLeu: 3.448 ± 0.373
0.0MetMet: 0.0 ± 0.0
1.149MetAsn: 1.149 ± 0.87
5.747MetPro: 5.747 ± 1.679
0.0MetGln: 0.0 ± 0.0
2.299MetArg: 2.299 ± 1.74
0.0MetSer: 0.0 ± 0.0
0.0MetThr: 0.0 ± 0.0
3.448MetVal: 3.448 ± 1.331
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
3.448AsnAla: 3.448 ± 2.61
1.149AsnCys: 1.149 ± 0.848
2.299AsnAsp: 2.299 ± 1.034
1.149AsnGlu: 1.149 ± 0.87
2.299AsnPhe: 2.299 ± 1.034
2.299AsnGly: 2.299 ± 0.739
0.0AsnHis: 0.0 ± 0.0
3.448AsnIle: 3.448 ± 0.373
1.149AsnLys: 1.149 ± 0.848
3.448AsnLeu: 3.448 ± 2.61
0.0AsnMet: 0.0 ± 0.0
0.0AsnAsn: 0.0 ± 0.0
1.149AsnPro: 1.149 ± 0.87
1.149AsnGln: 1.149 ± 0.87
1.149AsnArg: 1.149 ± 0.848
0.0AsnSer: 0.0 ± 0.0
6.897AsnThr: 6.897 ± 3.872
4.598AsnVal: 4.598 ± 2.069
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
4.598ProAla: 4.598 ± 0.7
0.0ProCys: 0.0 ± 0.0
4.598ProAsp: 4.598 ± 2.069
1.149ProGlu: 1.149 ± 0.87
4.598ProPhe: 4.598 ± 1.478
2.299ProGly: 2.299 ± 1.034
2.299ProHis: 2.299 ± 1.034
1.149ProIle: 1.149 ± 0.848
1.149ProLys: 1.149 ± 0.848
0.0ProLeu: 0.0 ± 0.0
1.149ProMet: 1.149 ± 0.87
2.299ProAsn: 2.299 ± 1.034
0.0ProPro: 0.0 ± 0.0
3.448ProGln: 3.448 ± 0.373
5.747ProArg: 5.747 ± 1.289
3.448ProSer: 3.448 ± 0.373
3.448ProThr: 3.448 ± 1.968
2.299ProVal: 2.299 ± 1.034
2.299ProTrp: 2.299 ± 1.74
6.897ProTyr: 6.897 ± 2.536
0.0ProXaa: 0.0 ± 0.0
Gln
1.149GlnAla: 1.149 ± 0.87
0.0GlnCys: 0.0 ± 0.0
4.598GlnAsp: 4.598 ± 0.7
0.0GlnGlu: 0.0 ± 0.0
1.149GlnPhe: 1.149 ± 0.848
1.149GlnGly: 1.149 ± 0.87
0.0GlnHis: 0.0 ± 0.0
1.149GlnIle: 1.149 ± 0.87
2.299GlnLys: 2.299 ± 1.034
3.448GlnLeu: 3.448 ± 0.373
0.0GlnMet: 0.0 ± 0.0
1.149GlnAsn: 1.149 ± 0.87
0.0GlnPro: 0.0 ± 0.0
1.149GlnGln: 1.149 ± 0.87
2.299GlnArg: 2.299 ± 1.034
1.149GlnSer: 1.149 ± 0.87
2.299GlnThr: 2.299 ± 1.74
0.0GlnVal: 0.0 ± 0.0
1.149GlnTrp: 1.149 ± 0.87
4.598GlnTyr: 4.598 ± 0.85
0.0GlnXaa: 0.0 ± 0.0
Arg
8.046ArgAla: 8.046 ± 2.219
0.0ArgCys: 0.0 ± 0.0
9.195ArgAsp: 9.195 ± 1.401
6.897ArgGlu: 6.897 ± 3.103
0.0ArgPhe: 0.0 ± 0.0
5.747ArgGly: 5.747 ± 1.289
0.0ArgHis: 0.0 ± 0.0
5.747ArgIle: 5.747 ± 1.679
2.299ArgLys: 2.299 ± 1.74
6.897ArgLeu: 6.897 ± 1.605
1.149ArgMet: 1.149 ± 0.733
0.0ArgAsn: 0.0 ± 0.0
10.345ArgPro: 10.345 ± 1.755
3.448ArgGln: 3.448 ± 0.373
18.391ArgArg: 18.391 ± 9.811
5.747ArgSer: 5.747 ± 1.679
3.448ArgThr: 3.448 ± 0.373
6.897ArgVal: 6.897 ± 1.605
1.149ArgTrp: 1.149 ± 0.87
3.448ArgTyr: 3.448 ± 2.61
0.0ArgXaa: 0.0 ± 0.0
Ser
1.149SerAla: 1.149 ± 0.87
0.0SerCys: 0.0 ± 0.0
3.448SerAsp: 3.448 ± 1.374
2.299SerGlu: 2.299 ± 1.034
1.149SerPhe: 1.149 ± 0.848
6.897SerGly: 6.897 ± 2.536
1.149SerHis: 1.149 ± 0.848
0.0SerIle: 0.0 ± 0.0
0.0SerLys: 0.0 ± 0.0
4.598SerLeu: 4.598 ± 2.069
1.149SerMet: 1.149 ± 0.87
4.598SerAsn: 4.598 ± 0.85
3.448SerPro: 3.448 ± 1.654
0.0SerGln: 0.0 ± 0.0
4.598SerArg: 4.598 ± 3.48
3.448SerSer: 3.448 ± 0.373
2.299SerThr: 2.299 ± 1.034
5.747SerVal: 5.747 ± 2.474
0.0SerTrp: 0.0 ± 0.0
4.598SerTyr: 4.598 ± 2.178
0.0SerXaa: 0.0 ± 0.0
Thr
10.345ThrAla: 10.345 ± 5.133
0.0ThrCys: 0.0 ± 0.0
4.598ThrAsp: 4.598 ± 2.069
2.299ThrGlu: 2.299 ± 0.739
4.598ThrPhe: 4.598 ± 0.85
3.448ThrGly: 3.448 ± 1.654
0.0ThrHis: 0.0 ± 0.0
3.448ThrIle: 3.448 ± 0.373
1.149ThrLys: 1.149 ± 0.848
3.448ThrLeu: 3.448 ± 2.61
2.299ThrMet: 2.299 ± 1.74
3.448ThrAsn: 3.448 ± 2.61
2.299ThrPro: 2.299 ± 1.034
1.149ThrGln: 1.149 ± 0.87
8.046ThrArg: 8.046 ± 3.469
4.598ThrSer: 4.598 ± 3.48
4.598ThrThr: 4.598 ± 3.48
3.448ThrVal: 3.448 ± 1.374
0.0ThrTrp: 0.0 ± 0.0
4.598ThrTyr: 4.598 ± 0.85
0.0ThrXaa: 0.0 ± 0.0
Val
3.448ValAla: 3.448 ± 0.373
0.0ValCys: 0.0 ± 0.0
3.448ValAsp: 3.448 ± 0.373
3.448ValGlu: 3.448 ± 1.533
5.747ValPhe: 5.747 ± 3.019
1.149ValGly: 1.149 ± 0.87
0.0ValHis: 0.0 ± 0.0
0.0ValIle: 0.0 ± 0.0
1.149ValLys: 1.149 ± 0.848
8.046ValLeu: 8.046 ± 1.999
3.448ValMet: 3.448 ± 0.373
3.448ValAsn: 3.448 ± 1.533
5.747ValPro: 5.747 ± 1.289
2.299ValGln: 2.299 ± 0.739
6.897ValArg: 6.897 ± 1.605
2.299ValSer: 2.299 ± 1.74
0.0ValThr: 0.0 ± 0.0
0.0ValVal: 0.0 ± 0.0
1.149ValTrp: 1.149 ± 0.848
2.299ValTyr: 2.299 ± 1.74
0.0ValXaa: 0.0 ± 0.0
Trp
1.149TrpAla: 1.149 ± 0.848
1.149TrpCys: 1.149 ± 0.87
4.598TrpAsp: 4.598 ± 2.251
0.0TrpGlu: 0.0 ± 0.0
2.299TrpPhe: 2.299 ± 1.034
1.149TrpGly: 1.149 ± 0.848
2.299TrpHis: 2.299 ± 1.74
0.0TrpIle: 0.0 ± 0.0
2.299TrpLys: 2.299 ± 1.034
2.299TrpLeu: 2.299 ± 1.695
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.149TrpGln: 1.149 ± 0.87
1.149TrpArg: 1.149 ± 0.87
3.448TrpSer: 3.448 ± 0.373
1.149TrpThr: 1.149 ± 0.87
2.299TrpVal: 2.299 ± 1.034
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.598TyrAla: 4.598 ± 0.7
1.149TyrCys: 1.149 ± 0.848
3.448TyrAsp: 3.448 ± 2.61
1.149TyrGlu: 1.149 ± 0.848
5.747TyrPhe: 5.747 ± 0.381
3.448TyrGly: 3.448 ± 2.61
1.149TyrHis: 1.149 ± 0.87
3.448TyrIle: 3.448 ± 0.373
5.747TyrLys: 5.747 ± 1.509
0.0TyrLeu: 0.0 ± 0.0
1.149TyrMet: 1.149 ± 0.87
1.149TyrAsn: 1.149 ± 0.87
3.448TyrPro: 3.448 ± 0.373
1.149TyrGln: 1.149 ± 0.87
6.897TyrArg: 6.897 ± 0.747
1.149TyrSer: 1.149 ± 0.87
3.448TyrThr: 3.448 ± 0.373
4.598TyrVal: 4.598 ± 0.85
1.149TyrTrp: 1.149 ± 0.87
4.598TyrTyr: 4.598 ± 2.178
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (871 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
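One way to read "bootstrapping at the protein level" is to resample whole proteins with replacement, recompute the frequency for each resample, and report the spread of the results. The sketch below does this for a single dipeptide. The function names, the use of the population standard deviation as the error, and the fixed random seed are assumptions for illustration, not the documented Proteome-pI procedure.

```python
import random
from collections import Counter
from statistics import pstdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a set of proteins."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, rounds=100, seed=0):
    """Spread of the frequency over protein-level resamples (assumed: std dev)."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return pstdev(estimates)


# Toy sequences, not data from this proteome.
error = bootstrap_error(["MARRA", "GGK", "RRRR"], "RR")
```

A dipeptide that never occurs gets an error of 0.0 under this scheme, which matches the "0.0 ± 0.0" entries in the table. With only 3 proteins, the resamples are few and coarse, so large errors such as the one reported for ArgArg are plausible.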

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski