Amino acid dipeptide frequency for Pacific flying fox faeces associated circular DNA virus-9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
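
As a minimal sketch of the idea, the snippet below counts overlapping residue pairs in a set of protein sequences and reports each pair in per mille. The function name `dipeptide_frequencies`, the use of one-letter codes, and the choice to normalise by the total number of pairs are assumptions made for illustration; the page does not state how Proteome-pI normalises its counts.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs.

    Hypothetical helper: sequences are strings of one-letter amino acid codes.
    """
    counts = Counter()
    for seq in sequences:
        # Slide a window of length 2 along each protein; pairs never span
        # the boundary between two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the total number of observed pairs.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not taken from the proteome on this page.
freqs = dipeptide_frequencies(["MAAG", "GAA"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The toy input contains five pairs in total, two of which are AA, so AA comes out at 400 per mille.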

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
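
The expected value and the over/under distinction can be sketched as follows. The function names and the plain comparison against the uniform expectation are assumptions; the page does not say which threshold is used for the red and blue marking.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, alphabet_size=20):
    """Label a frequency relative to the uniform expectation.

    Assumption: a simple greater-than / less-than comparison, with no
    tolerance band around the expected value.
    """
    expected = expected_per_mille(alphabet_size)
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


print(expected_per_mille(20))            # 400 possible dipeptides
print(round(expected_per_mille(21), 3))  # 441 possible dipeptides, with Xaa
print(classify(5.714), classify(1.143))
```

With 20 amino acids the expectation is 2.5‰; with Xaa included it drops to about 2.268‰.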

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
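
A small illustration of the unit conversion, using hypothetical helper names:

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


# 5.714 is the AlaAla value listed on this page.
print(per_mille_to_fraction(5.714))
print(per_mille_to_percent(5.714))
```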

For more information, see the sequence space article on Wikipedia.

Dipeptide frequency (‰) ± error, listed by first residue. Residue order: Ala, Cys, Asp, Glu, Phe, Gly, His, Ile, Lys, Leu, Met, Asn, Pro, Gln, Arg, Ser, Thr, Val, Trp, Tyr, Xaa

Ala
AlaAla: 5.714 ± 4.195
AlaCys: 1.143 ± 1.109
AlaAsp: 2.286 ± 1.834
AlaGlu: 3.429 ± 2.517
AlaPhe: 0.0 ± 0.0
AlaGly: 8.0 ± 3.462
AlaHis: 2.286 ± 1.678
AlaIle: 3.429 ± 1.901
AlaLys: 2.286 ± 0.946
AlaLeu: 2.286 ± 1.678
AlaMet: 1.143 ± 0.917
AlaAsn: 3.429 ± 0.18
AlaPro: 2.286 ± 1.678
AlaGln: 2.286 ± 0.946
AlaArg: 5.714 ± 0.789
AlaSer: 1.143 ± 0.839
AlaThr: 2.286 ± 2.219
AlaVal: 3.429 ± 1.557
AlaTrp: 0.0 ± 0.0
AlaTyr: 4.571 ± 0.905
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 1.143 ± 0.839
CysGlu: 0.0 ± 0.0
CysPhe: 1.143 ± 0.917
CysGly: 1.143 ± 1.109
CysHis: 1.143 ± 1.109
CysIle: 1.143 ± 0.917
CysLys: 2.286 ± 2.219
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.143 ± 1.109
CysPro: 0.0 ± 0.0
CysGln: 1.143 ± 1.109
CysArg: 1.143 ± 1.109
CysSer: 2.286 ± 2.219
CysThr: 2.286 ± 0.963
CysVal: 1.143 ± 1.109
CysTrp: 1.143 ± 1.109
CysTyr: 3.429 ± 3.328
CysXaa: 0.0 ± 0.0

Asp
AspAla: 0.0 ± 0.0
AspCys: 0.0 ± 0.0
AspAsp: 2.286 ± 1.678
AspGlu: 1.143 ± 0.839
AspPhe: 1.143 ± 0.839
AspGly: 4.571 ± 2.285
AspHis: 0.0 ± 0.0
AspIle: 3.429 ± 2.517
AspLys: 0.0 ± 0.0
AspLeu: 3.429 ± 0.18
AspMet: 2.286 ± 0.946
AspAsn: 2.286 ± 0.946
AspPro: 1.143 ± 0.917
AspGln: 3.429 ± 2.752
AspArg: 5.714 ± 2.407
AspSer: 3.429 ± 2.752
AspThr: 2.286 ± 1.678
AspVal: 0.0 ± 0.0
AspTrp: 0.0 ± 0.0
AspTyr: 2.286 ± 1.678
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.286 ± 1.678
GluCys: 0.0 ± 0.0
GluAsp: 2.286 ± 1.678
GluGlu: 4.571 ± 2.132
GluPhe: 1.143 ± 0.839
GluGly: 1.143 ± 0.917
GluHis: 0.0 ± 0.0
GluIle: 0.0 ± 0.0
GluLys: 0.0 ± 0.0
GluLeu: 3.429 ± 2.517
GluMet: 0.0 ± 0.0
GluAsn: 0.0 ± 0.0
GluPro: 1.143 ± 0.839
GluGln: 3.429 ± 1.535
GluArg: 1.143 ± 0.839
GluSer: 0.0 ± 0.0
GluThr: 5.714 ± 1.532
GluVal: 2.286 ± 1.834
GluTrp: 1.143 ± 0.839
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.286 ± 0.963
PheCys: 1.143 ± 1.109
PheAsp: 2.286 ± 1.678
PheGlu: 1.143 ± 0.839
PhePhe: 3.429 ± 1.895
PheGly: 1.143 ± 0.839
PheHis: 1.143 ± 0.839
PheIle: 1.143 ± 1.109
PheLys: 0.0 ± 0.0
PheLeu: 8.0 ± 2.438
PheMet: 0.0 ± 0.0
PheAsn: 0.0 ± 0.0
PhePro: 1.143 ± 0.839
PheGln: 0.0 ± 0.0
PheArg: 0.0 ± 0.0
PheSer: 4.571 ± 1.986
PheThr: 6.857 ± 2.076
PheVal: 3.429 ± 1.664
PheTrp: 2.286 ± 0.946
PheTyr: 4.571 ± 1.278
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 10.286 ± 2.678
GlyCys: 0.0 ± 0.0
GlyAsp: 2.286 ± 1.834
GlyGlu: 3.429 ± 1.535
GlyPhe: 2.286 ± 1.834
GlyGly: 2.286 ± 0.946
GlyHis: 0.0 ± 0.0
GlyIle: 0.0 ± 0.0
GlyLys: 3.429 ± 0.18
GlyLeu: 5.714 ± 3.082
GlyMet: 2.286 ± 0.946
GlyAsn: 3.429 ± 0.18
GlyPro: 3.429 ± 1.426
GlyGln: 3.429 ± 1.535
GlyArg: 2.286 ± 1.834
GlySer: 5.714 ± 1.814
GlyThr: 4.571 ± 0.905
GlyVal: 4.571 ± 3.356
GlyTrp: 1.143 ± 0.839
GlyTyr: 2.286 ± 0.946
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.143 ± 0.839
HisCys: 0.0 ± 0.0
HisAsp: 2.286 ± 0.993
HisGlu: 1.143 ± 0.839
HisPhe: 2.286 ± 1.678
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 1.143 ± 1.109
HisLys: 0.0 ± 0.0
HisLeu: 1.143 ± 1.109
HisMet: 0.0 ± 0.0
HisAsn: 2.286 ± 0.963
HisPro: 2.286 ± 2.219
HisGln: 0.0 ± 0.0
HisArg: 3.429 ± 1.901
HisSer: 3.429 ± 3.328
HisThr: 0.0 ± 0.0
HisVal: 1.143 ± 0.839
HisTrp: 1.143 ± 0.839
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0

Ile
IleAla: 2.286 ± 1.834
IleCys: 4.571 ± 1.278
IleAsp: 1.143 ± 0.839
IleGlu: 0.0 ± 0.0
IlePhe: 3.429 ± 1.895
IleGly: 1.143 ± 0.917
IleHis: 1.143 ± 1.109
IleIle: 5.714 ± 4.046
IleLys: 1.143 ± 0.839
IleLeu: 4.571 ± 1.927
IleMet: 1.143 ± 0.917
IleAsn: 4.571 ± 0.905
IlePro: 2.286 ± 0.993
IleGln: 1.143 ± 0.839
IleArg: 5.714 ± 2.909
IleSer: 6.857 ± 5.117
IleThr: 3.429 ± 0.18
IleVal: 9.143 ± 0.646
IleTrp: 1.143 ± 1.109
IleTyr: 5.714 ± 4.046
IleXaa: 0.0 ± 0.0

Lys
LysAla: 0.0 ± 0.0
LysCys: 0.0 ± 0.0
LysAsp: 1.143 ± 0.917
LysGlu: 1.143 ± 0.839
LysPhe: 4.571 ± 0.698
LysGly: 3.429 ± 0.18
LysHis: 1.143 ± 1.109
LysIle: 3.429 ± 2.752
LysLys: 2.286 ± 0.993
LysLeu: 2.286 ± 0.946
LysMet: 0.0 ± 0.0
LysAsn: 1.143 ± 0.917
LysPro: 2.286 ± 0.993
LysGln: 0.0 ± 0.0
LysArg: 3.429 ± 1.895
LysSer: 3.429 ± 0.18
LysThr: 3.429 ± 1.895
LysVal: 2.286 ± 0.993
LysTrp: 0.0 ± 0.0
LysTyr: 3.429 ± 1.895
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 2.286 ± 0.946
LeuCys: 2.286 ± 0.993
LeuAsp: 5.714 ± 4.195
LeuGlu: 1.143 ± 0.839
LeuPhe: 3.429 ± 1.426
LeuGly: 3.429 ± 2.517
LeuHis: 3.429 ± 1.426
LeuIle: 4.571 ± 1.927
LeuLys: 1.143 ± 0.839
LeuLeu: 3.429 ± 0.18
LeuMet: 2.286 ± 1.678
LeuAsn: 1.143 ± 1.109
LeuPro: 3.429 ± 0.18
LeuGln: 6.857 ± 1.563
LeuArg: 6.857 ± 2.076
LeuSer: 4.571 ± 2.132
LeuThr: 4.571 ± 0.905
LeuVal: 9.143 ± 2.555
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.429 ± 1.895
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.143 ± 1.109
MetCys: 1.143 ± 1.109
MetAsp: 1.143 ± 0.839
MetGlu: 1.143 ± 0.917
MetPhe: 1.143 ± 0.839
MetGly: 0.0 ± 0.0
MetHis: 1.143 ± 0.839
MetIle: 3.429 ± 1.664
MetLys: 0.0 ± 0.0
MetLeu: 3.429 ± 1.426
MetMet: 0.0 ± 0.0
MetAsn: 1.143 ± 0.917
MetPro: 3.429 ± 2.752
MetGln: 1.143 ± 0.839
MetArg: 1.143 ± 0.917
MetSer: 0.0 ± 0.0
MetThr: 2.286 ± 0.946
MetVal: 1.143 ± 0.917
MetTrp: 0.0 ± 0.0
MetTyr: 1.143 ± 0.917
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.429 ± 1.535
AsnCys: 1.143 ± 1.109
AsnAsp: 1.143 ± 0.839
AsnGlu: 0.0 ± 0.0
AsnPhe: 1.143 ± 0.839
AsnGly: 2.286 ± 1.834
AsnHis: 1.143 ± 1.109
AsnIle: 3.429 ± 0.18
AsnLys: 3.429 ± 1.557
AsnLeu: 1.143 ± 0.839
AsnMet: 1.143 ± 0.917
AsnAsn: 3.429 ± 2.752
AsnPro: 0.0 ± 0.0
AsnGln: 2.286 ± 0.993
AsnArg: 1.143 ± 0.839
AsnSer: 5.714 ± 1.814
AsnThr: 4.571 ± 2.96
AsnVal: 2.286 ± 1.834
AsnTrp: 2.286 ± 1.678
AsnTyr: 1.143 ± 0.839
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.286 ± 0.993
ProCys: 2.286 ± 2.219
ProAsp: 1.143 ± 0.917
ProGlu: 1.143 ± 0.839
ProPhe: 1.143 ± 1.109
ProGly: 6.857 ± 1.391
ProHis: 0.0 ± 0.0
ProIle: 3.429 ± 1.895
ProLys: 2.286 ± 0.993
ProLeu: 1.143 ± 1.109
ProMet: 2.286 ± 0.946
ProAsn: 0.0 ± 0.0
ProPro: 1.143 ± 1.109
ProGln: 2.286 ± 0.963
ProArg: 3.429 ± 1.535
ProSer: 4.571 ± 2.354
ProThr: 0.0 ± 0.0
ProVal: 5.714 ± 2.407
ProTrp: 2.286 ± 1.678
ProTyr: 2.286 ± 0.946
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 3.429 ± 2.517
GlnCys: 1.143 ± 1.109
GlnAsp: 1.143 ± 0.839
GlnGlu: 1.143 ± 0.917
GlnPhe: 1.143 ± 0.917
GlnGly: 2.286 ± 1.834
GlnHis: 1.143 ± 0.917
GlnIle: 3.429 ± 1.895
GlnLys: 4.571 ± 4.438
GlnLeu: 1.143 ± 0.839
GlnMet: 1.143 ± 0.917
GlnAsn: 1.143 ± 1.109
GlnPro: 4.571 ± 2.515
GlnGln: 5.714 ± 4.586
GlnArg: 5.714 ± 2.546
GlnSer: 1.143 ± 0.839
GlnThr: 4.571 ± 1.278
GlnVal: 3.429 ± 1.535
GlnTrp: 1.143 ± 0.839
GlnTyr: 6.857 ± 2.728
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.714 ± 0.789
ArgCys: 0.0 ± 0.0
ArgAsp: 2.286 ± 1.678
ArgGlu: 1.143 ± 1.109
ArgPhe: 1.143 ± 0.839
ArgGly: 5.714 ± 3.082
ArgHis: 2.286 ± 2.219
ArgIle: 5.714 ± 1.039
ArgLys: 2.286 ± 1.834
ArgLeu: 8.0 ± 3.735
ArgMet: 2.286 ± 1.834
ArgAsn: 6.857 ± 3.069
ArgPro: 5.714 ± 0.789
ArgGln: 5.714 ± 3.4
ArgArg: 8.0 ± 3.462
ArgSer: 6.857 ± 1.563
ArgThr: 4.571 ± 0.905
ArgVal: 4.571 ± 1.986
ArgTrp: 0.0 ± 0.0
ArgTyr: 5.714 ± 1.155
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 1.143 ± 0.917
SerCys: 1.143 ± 1.109
SerAsp: 0.0 ± 0.0
SerGlu: 1.143 ± 0.917
SerPhe: 5.714 ± 3.217
SerGly: 2.286 ± 0.946
SerHis: 1.143 ± 1.109
SerIle: 4.571 ± 0.905
SerLys: 3.429 ± 1.557
SerLeu: 5.714 ± 1.039
SerMet: 1.143 ± 0.777
SerAsn: 2.286 ± 1.678
SerPro: 2.286 ± 0.963
SerGln: 5.714 ± 2.386
SerArg: 8.0 ± 1.733
SerSer: 3.429 ± 1.895
SerThr: 12.571 ± 6.584
SerVal: 2.286 ± 0.993
SerTrp: 0.0 ± 0.0
SerTyr: 2.286 ± 0.963
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 9.143 ± 0.646
ThrCys: 2.286 ± 2.219
ThrAsp: 2.286 ± 1.834
ThrGlu: 1.143 ± 0.839
ThrPhe: 3.429 ± 0.18
ThrGly: 4.571 ± 1.892
ThrHis: 2.286 ± 0.963
ThrIle: 8.0 ± 4.824
ThrLys: 5.714 ± 0.789
ThrLeu: 3.429 ± 1.901
ThrMet: 2.286 ± 0.963
ThrAsn: 1.143 ± 0.917
ThrPro: 3.429 ± 1.901
ThrGln: 3.429 ± 1.557
ThrArg: 5.714 ± 1.039
ThrSer: 3.429 ± 1.426
ThrThr: 3.429 ± 1.426
ThrVal: 2.286 ± 0.946
ThrTrp: 1.143 ± 0.917
ThrTyr: 5.714 ± 4.024
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.286 ± 0.993
ValCys: 0.0 ± 0.0
ValAsp: 3.429 ± 1.664
ValGlu: 2.286 ± 1.678
ValPhe: 3.429 ± 1.895
ValGly: 6.857 ± 2.837
ValHis: 2.286 ± 0.963
ValIle: 4.571 ± 2.132
ValLys: 3.429 ± 2.752
ValLeu: 2.286 ± 1.834
ValMet: 3.429 ± 0.641
ValAsn: 5.714 ± 2.445
ValPro: 2.286 ± 0.963
ValGln: 4.571 ± 0.905
ValArg: 4.571 ± 2.515
ValSer: 3.429 ± 1.895
ValThr: 4.571 ± 2.285
ValVal: 2.286 ± 1.678
ValTrp: 1.143 ± 0.839
ValTyr: 3.429 ± 0.18
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.0 ± 0.0
TrpCys: 0.0 ± 0.0
TrpAsp: 1.143 ± 0.839
TrpGlu: 1.143 ± 0.839
TrpPhe: 1.143 ± 0.839
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 2.286 ± 1.678
TrpLys: 0.0 ± 0.0
TrpLeu: 1.143 ± 0.839
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 2.286 ± 0.963
TrpArg: 3.429 ± 0.18
TrpSer: 1.143 ± 0.917
TrpThr: 0.0 ± 0.0
TrpVal: 1.143 ± 1.109
TrpTrp: 2.286 ± 1.678
TrpTyr: 2.286 ± 1.678
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.286 ± 0.963
TyrCys: 3.429 ± 1.901
TyrAsp: 2.286 ± 1.834
TyrGlu: 2.286 ± 0.946
TyrPhe: 2.286 ± 2.219
TyrGly: 5.714 ± 1.155
TyrHis: 1.143 ± 1.109
TyrIle: 3.429 ± 3.328
TyrLys: 1.143 ± 1.109
TyrLeu: 11.429 ± 1.622
TyrMet: 1.143 ± 0.839
TyrAsn: 1.143 ± 0.839
TyrPro: 3.429 ± 0.18
TyrGln: 1.143 ± 1.109
TyrArg: 8.0 ± 2.146
TyrSer: 2.286 ± 0.993
TyrThr: 2.286 ± 0.946
TyrVal: 4.571 ± 1.892
TyrTrp: 1.143 ± 1.109
TyrTyr: 3.429 ± 1.901
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3 proteins (876 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
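
A protein-level bootstrap can be sketched roughly as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The function names, the fixed seed, and the use of the standard deviation as the error measure are assumptions; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap resamples.

    Assumption: the error is the population standard deviation of the
    resampled estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the same count.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences, not the three proteins of this proteome.
toy = ["MAAG", "GAAK", "MKKL"]
print(dipeptide_per_mille(toy, "AA"), bootstrap_error(toy, "AA"))
```

Under this scheme a dipeptide that never occurs in any protein yields 0.0 in every resample, so its error is also 0.0, which matches the form of the zero rows on this page.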

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski