Amino acid dipeptide frequency for Human PoSCV5-like circular virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
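As a rough illustration of the calculation, the following Python sketch counts overlapping pairs of adjacent residues within each protein and reports them in per mille (‰). The function name, the one-letter toy sequences, and the choice to normalise by the total number of pairs are assumptions made for this example; the exact procedure used by Proteome-pI is not stated on this page.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences in one-letter code (not the viral proteins).
toy = ["MKKLA", "AKL"]
freqs = dipeptide_frequencies(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

The table on this page uses three-letter codes (for example LysLeu for "KL"); mapping between the two notations is left out of the sketch for brevity.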

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each one would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
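The uniform expectation and the over/under comparison can be sketched in a few lines of Python. This is only an illustration: the function names are hypothetical, the 20-letter alphabet is assumed as the default baseline, and the comparison ignores the ± error of each value, which the page's own colouring may or may not take into account.

```python
def expected_permille(alphabet_size=20):
    """Uniform expectation for a single dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation."""
    expected = expected_permille(alphabet_size)
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


print(expected_permille(20))  # 2.5
print(classify(3.468))        # AlaAla value from the table: over
print(classify(0.0))          # AlaCys value from the table: under
```

With 21 symbols (Xaa included) the expectation drops to 1000/441, roughly 2.27‰, so a value such as 2.312‰ changes from "under" to "over" depending on the baseline chosen.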

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
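As a small worked example of the unit conversion, the Python sketch below turns a per mille value into a plain fraction and into a percentage. The helper names are made up for this illustration; the input is the LysLeu value from the table.

```python
def permille_to_fraction(value):
    """Per mille to plain fraction: multiply by 10^-3."""
    return value * 1e-3


def permille_to_percent(value):
    """Per mille to percent: divide by 10."""
    return value / 10.0


lys_leu = 9.249  # LysLeu frequency in per mille, taken from the table
print(permille_to_fraction(lys_leu))  # about 0.009249
print(permille_to_percent(lys_leu))   # about 0.9249 (percent)
```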

For more information, see the sequence space article on Wikipedia.

Rows give the first amino acid of the dipeptide, columns the second; each cell is frequency ± error in ‰.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 3.468 ± 0.252 | 0.0 ± 0.0 | 2.312 ± 0.357 | 3.468 ± 0.252 | 1.156 ± 0.966 | 1.156 ± 0.609 | 0.0 ± 0.0 | 2.312 ± 0.357 | 8.092 ± 1.113 | 6.936 ± 0.504 | 2.312 ± 1.218 | 2.312 ± 1.218 | 2.312 ± 0.357 | 3.468 ± 0.252 | 5.78 ± 1.47 | 6.936 ± 3.653 | 0.0 ± 0.0 | 4.624 ± 2.288 | 0.0 ± 0.0 | 1.156 ± 0.966 | 0.0 ± 0.0
Cys | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.312 ± 0.357 | 2.312 ± 0.357 | 0.0 ± 0.0 | 1.156 ± 0.609 | 1.156 ± 0.966 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.156 ± 0.609 | 1.156 ± 0.966 | 1.156 ± 0.609 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.156 ± 0.609 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Asp | 2.312 ± 1.218 | 1.156 ± 0.609 | 4.624 ± 0.714 | 0.0 ± 0.0 | 0.0 ± 0.0 | 5.78 ± 3.254 | 1.156 ± 0.609 | 3.468 ± 1.322 | 0.0 ± 0.0 | 8.092 ± 2.036 | 1.156 ± 0.609 | 6.936 ± 1.07 | 0.0 ± 0.0 | 0.0 ± 0.0 | 5.78 ± 0.105 | 2.312 ± 1.218 | 1.156 ± 0.609 | 3.468 ± 0.252 | 0.0 ± 0.0 | 2.312 ± 1.931 | 0.0 ± 0.0
Glu | 2.312 ± 0.357 | 1.156 ± 0.966 | 1.156 ± 0.609 | 8.092 ± 6.76 | 4.624 ± 0.714 | 0.0 ± 0.0 | 1.156 ± 0.966 | 2.312 ± 0.357 | 5.78 ± 1.679 | 5.78 ± 0.105 | 2.312 ± 0.357 | 2.312 ± 0.357 | 2.312 ± 0.357 | 5.78 ± 1.679 | 4.624 ± 0.714 | 2.312 ± 1.218 | 4.624 ± 0.861 | 1.156 ± 0.966 | 0.0 ± 0.0 | 2.312 ± 1.931 | 0.0 ± 0.0
Phe | 1.156 ± 0.609 | 0.0 ± 0.0 | 5.78 ± 1.679 | 1.156 ± 0.966 | 2.312 ± 1.218 | 3.468 ± 1.827 | 0.0 ± 0.0 | 2.312 ± 1.218 | 4.624 ± 0.714 | 2.312 ± 1.931 | 0.0 ± 0.0 | 3.468 ± 1.827 | 1.156 ± 0.966 | 0.0 ± 0.0 | 2.312 ± 1.218 | 1.156 ± 0.609 | 3.468 ± 1.827 | 4.624 ± 0.714 | 1.156 ± 0.609 | 3.468 ± 0.252 | 0.0 ± 0.0
Gly | 4.624 ± 2.436 | 0.0 ± 0.0 | 2.312 ± 1.931 | 3.468 ± 1.322 | 2.312 ± 0.357 | 3.468 ± 1.322 | 0.0 ± 0.0 | 1.156 ± 0.609 | 6.936 ± 2.645 | 2.312 ± 1.218 | 1.156 ± 0.966 | 6.936 ± 1.07 | 1.156 ± 0.966 | 3.468 ± 0.252 | 1.156 ± 0.609 | 1.156 ± 0.966 | 1.156 ± 0.609 | 4.624 ± 2.436 | 0.0 ± 0.0 | 4.624 ± 0.861 | 0.0 ± 0.0
His | 2.312 ± 0.357 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.312 ± 1.218 | 1.156 ± 0.609 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.156 ± 0.609 | 2.312 ± 0.357 | 0.0 ± 0.0 | 1.156 ± 0.609 | 2.312 ± 0.357 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.468 ± 0.252 | 0.0 ± 0.0 | 1.156 ± 0.609 | 0.0 ± 0.0 | 1.156 ± 0.966 | 0.0 ± 0.0
Ile | 2.312 ± 0.357 | 0.0 ± 0.0 | 3.468 ± 0.252 | 6.936 ± 0.504 | 1.156 ± 0.966 | 2.312 ± 0.357 | 2.312 ± 1.931 | 5.78 ± 1.679 | 4.624 ± 2.288 | 4.624 ± 0.714 | 1.156 ± 0.966 | 1.156 ± 0.966 | 2.312 ± 0.357 | 2.312 ± 1.218 | 4.624 ± 0.714 | 4.624 ± 0.861 | 1.156 ± 0.966 | 2.312 ± 0.357 | 1.156 ± 0.966 | 1.156 ± 0.609 | 0.0 ± 0.0
Lys | 8.092 ± 0.461 | 0.0 ± 0.0 | 3.468 ± 1.322 | 3.468 ± 0.252 | 1.156 ± 0.609 | 6.936 ± 0.504 | 2.312 ± 1.218 | 4.624 ± 2.288 | 2.312 ± 0.357 | 9.249 ± 0.147 | 2.312 ± 0.357 | 2.312 ± 0.357 | 2.312 ± 1.218 | 3.468 ± 1.322 | 9.249 ± 1.722 | 2.312 ± 1.218 | 3.468 ± 1.322 | 6.936 ± 0.504 | 4.624 ± 2.288 | 3.468 ± 0.252 | 0.0 ± 0.0
Leu | 2.312 ± 1.218 | 0.0 ± 0.0 | 3.468 ± 2.897 | 0.0 ± 0.0 | 4.624 ± 2.436 | 3.468 ± 0.252 | 1.156 ± 0.609 | 5.78 ± 3.254 | 5.78 ± 1.679 | 6.936 ± 4.219 | 2.312 ± 1.931 | 5.78 ± 1.679 | 6.936 ± 2.079 | 1.156 ± 0.609 | 2.312 ± 1.931 | 11.561 ± 1.365 | 10.405 ± 0.818 | 3.468 ± 0.252 | 0.0 ± 0.0 | 4.624 ± 0.861 | 0.0 ± 0.0
Met | 2.312 ± 1.218 | 0.0 ± 0.0 | 0.0 ± 0.0 | 2.312 ± 1.931 | 1.156 ± 0.609 | 0.0 ± 0.0 | 1.156 ± 0.609 | 2.312 ± 1.931 | 0.0 ± 0.0 | 1.156 ± 0.966 | 0.0 ± 0.0 | 1.156 ± 0.966 | 0.0 ± 0.0 | 2.312 ± 1.218 | 0.0 ± 0.0 | 2.312 ± 1.218 | 2.312 ± 0.357 | 2.312 ± 1.218 | 1.156 ± 0.966 | 0.0 ± 0.0 | 0.0 ± 0.0
Asn | 5.78 ± 0.105 | 1.156 ± 0.609 | 3.468 ± 1.322 | 2.312 ± 0.357 | 1.156 ± 0.609 | 2.312 ± 0.357 | 0.0 ± 0.0 | 3.468 ± 1.322 | 4.624 ± 0.714 | 4.624 ± 2.288 | 2.312 ± 1.144 | 5.78 ± 1.679 | 5.78 ± 3.044 | 3.468 ± 1.827 | 2.312 ± 0.357 | 3.468 ± 1.827 | 1.156 ± 0.966 | 1.156 ± 0.609 | 1.156 ± 0.966 | 3.468 ± 0.252 | 0.0 ± 0.0
Pro | 2.312 ± 1.218 | 1.156 ± 0.609 | 0.0 ± 0.0 | 2.312 ± 0.357 | 2.312 ± 1.218 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.468 ± 0.252 | 2.312 ± 0.357 | 5.78 ± 1.679 | 1.156 ± 0.609 | 4.624 ± 0.861 | 2.312 ± 1.931 | 1.156 ± 0.609 | 1.156 ± 0.609 | 5.78 ± 1.47 | 4.624 ± 0.714 | 3.468 ± 0.252 | 0.0 ± 0.0 | 4.624 ± 2.436 | 0.0 ± 0.0
Gln | 3.468 ± 0.252 | 0.0 ± 0.0 | 2.312 ± 1.218 | 3.468 ± 1.322 | 0.0 ± 0.0 | 2.312 ± 0.357 | 2.312 ± 1.218 | 3.468 ± 0.252 | 3.468 ± 0.252 | 2.312 ± 0.357 | 1.156 ± 0.609 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.156 ± 0.966 | 0.0 ± 0.0 | 1.156 ± 0.609 | 3.468 ± 0.252 | 2.312 ± 0.357 | 1.156 ± 0.966 | 4.624 ± 0.861 | 0.0 ± 0.0
Arg | 6.936 ± 0.504 | 0.0 ± 0.0 | 1.156 ± 0.966 | 1.156 ± 0.966 | 2.312 ± 0.357 | 2.312 ± 0.357 | 1.156 ± 0.609 | 1.156 ± 0.966 | 5.78 ± 0.105 | 5.78 ± 0.105 | 0.0 ± 0.0 | 3.468 ± 0.252 | 2.312 ± 1.218 | 3.468 ± 0.252 | 6.936 ± 1.07 | 4.624 ± 2.436 | 1.156 ± 0.966 | 2.312 ± 1.218 | 2.312 ± 1.218 | 5.78 ± 1.47 | 0.0 ± 0.0
Ser | 1.156 ± 0.966 | 1.156 ± 0.966 | 4.624 ± 2.436 | 10.405 ± 0.756 | 5.78 ± 3.044 | 4.624 ± 0.861 | 3.468 ± 0.252 | 1.156 ± 0.609 | 6.936 ± 3.653 | 5.78 ± 1.47 | 2.312 ± 1.218 | 3.468 ± 0.252 | 2.312 ± 1.218 | 0.0 ± 0.0 | 2.312 ± 0.357 | 8.092 ± 2.688 | 5.78 ± 3.044 | 0.0 ± 0.0 | 1.156 ± 0.609 | 3.468 ± 1.827 | 0.0 ± 0.0
Thr | 2.312 ± 0.357 | 2.312 ± 0.357 | 1.156 ± 0.966 | 1.156 ± 0.609 | 1.156 ± 0.609 | 6.936 ± 0.504 | 0.0 ± 0.0 | 8.092 ± 2.688 | 4.624 ± 0.714 | 2.312 ± 1.218 | 0.0 ± 0.0 | 1.156 ± 0.609 | 5.78 ± 1.47 | 1.156 ± 0.609 | 3.468 ± 0.252 | 2.312 ± 0.357 | 2.312 ± 0.357 | 1.156 ± 0.609 | 3.468 ± 1.322 | 2.312 ± 0.357 | 0.0 ± 0.0
Val | 1.156 ± 0.966 | 1.156 ± 0.609 | 3.468 ± 0.252 | 1.156 ± 0.966 | 1.156 ± 0.966 | 1.156 ± 0.609 | 0.0 ± 0.0 | 2.312 ± 1.931 | 8.092 ± 2.688 | 1.156 ± 0.609 | 0.0 ± 0.0 | 1.156 ± 0.609 | 5.78 ± 1.47 | 2.312 ± 1.931 | 2.312 ± 0.357 | 5.78 ± 1.47 | 3.468 ± 1.827 | 3.468 ± 1.322 | 0.0 ± 0.0 | 4.624 ± 0.861 | 1.156 ± 0.966
Trp | 1.156 ± 0.966 | 0.0 ± 0.0 | 2.312 ± 0.357 | 2.312 ± 1.931 | 2.312 ± 1.931 | 0.0 ± 0.0 | 1.156 ± 0.609 | 1.156 ± 0.966 | 2.312 ± 0.357 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.156 ± 0.966 | 1.156 ± 0.609 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 3.468 ± 0.252 | 0.0 ± 0.0
Tyr | 2.312 ± 1.218 | 2.312 ± 0.357 | 4.624 ± 2.436 | 4.624 ± 2.288 | 5.78 ± 0.105 | 2.312 ± 0.357 | 1.156 ± 0.966 | 0.0 ± 0.0 | 3.468 ± 1.827 | 4.624 ± 0.861 | 1.156 ± 0.609 | 5.78 ± 1.679 | 2.312 ± 0.357 | 1.156 ± 0.609 | 4.624 ± 2.436 | 4.624 ± 0.861 | 2.312 ± 1.218 | 2.312 ± 0.357 | 1.156 ± 0.966 | 4.624 ± 2.436 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 1.156 ± 0.966 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0

Statistics based on 2 proteins (866 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
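One plausible reading of "bootstrapping at the protein level" is sketched below in Python: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. This is an assumption-laden illustration, not the Proteome-pI implementation; the function names, the fixed seed, the toy one-letter sequences, and the use of the population standard deviation are all choices made for the example.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Frequency of each overlapping residue pair, in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of one dipeptide's frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample).get(pair, 0.0))
    return statistics.pstdev(estimates)


# Hypothetical toy sequences (not the viral proteins).
toy = ["MKKLA", "AKL"]
print(bootstrap_error(toy, "KL"))
```

With only two proteins, as in this proteome, each resample is one of very few combinations, so an error estimated this way is coarse.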

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski