Amino acid dipepetide frequency for Po-Circo-like virus 41

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.837AlaAla: 2.837 ± 2.998
0.0AlaCys: 0.0 ± 0.0
1.418AlaAsp: 1.418 ± 1.036
1.418AlaGlu: 1.418 ± 1.036
0.0AlaPhe: 0.0 ± 0.0
5.674AlaGly: 5.674 ± 3.537
2.837AlaHis: 2.837 ± 2.071
5.674AlaIle: 5.674 ± 2.357
2.837AlaLys: 2.837 ± 1.185
5.674AlaLeu: 5.674 ± 2.041
0.0AlaMet: 0.0 ± 0.0
0.0AlaAsn: 0.0 ± 0.0
0.0AlaPro: 0.0 ± 0.0
1.418AlaGln: 1.418 ± 1.499
4.255AlaArg: 4.255 ± 1.636
5.674AlaSer: 5.674 ± 4.329
4.255AlaThr: 4.255 ± 2.49
0.0AlaVal: 0.0 ± 0.0
0.0AlaTrp: 0.0 ± 0.0
2.837AlaTyr: 2.837 ± 1.768
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.418CysCys: 1.418 ± 1.436
2.837CysAsp: 2.837 ± 1.179
1.418CysGlu: 1.418 ± 1.036
2.837CysPhe: 2.837 ± 2.071
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
1.418CysLeu: 1.418 ± 1.436
0.0CysMet: 0.0 ± 0.0
1.418CysAsn: 1.418 ± 1.499
0.0CysPro: 0.0 ± 0.0
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
4.255CysSer: 4.255 ± 0.733
1.418CysThr: 1.418 ± 1.436
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.418AspAla: 1.418 ± 1.436
2.837AspCys: 2.837 ± 2.071
2.837AspAsp: 2.837 ± 2.071
2.837AspGlu: 2.837 ± 2.071
2.837AspPhe: 2.837 ± 1.185
7.092AspGly: 7.092 ± 2.741
1.418AspHis: 1.418 ± 1.036
1.418AspIle: 1.418 ± 1.436
5.674AspLys: 5.674 ± 4.143
1.418AspLeu: 1.418 ± 1.036
1.418AspMet: 1.418 ± 0.825
1.418AspAsn: 1.418 ± 1.036
8.511AspPro: 8.511 ± 3.399
0.0AspGln: 0.0 ± 0.0
7.092AspArg: 7.092 ± 5.178
4.255AspSer: 4.255 ± 2.49
1.418AspThr: 1.418 ± 1.036
1.418AspVal: 1.418 ± 1.036
0.0AspTrp: 0.0 ± 0.0
2.837AspTyr: 2.837 ± 1.185
0.0AspXaa: 0.0 ± 0.0
Glu
4.255GluAla: 4.255 ± 1.7
1.418GluCys: 1.418 ± 1.036
2.837GluAsp: 2.837 ± 2.071
7.092GluGlu: 7.092 ± 5.178
1.418GluPhe: 1.418 ± 1.036
4.255GluGly: 4.255 ± 1.636
2.837GluHis: 2.837 ± 2.071
2.837GluIle: 2.837 ± 1.185
0.0GluLys: 0.0 ± 0.0
1.418GluLeu: 1.418 ± 1.036
0.0GluMet: 0.0 ± 0.0
1.418GluAsn: 1.418 ± 1.436
2.837GluPro: 2.837 ± 1.179
2.837GluGln: 2.837 ± 2.071
1.418GluArg: 1.418 ± 1.036
0.0GluSer: 0.0 ± 0.0
5.674GluThr: 5.674 ± 1.952
2.837GluVal: 2.837 ± 1.768
1.418GluTrp: 1.418 ± 1.036
2.837GluTyr: 2.837 ± 1.185
0.0GluXaa: 0.0 ± 0.0
Phe
1.418PheAla: 1.418 ± 1.499
0.0PheCys: 0.0 ± 0.0
4.255PheAsp: 4.255 ± 1.7
1.418PheGlu: 1.418 ± 1.036
2.837PhePhe: 2.837 ± 2.071
1.418PheGly: 1.418 ± 1.499
0.0PheHis: 0.0 ± 0.0
5.674PheIle: 5.674 ± 1.952
1.418PheLys: 1.418 ± 1.036
4.255PheLeu: 4.255 ± 0.733
2.837PheMet: 2.837 ± 1.185
4.255PheAsn: 4.255 ± 1.7
2.837PhePro: 2.837 ± 1.185
2.837PheGln: 2.837 ± 1.179
1.418PheArg: 1.418 ± 1.036
2.837PheSer: 2.837 ± 1.185
9.929PheThr: 9.929 ± 3.41
0.0PheVal: 0.0 ± 0.0
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.674GlyAla: 5.674 ± 1.952
0.0GlyCys: 0.0 ± 0.0
4.255GlyAsp: 4.255 ± 1.636
0.0GlyGlu: 0.0 ± 0.0
4.255GlyPhe: 4.255 ± 2.421
2.837GlyGly: 2.837 ± 2.071
1.418GlyHis: 1.418 ± 1.436
2.837GlyIle: 2.837 ± 1.768
7.092GlyLys: 7.092 ± 5.211
2.837GlyLeu: 2.837 ± 1.768
0.0GlyMet: 0.0 ± 0.0
5.674GlyAsn: 5.674 ± 3.937
1.418GlyPro: 1.418 ± 1.036
1.418GlyGln: 1.418 ± 1.499
4.255GlyArg: 4.255 ± 1.7
7.092GlySer: 7.092 ± 1.278
9.929GlyThr: 9.929 ± 0.43
2.837GlyVal: 2.837 ± 2.998
1.418GlyTrp: 1.418 ± 1.036
2.837GlyTyr: 2.837 ± 1.185
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.418HisAsp: 1.418 ± 1.036
1.418HisGlu: 1.418 ± 1.036
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
0.0HisHis: 0.0 ± 0.0
2.837HisIle: 2.837 ± 2.071
0.0HisLys: 0.0 ± 0.0
4.255HisLeu: 4.255 ± 3.107
0.0HisMet: 0.0 ± 0.0
4.255HisAsn: 4.255 ± 1.7
1.418HisPro: 1.418 ± 1.036
0.0HisGln: 0.0 ± 0.0
2.837HisArg: 2.837 ± 2.071
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
1.418HisVal: 1.418 ± 1.436
0.0HisTrp: 0.0 ± 0.0
1.418HisTyr: 1.418 ± 1.436
0.0HisXaa: 0.0 ± 0.0
Ile
2.837IleAla: 2.837 ± 1.179
2.837IleCys: 2.837 ± 2.998
8.511IleAsp: 8.511 ± 4.842
1.418IleGlu: 1.418 ± 1.436
2.837IlePhe: 2.837 ± 1.179
1.418IleGly: 1.418 ± 1.499
2.837IleHis: 2.837 ± 2.071
1.418IleIle: 1.418 ± 1.499
1.418IleLys: 1.418 ± 1.436
0.0IleLeu: 0.0 ± 0.0
0.0IleMet: 0.0 ± 0.0
2.837IleAsn: 2.837 ± 2.071
7.092IlePro: 7.092 ± 3.348
4.255IleGln: 4.255 ± 1.7
1.418IleArg: 1.418 ± 1.036
5.674IleSer: 5.674 ± 3.537
4.255IleThr: 4.255 ± 2.49
8.511IleVal: 8.511 ± 3.399
2.837IleTrp: 2.837 ± 2.071
5.674IleTyr: 5.674 ± 2.471
0.0IleXaa: 0.0 ± 0.0
Lys
0.0LysAla: 0.0 ± 0.0
0.0LysCys: 0.0 ± 0.0
1.418LysAsp: 1.418 ± 1.036
2.837LysGlu: 2.837 ± 1.185
2.837LysPhe: 2.837 ± 1.185
5.674LysGly: 5.674 ± 2.369
1.418LysHis: 1.418 ± 1.036
1.418LysIle: 1.418 ± 1.036
0.0LysLys: 0.0 ± 0.0
2.837LysLeu: 2.837 ± 1.185
0.0LysMet: 0.0 ± 0.0
5.674LysAsn: 5.674 ± 0.303
2.837LysPro: 2.837 ± 1.185
0.0LysGln: 0.0 ± 0.0
1.418LysArg: 1.418 ± 1.036
4.255LysSer: 4.255 ± 3.107
2.837LysThr: 2.837 ± 1.768
1.418LysVal: 1.418 ± 1.436
1.418LysTrp: 1.418 ± 1.036
1.418LysTyr: 1.418 ± 1.036
0.0LysXaa: 0.0 ± 0.0
Leu
4.255LeuAla: 4.255 ± 1.636
1.418LeuCys: 1.418 ± 1.436
5.674LeuAsp: 5.674 ± 2.553
1.418LeuGlu: 1.418 ± 1.036
1.418LeuPhe: 1.418 ± 1.036
0.0LeuGly: 0.0 ± 0.0
1.418LeuHis: 1.418 ± 1.036
5.674LeuIle: 5.674 ± 3.801
1.418LeuLys: 1.418 ± 1.036
1.418LeuLeu: 1.418 ± 1.436
2.837LeuMet: 2.837 ± 1.179
7.092LeuAsn: 7.092 ± 3.505
2.837LeuPro: 2.837 ± 2.873
0.0LeuGln: 0.0 ± 0.0
4.255LeuArg: 4.255 ± 4.497
2.837LeuSer: 2.837 ± 2.873
2.837LeuThr: 2.837 ± 1.185
7.092LeuVal: 7.092 ± 2.501
0.0LeuTrp: 0.0 ± 0.0
1.418LeuTyr: 1.418 ± 1.499
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.0MetCys: 0.0 ± 0.0
2.837MetAsp: 2.837 ± 2.071
1.418MetGlu: 1.418 ± 1.436
1.418MetPhe: 1.418 ± 1.499
0.0MetGly: 0.0 ± 0.0
1.418MetHis: 1.418 ± 1.036
4.255MetIle: 4.255 ± 1.636
1.418MetLys: 1.418 ± 1.036
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
2.837MetAsn: 2.837 ± 1.185
1.418MetPro: 1.418 ± 1.036
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
1.418MetSer: 1.418 ± 1.036
0.0MetThr: 0.0 ± 0.0
1.418MetVal: 1.418 ± 1.499
0.0MetTrp: 0.0 ± 0.0
1.418MetTyr: 1.418 ± 1.436
0.0MetXaa: 0.0 ± 0.0
Asn
2.837AsnAla: 2.837 ± 2.998
0.0AsnCys: 0.0 ± 0.0
1.418AsnAsp: 1.418 ± 1.036
4.255AsnGlu: 4.255 ± 1.636
7.092AsnPhe: 7.092 ± 2.657
5.674AsnGly: 5.674 ± 1.952
1.418AsnHis: 1.418 ± 1.436
7.092AsnIle: 7.092 ± 1.337
0.0AsnLys: 0.0 ± 0.0
8.511AsnLeu: 8.511 ± 4.769
1.418AsnMet: 1.418 ± 1.036
2.837AsnAsn: 2.837 ± 1.185
1.418AsnPro: 1.418 ± 1.036
4.255AsnGln: 4.255 ± 0.733
5.674AsnArg: 5.674 ± 0.303
2.837AsnSer: 2.837 ± 1.768
1.418AsnThr: 1.418 ± 1.436
4.255AsnVal: 4.255 ± 2.49
1.418AsnTrp: 1.418 ± 1.036
4.255AsnTyr: 4.255 ± 2.421
0.0AsnXaa: 0.0 ± 0.0
Pro
4.255ProAla: 4.255 ± 2.947
0.0ProCys: 0.0 ± 0.0
0.0ProAsp: 0.0 ± 0.0
2.837ProGlu: 2.837 ± 1.185
1.418ProPhe: 1.418 ± 1.036
4.255ProGly: 4.255 ± 2.421
2.837ProHis: 2.837 ± 2.071
4.255ProIle: 4.255 ± 1.7
2.837ProLys: 2.837 ± 1.185
1.418ProLeu: 1.418 ± 1.036
0.0ProMet: 0.0 ± 0.0
7.092ProAsn: 7.092 ± 1.339
4.255ProPro: 4.255 ± 2.421
0.0ProGln: 0.0 ± 0.0
1.418ProArg: 1.418 ± 1.036
5.674ProSer: 5.674 ± 2.553
5.674ProThr: 5.674 ± 4.329
7.092ProVal: 7.092 ± 1.337
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
1.418GlnAla: 1.418 ± 1.036
0.0GlnCys: 0.0 ± 0.0
1.418GlnAsp: 1.418 ± 1.036
2.837GlnGlu: 2.837 ± 2.071
1.418GlnPhe: 1.418 ± 1.436
4.255GlnGly: 4.255 ± 2.49
0.0GlnHis: 0.0 ± 0.0
0.0GlnIle: 0.0 ± 0.0
0.0GlnLys: 0.0 ± 0.0
1.418GlnLeu: 1.418 ± 1.499
1.418GlnMet: 1.418 ± 1.029
1.418GlnAsn: 1.418 ± 1.499
1.418GlnPro: 1.418 ± 1.499
1.418GlnGln: 1.418 ± 1.436
2.837GlnArg: 2.837 ± 2.071
8.511GlnSer: 8.511 ± 5.704
0.0GlnThr: 0.0 ± 0.0
0.0GlnVal: 0.0 ± 0.0
1.418GlnTrp: 1.418 ± 1.499
1.418GlnTyr: 1.418 ± 1.499
0.0GlnXaa: 0.0 ± 0.0
Arg
4.255ArgAla: 4.255 ± 1.636
0.0ArgCys: 0.0 ± 0.0
2.837ArgAsp: 2.837 ± 2.071
2.837ArgGlu: 2.837 ± 2.071
2.837ArgPhe: 2.837 ± 2.071
2.837ArgGly: 2.837 ± 1.179
0.0ArgHis: 0.0 ± 0.0
5.674ArgIle: 5.674 ± 2.471
2.837ArgLys: 2.837 ± 1.768
2.837ArgLeu: 2.837 ± 1.179
1.418ArgMet: 1.418 ± 1.036
2.837ArgAsn: 2.837 ± 1.768
5.674ArgPro: 5.674 ± 2.357
0.0ArgGln: 0.0 ± 0.0
4.255ArgArg: 4.255 ± 2.49
8.511ArgSer: 8.511 ± 2.825
4.255ArgThr: 4.255 ± 3.107
1.418ArgVal: 1.418 ± 1.036
1.418ArgTrp: 1.418 ± 1.036
1.418ArgTyr: 1.418 ± 1.499
0.0ArgXaa: 0.0 ± 0.0
Ser
1.418SerAla: 1.418 ± 1.499
0.0SerCys: 0.0 ± 0.0
2.837SerAsp: 2.837 ± 2.071
4.255SerGlu: 4.255 ± 0.733
4.255SerPhe: 4.255 ± 2.852
11.348SerGly: 11.348 ± 5.234
0.0SerHis: 0.0 ± 0.0
2.837SerIle: 2.837 ± 1.179
4.255SerLys: 4.255 ± 3.107
2.837SerLeu: 2.837 ± 1.768
1.418SerMet: 1.418 ± 1.036
8.511SerAsn: 8.511 ± 2.702
2.837SerPro: 2.837 ± 1.768
5.674SerGln: 5.674 ± 4.155
7.092SerArg: 7.092 ± 3.505
8.511SerSer: 8.511 ± 5.894
7.092SerThr: 7.092 ± 3.418
4.255SerVal: 4.255 ± 2.947
0.0SerTrp: 0.0 ± 0.0
8.511SerTyr: 8.511 ± 5.305
0.0SerXaa: 0.0 ± 0.0
Thr
2.837ThrAla: 2.837 ± 1.179
0.0ThrCys: 0.0 ± 0.0
2.837ThrAsp: 2.837 ± 1.185
2.837ThrGlu: 2.837 ± 1.179
2.837ThrPhe: 2.837 ± 1.179
7.092ThrGly: 7.092 ± 1.278
0.0ThrHis: 0.0 ± 0.0
8.511ThrIle: 8.511 ± 2.702
4.255ThrLys: 4.255 ± 1.636
5.674ThrLeu: 5.674 ± 2.041
2.837ThrMet: 2.837 ± 1.768
2.837ThrAsn: 2.837 ± 1.768
4.255ThrPro: 4.255 ± 1.636
2.837ThrGln: 2.837 ± 2.071
4.255ThrArg: 4.255 ± 1.636
5.674ThrSer: 5.674 ± 2.041
9.929ThrThr: 9.929 ± 4.062
8.511ThrVal: 8.511 ± 1.248
0.0ThrTrp: 0.0 ± 0.0
2.837ThrTyr: 2.837 ± 2.998
0.0ThrXaa: 0.0 ± 0.0
Val
2.837ValAla: 2.837 ± 2.998
2.837ValCys: 2.837 ± 1.768
5.674ValAsp: 5.674 ± 4.143
2.837ValGlu: 2.837 ± 1.185
2.837ValPhe: 2.837 ± 1.185
1.418ValGly: 1.418 ± 1.436
0.0ValHis: 0.0 ± 0.0
4.255ValIle: 4.255 ± 0.733
2.837ValLys: 2.837 ± 2.071
2.837ValLeu: 2.837 ± 1.768
2.837ValMet: 2.837 ± 2.071
1.418ValAsn: 1.418 ± 1.436
2.837ValPro: 2.837 ± 1.179
4.255ValGln: 4.255 ± 4.497
1.418ValArg: 1.418 ± 1.499
2.837ValSer: 2.837 ± 1.185
5.674ValThr: 5.674 ± 3.937
2.837ValVal: 2.837 ± 1.768
0.0ValTrp: 0.0 ± 0.0
7.092ValTyr: 7.092 ± 4.523
0.0ValXaa: 0.0 ± 0.0
Trp
1.418TrpAla: 1.418 ± 1.036
1.418TrpCys: 1.418 ± 1.036
1.418TrpAsp: 1.418 ± 1.036
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
1.418TrpGly: 1.418 ± 1.036
0.0TrpHis: 0.0 ± 0.0
1.418TrpIle: 1.418 ± 1.036
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
1.418TrpAsn: 1.418 ± 1.036
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.418TrpThr: 1.418 ± 1.499
1.418TrpVal: 1.418 ± 1.036
1.418TrpTrp: 1.418 ± 1.036
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.837TyrAla: 2.837 ± 1.768
2.837TyrCys: 2.837 ± 1.185
1.418TyrAsp: 1.418 ± 1.436
4.255TyrGlu: 4.255 ± 0.733
4.255TyrPhe: 4.255 ± 1.7
1.418TyrGly: 1.418 ± 1.036
1.418TyrHis: 1.418 ± 1.436
0.0TyrIle: 0.0 ± 0.0
1.418TyrLys: 1.418 ± 1.436
4.255TyrLeu: 4.255 ± 1.7
1.418TyrMet: 1.418 ± 1.078
2.837TyrAsn: 2.837 ± 1.768
1.418TyrPro: 1.418 ± 1.036
2.837TyrGln: 2.837 ± 1.768
2.837TyrArg: 2.837 ± 2.998
7.092TyrSer: 7.092 ± 4.624
2.837TyrThr: 2.837 ± 1.768
2.837TyrVal: 2.837 ± 1.768
0.0TyrTrp: 0.0 ± 0.0
1.418TyrTyr: 1.418 ± 1.436
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (706 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski