Amino acid dipepetide frequency for Omono River virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.179AlaAla: 10.179 ± 2.637
2.85AlaCys: 2.85 ± 0.133
4.072AlaAsp: 4.072 ± 2.156
2.85AlaGlu: 2.85 ± 1.509
2.036AlaPhe: 2.036 ± 0.39
5.7AlaGly: 5.7 ± 1.642
1.221AlaHis: 1.221 ± 0.041
6.107AlaIle: 6.107 ± 0.481
4.479AlaLys: 4.479 ± 1.069
5.7AlaLeu: 5.7 ± 0.954
2.036AlaMet: 2.036 ± 0.298
6.922AlaAsn: 6.922 ± 2.289
4.072AlaPro: 4.072 ± 0.78
2.443AlaGln: 2.443 ± 1.293
3.257AlaArg: 3.257 ± 0.34
5.293AlaSer: 5.293 ± 0.05
9.772AlaThr: 9.772 ± 2.422
6.922AlaVal: 6.922 ± 0.913
1.221AlaTrp: 1.221 ± 0.041
4.886AlaTyr: 4.886 ± 0.853
0.0AlaXaa: 0.0 ± 0.0
Cys
0.407CysAla: 0.407 ± 0.216
0.0CysCys: 0.0 ± 0.0
0.407CysAsp: 0.407 ± 0.216
1.221CysGlu: 1.221 ± 1.417
0.0CysPhe: 0.0 ± 0.0
0.814CysGly: 0.814 ± 0.257
0.407CysHis: 0.407 ± 0.472
0.407CysIle: 0.407 ± 0.216
1.629CysLys: 1.629 ± 1.202
1.221CysLeu: 1.221 ± 0.041
0.407CysMet: 0.407 ± 0.216
0.814CysAsn: 0.814 ± 0.431
0.407CysPro: 0.407 ± 0.216
0.814CysGln: 0.814 ± 0.431
0.814CysArg: 0.814 ± 0.431
0.814CysSer: 0.814 ± 0.257
0.0CysThr: 0.0 ± 0.0
0.407CysVal: 0.407 ± 0.216
0.814CysTrp: 0.814 ± 0.431
1.221CysTyr: 1.221 ± 0.041
0.0CysXaa: 0.0 ± 0.0
Asp
4.886AspAla: 4.886 ± 1.211
1.221AspCys: 1.221 ± 0.729
2.036AspAsp: 2.036 ± 0.39
5.293AspGlu: 5.293 ± 0.738
1.221AspPhe: 1.221 ± 0.729
0.814AspGly: 0.814 ± 0.431
0.407AspHis: 0.407 ± 0.216
5.7AspIle: 5.7 ± 2.486
2.443AspLys: 2.443 ± 0.083
6.107AspLeu: 6.107 ± 2.271
0.814AspMet: 0.814 ± 0.431
1.221AspAsn: 1.221 ± 0.041
3.257AspPro: 3.257 ± 1.037
2.85AspGln: 2.85 ± 0.821
1.221AspArg: 1.221 ± 0.729
1.221AspSer: 1.221 ± 0.647
1.629AspThr: 1.629 ± 0.514
2.036AspVal: 2.036 ± 1.078
0.0AspTrp: 0.0 ± 0.0
2.036AspTyr: 2.036 ± 0.298
0.0AspXaa: 0.0 ± 0.0
Glu
4.479GluAla: 4.479 ± 0.995
0.814GluCys: 0.814 ± 0.431
1.221GluAsp: 1.221 ± 0.041
3.257GluGlu: 3.257 ± 1.037
1.629GluPhe: 1.629 ± 1.89
3.664GluGly: 3.664 ± 0.812
1.629GluHis: 1.629 ± 0.514
3.664GluIle: 3.664 ± 1.5
1.629GluLys: 1.629 ± 0.174
5.293GluLeu: 5.293 ± 0.738
0.407GluMet: 0.407 ± 0.216
2.036GluAsn: 2.036 ± 0.298
2.443GluPro: 2.443 ± 0.083
1.221GluGln: 1.221 ± 0.647
2.85GluArg: 2.85 ± 0.821
1.629GluSer: 1.629 ± 0.174
3.664GluThr: 3.664 ± 0.124
2.443GluVal: 2.443 ± 0.605
2.036GluTrp: 2.036 ± 0.986
2.443GluTyr: 2.443 ± 0.605
0.0GluXaa: 0.0 ± 0.0
Phe
3.664PheAla: 3.664 ± 1.252
0.0PheCys: 0.0 ± 0.0
3.664PheAsp: 3.664 ± 1.5
2.443PheGlu: 2.443 ± 0.771
0.814PhePhe: 0.814 ± 0.257
3.257PheGly: 3.257 ± 0.34
0.814PheHis: 0.814 ± 0.431
0.814PheIle: 0.814 ± 0.257
2.443PheLys: 2.443 ± 1.459
1.221PheLeu: 1.221 ± 0.041
0.814PheMet: 0.814 ± 0.717
1.629PheAsn: 1.629 ± 0.174
2.443PhePro: 2.443 ± 0.083
0.0PheGln: 0.0 ± 0.0
1.221PheArg: 1.221 ± 0.729
2.85PheSer: 2.85 ± 0.555
1.221PheThr: 1.221 ± 0.647
2.036PheVal: 2.036 ± 0.986
0.814PheTrp: 0.814 ± 0.257
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.85GlyAla: 2.85 ± 0.821
0.0GlyCys: 0.0 ± 0.0
2.85GlyAsp: 2.85 ± 0.555
3.257GlyGlu: 3.257 ± 1.037
2.85GlyPhe: 2.85 ± 1.243
5.293GlyGly: 5.293 ± 0.638
0.814GlyHis: 0.814 ± 0.257
2.443GlyIle: 2.443 ± 0.771
3.664GlyLys: 3.664 ± 0.812
4.072GlyLeu: 4.072 ± 0.596
0.814GlyMet: 0.814 ± 0.257
4.479GlyAsn: 4.479 ± 0.995
2.85GlyPro: 2.85 ± 0.133
2.036GlyGln: 2.036 ± 1.078
2.85GlyArg: 2.85 ± 0.821
2.036GlySer: 2.036 ± 0.298
3.664GlyThr: 3.664 ± 1.252
4.479GlyVal: 4.479 ± 1.683
2.443GlyTrp: 2.443 ± 0.771
0.407GlyTyr: 0.407 ± 0.472
0.0GlyXaa: 0.0 ± 0.0
His
2.443HisAla: 2.443 ± 1.293
0.0HisCys: 0.0 ± 0.0
0.814HisAsp: 0.814 ± 0.431
2.036HisGlu: 2.036 ± 0.298
0.407HisPhe: 0.407 ± 0.216
1.221HisGly: 1.221 ± 0.729
1.629HisHis: 1.629 ± 1.202
0.0HisIle: 0.0 ± 0.0
0.814HisLys: 0.814 ± 0.945
2.036HisLeu: 2.036 ± 1.674
0.814HisMet: 0.814 ± 0.431
1.221HisAsn: 1.221 ± 0.647
0.814HisPro: 0.814 ± 0.945
0.814HisGln: 0.814 ± 0.945
1.629HisArg: 1.629 ± 0.514
1.629HisSer: 1.629 ± 0.514
0.814HisThr: 0.814 ± 0.431
2.036HisVal: 2.036 ± 0.298
0.407HisTrp: 0.407 ± 0.472
0.814HisTyr: 0.814 ± 0.945
0.0HisXaa: 0.0 ± 0.0
Ile
8.143IleAla: 8.143 ± 3.257
0.814IleCys: 0.814 ± 0.257
2.85IleAsp: 2.85 ± 0.555
1.221IleGlu: 1.221 ± 0.041
1.221IlePhe: 1.221 ± 0.647
2.85IleGly: 2.85 ± 1.243
2.036IleHis: 2.036 ± 0.986
4.072IleIle: 4.072 ± 1.284
2.85IleLys: 2.85 ± 1.243
3.257IleLeu: 3.257 ± 0.34
0.814IleMet: 0.814 ± 0.257
2.85IleAsn: 2.85 ± 0.555
7.329IlePro: 7.329 ± 0.44
2.036IleGln: 2.036 ± 0.298
2.85IleArg: 2.85 ± 0.133
3.664IleSer: 3.664 ± 0.124
3.257IleThr: 3.257 ± 0.349
3.664IleVal: 3.664 ± 0.564
1.221IleTrp: 1.221 ± 0.041
2.443IleTyr: 2.443 ± 1.459
0.0IleXaa: 0.0 ± 0.0
Lys
2.036LysAla: 2.036 ± 0.39
0.0LysCys: 0.0 ± 0.0
2.443LysAsp: 2.443 ± 0.771
2.443LysGlu: 2.443 ± 0.771
1.629LysPhe: 1.629 ± 0.514
2.036LysGly: 2.036 ± 0.298
1.629LysHis: 1.629 ± 1.202
3.664LysIle: 3.664 ± 2.188
3.664LysLys: 3.664 ± 1.94
6.107LysLeu: 6.107 ± 0.895
2.85LysMet: 2.85 ± 0.555
2.443LysAsn: 2.443 ± 0.771
4.072LysPro: 4.072 ± 0.092
2.443LysGln: 2.443 ± 0.605
2.85LysArg: 2.85 ± 2.619
3.664LysSer: 3.664 ± 0.564
3.257LysThr: 3.257 ± 1.037
4.886LysVal: 4.886 ± 1.541
1.221LysTrp: 1.221 ± 0.041
1.221LysTyr: 1.221 ± 0.041
0.0LysXaa: 0.0 ± 0.0
Leu
10.586LeuAla: 10.586 ± 0.101
0.814LeuCys: 0.814 ± 0.257
2.85LeuAsp: 2.85 ± 1.243
2.443LeuGlu: 2.443 ± 0.605
2.85LeuPhe: 2.85 ± 1.243
2.85LeuGly: 2.85 ± 0.133
2.036LeuHis: 2.036 ± 0.986
3.257LeuIle: 3.257 ± 1.028
4.479LeuLys: 4.479 ± 1.069
5.7LeuLeu: 5.7 ± 1.11
1.629LeuMet: 1.629 ± 0.174
4.886LeuAsn: 4.886 ± 2.229
5.7LeuPro: 5.7 ± 1.11
5.7LeuGln: 5.7 ± 1.642
2.443LeuArg: 2.443 ± 0.771
8.143LeuSer: 8.143 ± 2.935
8.55LeuThr: 8.55 ± 1.087
4.072LeuVal: 4.072 ± 0.092
2.036LeuTrp: 2.036 ± 1.674
2.443LeuTyr: 2.443 ± 0.771
0.0LeuXaa: 0.0 ± 0.0
Met
1.629MetAla: 1.629 ± 0.174
0.814MetCys: 0.814 ± 0.431
1.221MetAsp: 1.221 ± 0.647
0.814MetGlu: 0.814 ± 0.257
0.814MetPhe: 0.814 ± 0.431
0.407MetGly: 0.407 ± 0.216
0.814MetHis: 0.814 ± 0.431
1.629MetIle: 1.629 ± 0.174
0.814MetLys: 0.814 ± 0.945
2.036MetLeu: 2.036 ± 0.298
0.814MetMet: 0.814 ± 0.431
1.221MetAsn: 1.221 ± 0.041
0.814MetPro: 0.814 ± 0.257
0.814MetGln: 0.814 ± 0.945
0.814MetArg: 0.814 ± 0.257
2.036MetSer: 2.036 ± 0.39
1.629MetThr: 1.629 ± 0.174
1.629MetVal: 1.629 ± 0.174
0.0MetTrp: 0.0 ± 0.0
0.814MetTyr: 0.814 ± 0.257
0.0MetXaa: 0.0 ± 0.0
Asn
5.7AsnAla: 5.7 ± 0.266
0.814AsnCys: 0.814 ± 0.257
3.257AsnAsp: 3.257 ± 0.349
1.629AsnGlu: 1.629 ± 0.514
1.629AsnPhe: 1.629 ± 0.174
3.257AsnGly: 3.257 ± 0.349
0.0AsnHis: 0.0 ± 0.0
4.479AsnIle: 4.479 ± 2.445
2.85AsnLys: 2.85 ± 0.133
2.443AsnLeu: 2.443 ± 0.083
1.629AsnMet: 1.629 ± 0.514
4.072AsnAsn: 4.072 ± 0.78
5.293AsnPro: 5.293 ± 0.05
3.257AsnGln: 3.257 ± 1.037
2.443AsnArg: 2.443 ± 0.083
5.293AsnSer: 5.293 ± 0.05
4.072AsnThr: 4.072 ± 1.468
4.479AsnVal: 4.479 ± 2.371
0.814AsnTrp: 0.814 ± 0.257
1.221AsnTyr: 1.221 ± 0.729
0.0AsnXaa: 0.0 ± 0.0
Pro
6.515ProAla: 6.515 ± 2.073
0.0ProCys: 0.0 ± 0.0
2.85ProAsp: 2.85 ± 1.243
2.85ProGlu: 2.85 ± 0.555
2.443ProPhe: 2.443 ± 0.771
1.629ProGly: 1.629 ± 0.174
1.221ProHis: 1.221 ± 0.041
3.664ProIle: 3.664 ± 1.252
3.257ProLys: 3.257 ± 0.349
5.7ProLeu: 5.7 ± 0.266
0.407ProMet: 0.407 ± 0.216
2.443ProAsn: 2.443 ± 0.083
2.443ProPro: 2.443 ± 0.605
2.036ProGln: 2.036 ± 0.39
2.036ProArg: 2.036 ± 0.39
5.293ProSer: 5.293 ± 1.326
4.479ProThr: 4.479 ± 0.381
5.293ProVal: 5.293 ± 2.114
0.814ProTrp: 0.814 ± 0.945
2.036ProTyr: 2.036 ± 1.078
0.0ProXaa: 0.0 ± 0.0
Gln
4.479GlnAla: 4.479 ± 1.683
0.0GlnCys: 0.0 ± 0.0
0.407GlnAsp: 0.407 ± 0.216
1.221GlnGlu: 1.221 ± 0.729
1.629GlnPhe: 1.629 ± 0.174
1.629GlnGly: 1.629 ± 0.862
1.221GlnHis: 1.221 ± 1.417
2.036GlnIle: 2.036 ± 0.39
2.036GlnLys: 2.036 ± 0.298
2.443GlnLeu: 2.443 ± 0.605
0.0GlnMet: 0.0 ± 0.0
2.85GlnAsn: 2.85 ± 0.555
2.036GlnPro: 2.036 ± 0.298
2.036GlnGln: 2.036 ± 0.986
1.629GlnArg: 1.629 ± 0.514
4.072GlnSer: 4.072 ± 0.092
4.479GlnThr: 4.479 ± 0.381
4.072GlnVal: 4.072 ± 1.468
0.407GlnTrp: 0.407 ± 0.216
2.85GlnTyr: 2.85 ± 1.509
0.0GlnXaa: 0.0 ± 0.0
Arg
2.036ArgAla: 2.036 ± 0.986
0.814ArgCys: 0.814 ± 0.945
1.629ArgAsp: 1.629 ± 0.514
3.664ArgGlu: 3.664 ± 1.94
0.814ArgPhe: 0.814 ± 0.257
2.443ArgGly: 2.443 ± 0.083
1.629ArgHis: 1.629 ± 0.862
2.443ArgIle: 2.443 ± 0.083
2.443ArgLys: 2.443 ± 1.459
4.072ArgLeu: 4.072 ± 0.596
0.407ArgMet: 0.407 ± 0.216
2.036ArgAsn: 2.036 ± 0.986
1.629ArgPro: 1.629 ± 0.174
1.629ArgGln: 1.629 ± 1.202
4.072ArgArg: 4.072 ± 0.596
3.664ArgSer: 3.664 ± 1.252
1.629ArgThr: 1.629 ± 1.202
2.443ArgVal: 2.443 ± 0.083
1.221ArgTrp: 1.221 ± 0.041
1.221ArgTyr: 1.221 ± 0.041
0.0ArgXaa: 0.0 ± 0.0
Ser
5.7SerAla: 5.7 ± 2.33
0.814SerCys: 0.814 ± 0.431
4.072SerAsp: 4.072 ± 0.78
3.257SerGlu: 3.257 ± 1.725
4.479SerPhe: 4.479 ± 0.307
3.257SerGly: 3.257 ± 0.34
1.629SerHis: 1.629 ± 0.514
3.664SerIle: 3.664 ± 0.124
3.664SerLys: 3.664 ± 1.5
6.922SerLeu: 6.922 ± 0.464
2.036SerMet: 2.036 ± 0.512
4.072SerAsn: 4.072 ± 0.596
4.072SerPro: 4.072 ± 0.78
2.036SerGln: 2.036 ± 0.39
1.629SerArg: 1.629 ± 0.174
6.922SerSer: 6.922 ± 0.225
5.293SerThr: 5.293 ± 0.05
4.479SerVal: 4.479 ± 0.307
1.629SerTrp: 1.629 ± 1.202
3.257SerTyr: 3.257 ± 1.028
0.0SerXaa: 0.0 ± 0.0
Thr
4.072ThrAla: 4.072 ± 1.468
0.407ThrCys: 0.407 ± 0.216
3.257ThrAsp: 3.257 ± 0.34
3.257ThrGlu: 3.257 ± 1.716
1.629ThrPhe: 1.629 ± 0.514
4.479ThrGly: 4.479 ± 1.069
1.221ThrHis: 1.221 ± 0.041
4.886ThrIle: 4.886 ± 0.523
5.7ThrLys: 5.7 ± 1.642
6.515ThrLeu: 6.515 ± 0.697
0.814ThrMet: 0.814 ± 0.431
5.7ThrAsn: 5.7 ± 1.642
0.814ThrPro: 0.814 ± 0.431
4.479ThrGln: 4.479 ± 0.381
2.443ThrArg: 2.443 ± 0.083
6.515ThrSer: 6.515 ± 1.385
6.922ThrThr: 6.922 ± 1.601
6.107ThrVal: 6.107 ± 1.858
2.036ThrTrp: 2.036 ± 1.078
2.443ThrTyr: 2.443 ± 0.083
0.0ThrXaa: 0.0 ± 0.0
Val
5.7ValAla: 5.7 ± 2.33
1.221ValCys: 1.221 ± 0.041
1.629ValAsp: 1.629 ± 0.174
3.257ValGlu: 3.257 ± 0.34
2.443ValPhe: 2.443 ± 0.771
7.329ValGly: 7.329 ± 3.192
0.407ValHis: 0.407 ± 0.472
2.85ValIle: 2.85 ± 0.133
3.257ValLys: 3.257 ± 0.349
4.886ValLeu: 4.886 ± 1.899
2.036ValMet: 2.036 ± 0.298
4.886ValAsn: 4.886 ± 1.899
4.479ValPro: 4.479 ± 1.683
2.85ValGln: 2.85 ± 1.243
3.257ValArg: 3.257 ± 0.349
4.479ValSer: 4.479 ± 1.757
4.479ValThr: 4.479 ± 0.995
2.85ValVal: 2.85 ± 0.821
0.814ValTrp: 0.814 ± 0.431
4.072ValTyr: 4.072 ± 2.156
0.0ValXaa: 0.0 ± 0.0
Trp
1.221TrpAla: 1.221 ± 0.647
0.814TrpCys: 0.814 ± 0.257
0.814TrpAsp: 0.814 ± 0.257
0.407TrpGlu: 0.407 ± 0.216
0.407TrpPhe: 0.407 ± 0.472
0.407TrpGly: 0.407 ± 0.472
0.814TrpHis: 0.814 ± 0.257
0.407TrpIle: 0.407 ± 0.216
0.0TrpLys: 0.0 ± 0.0
3.664TrpLeu: 3.664 ± 1.5
0.407TrpMet: 0.407 ± 0.216
2.443TrpAsn: 2.443 ± 0.771
1.221TrpPro: 1.221 ± 0.041
0.0TrpGln: 0.0 ± 0.0
0.814TrpArg: 0.814 ± 0.257
1.629TrpSer: 1.629 ± 1.202
2.443TrpThr: 2.443 ± 0.771
1.221TrpVal: 1.221 ± 0.041
0.407TrpTrp: 0.407 ± 0.472
1.221TrpTyr: 1.221 ± 0.647
0.0TrpXaa: 0.0 ± 0.0
Tyr
5.293TyrAla: 5.293 ± 0.638
0.407TyrCys: 0.407 ± 0.216
4.072TyrAsp: 4.072 ± 0.78
1.629TyrGlu: 1.629 ± 1.202
1.221TyrPhe: 1.221 ± 0.647
0.814TyrGly: 0.814 ± 0.431
0.814TyrHis: 0.814 ± 0.431
3.257TyrIle: 3.257 ± 1.028
2.443TyrLys: 2.443 ± 0.083
4.072TyrLeu: 4.072 ± 1.973
1.221TyrMet: 1.221 ± 0.041
0.407TyrAsn: 0.407 ± 0.216
1.221TyrPro: 1.221 ± 0.041
2.036TyrGln: 2.036 ± 0.298
0.814TyrArg: 0.814 ± 0.257
2.443TyrSer: 2.443 ± 0.605
2.443TyrThr: 2.443 ± 0.605
1.629TyrVal: 1.629 ± 0.174
0.407TyrTrp: 0.407 ± 0.216
0.814TyrTyr: 0.814 ± 0.431
0.407TyrXaa: 0.407 ± 0.216
Xaa
0.0XaaAla: 0.0 ± 0.0
0.407XaaCys: 0.407 ± 0.216
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2457 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski