Amino acid dipeptide frequency for Capybara microvirus Cap3_SP_613

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred at random with equal probability, each of the 400 would therefore be present at a frequency of 0.25%. This is not the case in nature, so for better visibility the table marks dipeptides that are more frequent than expected in red and those that are underrepresented in blue.
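The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the code used to build this page: the function name `dipeptide_per_mille`, the use of one-letter codes, the mapping of every non-standard letter to X (Xaa), and the choice to count pairs only within each protein are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"          # X stands in for Xaa (assumption)


def dipeptide_per_mille(sequences):
    """Return {pair: frequency in per mille} for all 441 ordered pairs."""
    counts = Counter()
    for seq in sequences:
        # Map anything outside the 20 standard letters to X (Xaa).
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Count overlapping adjacent pairs within this sequence only.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input, not the real proteome: "B" is non-standard and becomes X.
freqs = dipeptide_per_mille(["MKKLLA", "AKKB"])
```

With the toy input, `freqs` holds one entry for each of the 441 ordered pairs, and the entries sum to 1000‰.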

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
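As a small worked example of the units, the sketch below converts a per mille value to a fraction and expresses the uniform expectation on the same scale. The helper name `per_mille_to_fraction` is hypothetical. Whether the table's colouring uses exactly this uniform baseline is not stated here, so treat the comparison as an assumption.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 1e-3)."""
    return value * 1e-3


leu_asn = per_mille_to_fraction(15.793)  # LeuAsn entry from the table

# Uniform expectation on the per mille scale.
uniform_400 = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%, for the 400 standard pairs
uniform_441 = 1000.0 / 441  # about 2.268 per mille when Xaa is included
```

On this scale, an entry such as LeuAsn (15.793‰) lies well above the uniform value of 2.5‰, while an entry of 0.718‰ lies below it.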

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.872AlaAla: 2.872 ± 3.434
0.718AlaCys: 0.718 ± 0.87
3.589AlaAsp: 3.589 ± 1.382
4.307AlaGlu: 4.307 ± 1.326
2.872AlaPhe: 2.872 ± 1.579
5.743AlaGly: 5.743 ± 2.699
0.718AlaHis: 0.718 ± 0.589
2.872AlaIle: 2.872 ± 1.307
0.718AlaLys: 0.718 ± 0.486
3.589AlaLeu: 3.589 ± 2.357
2.154AlaMet: 2.154 ± 0.893
2.872AlaAsn: 2.872 ± 0.961
2.154AlaPro: 2.154 ± 1.459
0.718AlaGln: 0.718 ± 0.859
2.154AlaArg: 2.154 ± 0.742
6.461AlaSer: 6.461 ± 2.982
2.872AlaThr: 2.872 ± 1.435
2.872AlaVal: 2.872 ± 2.564
0.718AlaTrp: 0.718 ± 0.486
4.307AlaTyr: 4.307 ± 1.771
0.0AlaXaa: 0.0 ± 0.0
Cys
0.718CysAla: 0.718 ± 0.589
0.0CysCys: 0.0 ± 0.0
1.436CysAsp: 1.436 ± 0.871
0.0CysGlu: 0.0 ± 0.0
3.589CysPhe: 3.589 ± 2.511
2.154CysGly: 2.154 ± 0.938
0.0CysHis: 0.0 ± 0.0
2.154CysIle: 2.154 ± 0.926
1.436CysLys: 1.436 ± 1.177
1.436CysLeu: 1.436 ± 1.177
0.0CysMet: 0.0 ± 0.0
0.718CysAsn: 0.718 ± 0.87
1.436CysPro: 1.436 ± 0.973
0.0CysGln: 0.0 ± 0.0
0.0CysArg: 0.0 ± 0.0
0.718CysSer: 0.718 ± 0.486
0.718CysThr: 0.718 ± 0.486
1.436CysVal: 1.436 ± 1.177
0.0CysTrp: 0.0 ± 0.0
0.718CysTyr: 0.718 ± 0.589
0.0CysXaa: 0.0 ± 0.0
Asp
0.718AspAla: 0.718 ± 0.486
0.0AspCys: 0.0 ± 0.0
3.589AspAsp: 3.589 ± 0.967
5.743AspGlu: 5.743 ± 2.056
7.179AspPhe: 7.179 ± 1.874
0.718AspGly: 0.718 ± 0.486
0.718AspHis: 0.718 ± 0.486
9.332AspIle: 9.332 ± 4.103
3.589AspLys: 3.589 ± 2.034
3.589AspLeu: 3.589 ± 0.855
1.436AspMet: 1.436 ± 1.75
5.025AspAsn: 5.025 ± 1.776
1.436AspPro: 1.436 ± 0.973
0.718AspGln: 0.718 ± 0.486
0.0AspArg: 0.0 ± 0.0
8.615AspSer: 8.615 ± 2.375
3.589AspThr: 3.589 ± 2.378
2.154AspVal: 2.154 ± 1.146
2.154AspTrp: 2.154 ± 0.742
3.589AspTyr: 3.589 ± 0.92
0.0AspXaa: 0.0 ± 0.0
Glu
2.872GluAla: 2.872 ± 0.747
2.872GluCys: 2.872 ± 0.983
1.436GluAsp: 1.436 ± 0.461
0.0GluGlu: 0.0 ± 0.0
2.872GluPhe: 2.872 ± 1.169
0.718GluGly: 0.718 ± 0.859
0.718GluHis: 0.718 ± 0.486
5.025GluIle: 5.025 ± 1.95
3.589GluLys: 3.589 ± 1.56
3.589GluLeu: 3.589 ± 1.231
0.718GluMet: 0.718 ± 0.486
5.025GluAsn: 5.025 ± 1.447
0.0GluPro: 0.0 ± 0.0
0.718GluGln: 0.718 ± 0.859
0.718GluArg: 0.718 ± 0.87
4.307GluSer: 4.307 ± 1.969
1.436GluThr: 1.436 ± 0.946
1.436GluVal: 1.436 ± 0.756
1.436GluTrp: 1.436 ± 0.946
3.589GluTyr: 3.589 ± 1.08
0.0GluXaa: 0.0 ± 0.0
Phe
2.154PheAla: 2.154 ± 1.459
0.718PheCys: 0.718 ± 0.589
3.589PheAsp: 3.589 ± 1.137
1.436PheGlu: 1.436 ± 1.037
5.743PhePhe: 5.743 ± 2.297
2.154PheGly: 2.154 ± 0.938
0.718PheHis: 0.718 ± 0.486
2.872PheIle: 2.872 ± 1.001
3.589PheLys: 3.589 ± 0.92
7.179PheLeu: 7.179 ± 2.508
0.718PheMet: 0.718 ± 0.512
7.179PheAsn: 7.179 ± 2.592
2.154PhePro: 2.154 ± 0.742
0.718PheGln: 0.718 ± 0.859
3.589PheArg: 3.589 ± 0.958
5.743PheSer: 5.743 ± 2.077
6.461PheThr: 6.461 ± 2.549
4.307PheVal: 4.307 ± 2.305
0.718PheTrp: 0.718 ± 0.486
4.307PheTyr: 4.307 ± 2.348
0.0PheXaa: 0.0 ± 0.0
Gly
2.154GlyAla: 2.154 ± 0.871
0.718GlyCys: 0.718 ± 0.589
2.872GlyAsp: 2.872 ± 1.178
3.589GlyGlu: 3.589 ± 1.627
5.025GlyPhe: 5.025 ± 1.362
2.872GlyGly: 2.872 ± 1.347
0.718GlyHis: 0.718 ± 0.486
4.307GlyIle: 4.307 ± 2.142
2.872GlyLys: 2.872 ± 0.747
5.025GlyLeu: 5.025 ± 1.866
0.718GlyMet: 0.718 ± 0.773
2.872GlyAsn: 2.872 ± 1.347
0.0GlyPro: 0.0 ± 0.0
2.154GlyGln: 2.154 ± 0.938
2.872GlyArg: 2.872 ± 0.983
9.332GlySer: 9.332 ± 1.474
0.718GlyThr: 0.718 ± 0.486
3.589GlyVal: 3.589 ± 1.75
0.0GlyTrp: 0.0 ± 0.0
2.154GlyTyr: 2.154 ± 1.459
0.0GlyXaa: 0.0 ± 0.0
His
1.436HisAla: 1.436 ± 0.973
0.0HisCys: 0.0 ± 0.0
1.436HisAsp: 1.436 ± 0.756
0.718HisGlu: 0.718 ± 0.486
0.0HisPhe: 0.0 ± 0.0
0.718HisGly: 0.718 ± 0.486
1.436HisHis: 1.436 ± 0.973
0.718HisIle: 0.718 ± 0.589
0.0HisLys: 0.0 ± 0.0
0.718HisLeu: 0.718 ± 0.486
0.718HisMet: 0.718 ± 0.499
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.0HisGln: 0.0 ± 0.0
0.718HisArg: 0.718 ± 0.486
2.154HisSer: 2.154 ± 0.991
0.0HisThr: 0.0 ± 0.0
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
1.436HisTyr: 1.436 ± 1.177
0.0HisXaa: 0.0 ± 0.0
Ile
2.872IleAla: 2.872 ± 1.347
1.436IleCys: 1.436 ± 0.461
5.743IleAsp: 5.743 ± 1.079
2.872IleGlu: 2.872 ± 2.101
2.154IlePhe: 2.154 ± 1.077
2.154IleGly: 2.154 ± 0.938
0.718IleHis: 0.718 ± 0.733
1.436IleIle: 1.436 ± 1.201
2.872IleLys: 2.872 ± 1.994
11.486IleLeu: 11.486 ± 3.453
0.718IleMet: 0.718 ± 1.052
6.461IleAsn: 6.461 ± 2.218
4.307IlePro: 4.307 ± 1.5
2.154IleGln: 2.154 ± 1.038
0.718IleArg: 0.718 ± 1.052
3.589IleSer: 3.589 ± 1.44
2.154IleThr: 2.154 ± 0.742
3.589IleVal: 3.589 ± 0.931
0.0IleTrp: 0.0 ± 0.0
8.615IleTyr: 8.615 ± 2.24
0.0IleXaa: 0.0 ± 0.0
Lys
2.154LysAla: 2.154 ± 1.76
0.718LysCys: 0.718 ± 0.589
6.461LysAsp: 6.461 ± 1.5
5.743LysGlu: 5.743 ± 3.623
3.589LysPhe: 3.589 ± 1.484
6.461LysGly: 6.461 ± 3.276
1.436LysHis: 1.436 ± 0.973
5.743LysIle: 5.743 ± 1.079
9.332LysLys: 9.332 ± 4.143
7.179LysLeu: 7.179 ± 1.431
2.154LysMet: 2.154 ± 0.908
6.461LysAsn: 6.461 ± 2.813
0.718LysPro: 0.718 ± 0.486
2.154LysGln: 2.154 ± 1.576
0.718LysArg: 0.718 ± 0.859
4.307LysSer: 4.307 ± 2.152
0.718LysThr: 0.718 ± 0.589
1.436LysVal: 1.436 ± 0.871
0.0LysTrp: 0.0 ± 0.0
1.436LysTyr: 1.436 ± 0.461
0.0LysXaa: 0.0 ± 0.0
Leu
3.589LeuAla: 3.589 ± 1.766
1.436LeuCys: 1.436 ± 0.85
7.179LeuAsp: 7.179 ± 2.029
5.025LeuGlu: 5.025 ± 1.659
5.025LeuPhe: 5.025 ± 1.05
6.461LeuGly: 6.461 ± 1.697
1.436LeuHis: 1.436 ± 0.973
5.025LeuIle: 5.025 ± 2.495
7.179LeuLys: 7.179 ± 2.943
2.872LeuLeu: 2.872 ± 0.921
0.718LeuMet: 0.718 ± 0.859
15.793LeuAsn: 15.793 ± 4.716
5.025LeuPro: 5.025 ± 1.296
5.025LeuGln: 5.025 ± 1.709
5.025LeuArg: 5.025 ± 1.71
8.615LeuSer: 8.615 ± 2.542
2.872LeuThr: 2.872 ± 0.921
5.025LeuVal: 5.025 ± 1.599
0.0LeuTrp: 0.0 ± 0.0
1.436LeuTyr: 1.436 ± 0.946
0.0LeuXaa: 0.0 ± 0.0
Met
0.718MetAla: 0.718 ± 0.859
0.718MetCys: 0.718 ± 1.052
0.718MetAsp: 0.718 ± 0.486
0.0MetGlu: 0.0 ± 0.0
0.718MetPhe: 0.718 ± 0.486
1.436MetGly: 1.436 ± 0.973
0.0MetHis: 0.0 ± 0.0
0.718MetIle: 0.718 ± 0.589
1.436MetLys: 1.436 ± 1.486
2.872MetLeu: 2.872 ± 0.791
0.0MetMet: 0.0 ± 0.0
0.718MetAsn: 0.718 ± 0.87
0.718MetPro: 0.718 ± 0.486
0.0MetGln: 0.0 ± 0.0
0.0MetArg: 0.0 ± 0.0
3.589MetSer: 3.589 ± 1.031
0.718MetThr: 0.718 ± 0.589
0.0MetVal: 0.0 ± 0.0
0.0MetTrp: 0.0 ± 0.0
0.718MetTyr: 0.718 ± 0.859
0.0MetXaa: 0.0 ± 0.0
Asn
8.615AsnAla: 8.615 ± 2.424
0.718AsnCys: 0.718 ± 0.589
5.743AsnAsp: 5.743 ± 2.238
4.307AsnGlu: 4.307 ± 1.587
4.307AsnPhe: 4.307 ± 3.146
2.872AsnGly: 2.872 ± 1.59
0.0AsnHis: 0.0 ± 0.0
6.461AsnIle: 6.461 ± 2.285
6.461AsnLys: 6.461 ± 2.574
13.64AsnLeu: 13.64 ± 2.241
0.0AsnMet: 0.0 ± 0.0
10.05AsnAsn: 10.05 ± 1.348
2.154AsnPro: 2.154 ± 0.784
2.154AsnGln: 2.154 ± 1.038
4.307AsnArg: 4.307 ± 0.857
8.615AsnSer: 8.615 ± 2.276
7.179AsnThr: 7.179 ± 1.595
4.307AsnVal: 4.307 ± 2.312
0.0AsnTrp: 0.0 ± 0.0
5.025AsnTyr: 5.025 ± 1.483
0.0AsnXaa: 0.0 ± 0.0
Pro
2.154ProAla: 2.154 ± 1.459
1.436ProCys: 1.436 ± 1.177
2.872ProAsp: 2.872 ± 0.747
0.718ProGlu: 0.718 ± 0.486
3.589ProPhe: 3.589 ± 1.627
2.154ProGly: 2.154 ± 0.742
0.718ProHis: 0.718 ± 0.589
2.872ProIle: 2.872 ± 1.077
0.718ProLys: 0.718 ± 0.589
4.307ProLeu: 4.307 ± 2.919
0.0ProMet: 0.0 ± 0.0
2.154ProAsn: 2.154 ± 0.991
0.0ProPro: 0.0 ± 0.0
0.718ProGln: 0.718 ± 0.486
0.718ProArg: 0.718 ± 0.486
1.436ProSer: 1.436 ± 1.049
0.718ProThr: 0.718 ± 0.486
2.872ProVal: 2.872 ± 1.946
0.0ProTrp: 0.0 ± 0.0
1.436ProTyr: 1.436 ± 0.756
0.0ProXaa: 0.0 ± 0.0
Gln
5.025GlnAla: 5.025 ± 2.122
0.718GlnCys: 0.718 ± 0.589
2.154GlnAsp: 2.154 ± 1.073
0.0GlnGlu: 0.0 ± 0.0
2.154GlnPhe: 2.154 ± 0.742
2.872GlnGly: 2.872 ± 1.435
0.0GlnHis: 0.0 ± 0.0
0.718GlnIle: 0.718 ± 0.486
3.589GlnLys: 3.589 ± 0.641
4.307GlnLeu: 4.307 ± 2.753
0.718GlnMet: 0.718 ± 0.486
0.718GlnAsn: 0.718 ± 0.859
0.718GlnPro: 0.718 ± 0.486
0.718GlnGln: 0.718 ± 0.859
2.872GlnArg: 2.872 ± 1.578
1.436GlnSer: 1.436 ± 0.789
0.0GlnThr: 0.0 ± 0.0
2.154GlnVal: 2.154 ± 0.742
0.0GlnTrp: 0.0 ± 0.0
0.718GlnTyr: 0.718 ± 0.589
0.0GlnXaa: 0.0 ± 0.0
Arg
2.154ArgAla: 2.154 ± 1.576
1.436ArgCys: 1.436 ± 1.177
1.436ArgAsp: 1.436 ± 1.049
2.154ArgGlu: 2.154 ± 1.038
2.872ArgPhe: 2.872 ± 1.169
1.436ArgGly: 1.436 ± 0.756
0.0ArgHis: 0.0 ± 0.0
1.436ArgIle: 1.436 ± 0.973
0.718ArgLys: 0.718 ± 0.733
5.743ArgLeu: 5.743 ± 1.617
0.0ArgMet: 0.0 ± 0.0
3.589ArgAsn: 3.589 ± 1.031
2.154ArgPro: 2.154 ± 0.742
0.0ArgGln: 0.0 ± 0.0
0.0ArgArg: 0.0 ± 0.0
1.436ArgSer: 1.436 ± 0.85
0.0ArgThr: 0.0 ± 0.0
0.0ArgVal: 0.0 ± 0.0
0.0ArgTrp: 0.0 ± 0.0
2.872ArgTyr: 2.872 ± 1.168
0.0ArgXaa: 0.0 ± 0.0
Ser
5.025SerAla: 5.025 ± 6.01
2.872SerCys: 2.872 ± 1.497
5.743SerAsp: 5.743 ± 1.455
1.436SerGlu: 1.436 ± 1.177
5.743SerPhe: 5.743 ± 2.504
2.154SerGly: 2.154 ± 1.038
0.718SerHis: 0.718 ± 0.486
9.332SerIle: 9.332 ± 3.266
7.897SerLys: 7.897 ± 2.63
5.025SerLeu: 5.025 ± 1.296
0.0SerMet: 0.0 ± 0.0
12.204SerAsn: 12.204 ± 3.072
0.718SerPro: 0.718 ± 0.486
4.307SerGln: 4.307 ± 1.209
2.872SerArg: 2.872 ± 0.921
13.64SerSer: 13.64 ± 4.267
6.461SerThr: 6.461 ± 1.852
6.461SerVal: 6.461 ± 2.093
0.718SerTrp: 0.718 ± 0.486
5.025SerTyr: 5.025 ± 1.367
0.0SerXaa: 0.0 ± 0.0
Thr
2.154ThrAla: 2.154 ± 0.914
0.718ThrCys: 0.718 ± 0.589
0.0ThrAsp: 0.0 ± 0.0
1.436ThrGlu: 1.436 ± 0.973
0.718ThrPhe: 0.718 ± 1.052
4.307ThrGly: 4.307 ± 1.5
0.0ThrHis: 0.0 ± 0.0
0.718ThrIle: 0.718 ± 0.486
2.872ThrLys: 2.872 ± 1.642
4.307ThrLeu: 4.307 ± 1.002
1.436ThrMet: 1.436 ± 0.973
4.307ThrAsn: 4.307 ± 1.582
2.154ThrPro: 2.154 ± 0.871
3.589ThrGln: 3.589 ± 1.859
0.718ThrArg: 0.718 ± 0.486
5.743ThrSer: 5.743 ± 2.299
2.154ThrThr: 2.154 ± 1.649
1.436ThrVal: 1.436 ± 0.946
0.718ThrTrp: 0.718 ± 0.589
2.154ThrTyr: 2.154 ± 1.294
0.0ThrXaa: 0.0 ± 0.0
Val
5.025ValAla: 5.025 ± 1.859
0.718ValCys: 0.718 ± 0.486
4.307ValAsp: 4.307 ± 1.223
2.872ValGlu: 2.872 ± 0.91
1.436ValPhe: 1.436 ± 0.756
1.436ValGly: 1.436 ± 0.789
0.718ValHis: 0.718 ± 0.486
1.436ValIle: 1.436 ± 1.179
5.743ValLys: 5.743 ± 1.845
2.154ValLeu: 2.154 ± 1.938
1.436ValMet: 1.436 ± 0.461
5.743ValAsn: 5.743 ± 2.16
4.307ValPro: 4.307 ± 1.417
0.718ValGln: 0.718 ± 0.859
0.718ValArg: 0.718 ± 0.486
4.307ValSer: 4.307 ± 1.098
2.154ValThr: 2.154 ± 1.459
0.718ValVal: 0.718 ± 0.859
0.0ValTrp: 0.0 ± 0.0
2.872ValTyr: 2.872 ± 1.435
0.0ValXaa: 0.0 ± 0.0
Trp
1.436TrpAla: 1.436 ± 0.461
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
2.154TrpPhe: 2.154 ± 1.074
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
0.0TrpLeu: 0.0 ± 0.0
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
1.436TrpGln: 1.436 ± 0.973
0.0TrpArg: 0.0 ± 0.0
0.718TrpSer: 0.718 ± 0.589
0.0TrpThr: 0.0 ± 0.0
0.718TrpVal: 0.718 ± 0.486
0.0TrpTrp: 0.0 ± 0.0
0.718TrpTyr: 0.718 ± 0.486
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.436TyrAla: 1.436 ± 0.871
0.718TyrCys: 0.718 ± 0.486
4.307TyrAsp: 4.307 ± 1.111
0.718TyrGlu: 0.718 ± 0.733
4.307TyrPhe: 4.307 ± 1.011
5.743TyrGly: 5.743 ± 1.147
1.436TyrHis: 1.436 ± 1.049
2.872TyrIle: 2.872 ± 0.747
4.307TyrLys: 4.307 ± 1.263
5.743TyrLeu: 5.743 ± 1.221
1.436TyrMet: 1.436 ± 0.461
5.025TyrAsn: 5.025 ± 2.362
1.436TyrPro: 1.436 ± 0.973
3.589TyrGln: 3.589 ± 1.492
0.718TyrArg: 0.718 ± 0.486
3.589TyrSer: 3.589 ± 0.641
0.718TyrThr: 0.718 ± 0.486
4.307TyrVal: 4.307 ± 1.263
0.718TyrTrp: 0.718 ± 0.486
2.154TyrTyr: 2.154 ± 0.742
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (1394 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
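A protein-level bootstrap of this kind might look like the sketch below. It is an illustration under stated assumptions, not the procedure used for this page: whole proteins are resampled with replacement, the frequency is recomputed for each of 100 resamples, and the reported error is taken to be the standard deviation of those estimates. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in sequences:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Standard deviation of the pair frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


proteins = ["MKKLLA", "AKKSS", "MSSNL"]  # toy sequences, not the real proteome
err = bootstrap_error(proteins, "KK")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact. With only 6 proteins in the set, this can produce errors comparable to the values themselves, as for AlaAla (2.872 ± 3.434).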

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski