Amino acid dipeptide frequency for Lake Sinai virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). If dipeptides occurred randomly in proteins, each of the 400 would therefore be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
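The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used to build the table: the function names (`dipeptide_permille`, `classify`) are hypothetical, and the sketch assumes that dipeptides are counted as overlapping pairs within each protein (never across two proteins) and that any symbol outside the 20 standard one-letter codes is mapped to X (Xaa).

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Frequency expected if all 400 standard dipeptides were equally likely,
# expressed in per mille: 1000 / 400 = 2.5.
UNIFORM_PERMILLE = 1000.0 / len(STANDARD) ** 2


def dipeptide_permille(proteins):
    """Return dipeptide frequencies in per mille over a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything that is not a standard residue becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2, kept within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_permille, expected=UNIFORM_PERMILLE):
    """Label a frequency relative to the uniform expectation."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


if __name__ == "__main__":
    freqs = dipeptide_permille(["AAAA", "AC"])
    for dipeptide, value in sorted(freqs.items()):
        print(dipeptide, round(value, 3), classify(value))
```

With this convention, the AlaAla value of 11.078‰ from the table would be labelled "over" and a value of 0.504‰ would be labelled "under".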

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.078AlaAla: 11.078 ± 0.452
1.511AlaCys: 1.511 ± 0.557
6.042AlaAsp: 6.042 ± 1.588
4.028AlaGlu: 4.028 ± 0.918
3.525AlaPhe: 3.525 ± 1.163
4.028AlaGly: 4.028 ± 0.365
1.007AlaHis: 1.007 ± 0.364
3.525AlaIle: 3.525 ± 0.731
4.028AlaLys: 4.028 ± 1.375
7.049AlaLeu: 7.049 ± 1.123
1.007AlaMet: 1.007 ± 0.559
1.007AlaAsn: 1.007 ± 0.364
5.539AlaPro: 5.539 ± 0.961
0.504AlaGln: 0.504 ± 0.391
6.546AlaArg: 6.546 ± 0.284
6.546AlaSer: 6.546 ± 3.11
5.539AlaThr: 5.539 ± 0.259
6.042AlaVal: 6.042 ± 0.519
1.511AlaTrp: 1.511 ± 0.802
4.028AlaTyr: 4.028 ± 0.849
0.0AlaXaa: 0.0 ± 0.0
Cys
3.021CysAla: 3.021 ± 1.243
1.007CysCys: 1.007 ± 0.783
2.014CysAsp: 2.014 ± 0.594
1.007CysGlu: 1.007 ± 0.297
1.007CysPhe: 1.007 ± 0.651
1.007CysGly: 1.007 ± 0.297
0.504CysHis: 0.504 ± 0.326
0.504CysIle: 0.504 ± 0.408
0.0CysLys: 0.0 ± 0.0
3.021CysLeu: 3.021 ± 0.826
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.007CysPro: 1.007 ± 0.651
1.511CysGln: 1.511 ± 0.162
2.518CysArg: 2.518 ± 1.079
3.525CysSer: 3.525 ± 1.163
1.007CysThr: 1.007 ± 0.481
1.511CysVal: 1.511 ± 0.776
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.532AspAla: 4.532 ± 1.113
0.504AspCys: 0.504 ± 0.391
5.035AspAsp: 5.035 ± 1.085
2.014AspGlu: 2.014 ± 0.183
1.511AspPhe: 1.511 ± 0.557
7.049AspGly: 7.049 ± 1.848
1.511AspHis: 1.511 ± 0.485
4.028AspIle: 4.028 ± 0.849
2.014AspLys: 2.014 ± 0.546
6.042AspLeu: 6.042 ± 2.386
1.511AspMet: 1.511 ± 0.614
2.014AspAsn: 2.014 ± 1.09
3.525AspPro: 3.525 ± 0.524
2.518AspGln: 2.518 ± 0.502
3.021AspArg: 3.021 ± 0.332
2.518AspSer: 2.518 ± 0.502
5.035AspThr: 5.035 ± 2.127
0.504AspVal: 0.504 ± 0.408
0.504AspTrp: 0.504 ± 0.408
2.518AspTyr: 2.518 ± 0.502
0.0AspXaa: 0.0 ± 0.0
Glu
3.525GluAla: 3.525 ± 1.232
0.0GluCys: 0.0 ± 0.0
1.511GluAsp: 1.511 ± 0.977
0.504GluGlu: 0.504 ± 0.326
1.511GluPhe: 1.511 ± 1.174
3.021GluGly: 3.021 ± 0.494
1.511GluHis: 1.511 ± 1.174
2.014GluIle: 2.014 ± 0.986
0.504GluLys: 0.504 ± 0.326
1.007GluLeu: 1.007 ± 0.297
0.0GluMet: 0.0 ± 0.0
0.0GluAsn: 0.0 ± 0.0
3.021GluPro: 3.021 ± 1.552
1.007GluGln: 1.007 ± 0.481
2.014GluArg: 2.014 ± 0.594
3.021GluSer: 3.021 ± 1.455
1.511GluThr: 1.511 ± 1.225
3.525GluVal: 3.525 ± 1.012
0.0GluTrp: 0.0 ± 0.0
3.021GluTyr: 3.021 ± 0.97
0.0GluXaa: 0.0 ± 0.0
Phe
2.014PheAla: 2.014 ± 0.183
2.014PheCys: 2.014 ± 0.594
3.021PheAsp: 3.021 ± 0.332
1.511PheGlu: 1.511 ± 0.162
2.518PhePhe: 2.518 ± 0.884
3.021PheGly: 3.021 ± 1.455
1.007PheHis: 1.007 ± 0.364
1.511PheIle: 1.511 ± 0.485
0.504PheLys: 0.504 ± 0.326
2.518PheLeu: 2.518 ± 0.25
2.518PheMet: 2.518 ± 0.736
2.518PheAsn: 2.518 ± 1.042
3.021PhePro: 3.021 ± 1.358
1.511PheGln: 1.511 ± 0.977
3.021PheArg: 3.021 ± 1.396
7.049PheSer: 7.049 ± 0.928
0.504PheThr: 0.504 ± 0.408
3.021PheVal: 3.021 ± 0.826
0.504PheTrp: 0.504 ± 0.391
2.518PheTyr: 2.518 ± 0.25
0.0PheXaa: 0.0 ± 0.0
Gly
4.028GlyAla: 4.028 ± 2.037
1.511GlyCys: 1.511 ± 0.776
4.028GlyAsp: 4.028 ± 1.092
1.007GlyGlu: 1.007 ± 0.297
4.028GlyPhe: 4.028 ± 0.365
1.007GlyGly: 1.007 ± 0.651
1.007GlyHis: 1.007 ± 0.651
6.042GlyIle: 6.042 ± 0.137
0.0GlyLys: 0.0 ± 0.0
3.021GlyLeu: 3.021 ± 1.243
0.504GlyMet: 0.504 ± 0.391
1.511GlyAsn: 1.511 ± 0.802
4.028GlyPro: 4.028 ± 0.849
1.007GlyGln: 1.007 ± 0.651
2.014GlyArg: 2.014 ± 1.302
5.539GlySer: 5.539 ± 1.239
2.518GlyThr: 2.518 ± 1.042
3.021GlyVal: 3.021 ± 0.494
1.511GlyTrp: 1.511 ± 0.162
3.021GlyTyr: 3.021 ± 1.093
0.0GlyXaa: 0.0 ± 0.0
His
1.511HisAla: 1.511 ± 0.162
0.0HisCys: 0.0 ± 0.0
2.014HisAsp: 2.014 ± 0.183
2.518HisGlu: 2.518 ± 0.884
0.504HisPhe: 0.504 ± 0.326
0.504HisGly: 0.504 ± 0.326
0.504HisHis: 0.504 ± 0.326
1.007HisIle: 1.007 ± 0.783
0.504HisLys: 0.504 ± 0.326
1.511HisLeu: 1.511 ± 0.557
0.504HisMet: 0.504 ± 0.326
0.504HisAsn: 0.504 ± 0.391
4.028HisPro: 4.028 ± 0.918
0.504HisGln: 0.504 ± 0.408
3.021HisArg: 3.021 ± 1.953
2.014HisSer: 2.014 ± 0.771
2.014HisThr: 2.014 ± 1.131
2.014HisVal: 2.014 ± 0.771
0.504HisTrp: 0.504 ± 0.326
2.014HisTyr: 2.014 ± 0.594
0.0HisXaa: 0.0 ± 0.0
Ile
3.525IleAla: 3.525 ± 1.787
1.511IleCys: 1.511 ± 0.485
5.035IleAsp: 5.035 ± 1.127
2.014IleGlu: 2.014 ± 0.986
1.511IlePhe: 1.511 ± 0.702
2.014IleGly: 2.014 ± 0.771
1.511IleHis: 1.511 ± 0.485
2.518IleIle: 2.518 ± 0.736
2.518IleLys: 2.518 ± 0.64
4.532IleLeu: 4.532 ± 1.297
0.504IleMet: 0.504 ± 0.326
0.504IleAsn: 0.504 ± 0.408
2.014IlePro: 2.014 ± 0.476
1.007IleGln: 1.007 ± 0.817
2.014IleArg: 2.014 ± 0.183
7.049IleSer: 7.049 ± 2.832
2.518IleThr: 2.518 ± 0.64
1.511IleVal: 1.511 ± 0.614
0.504IleTrp: 0.504 ± 0.326
0.504IleTyr: 0.504 ± 0.408
0.0IleXaa: 0.0 ± 0.0
Lys
3.021LysAla: 3.021 ± 0.332
0.504LysCys: 0.504 ± 0.391
0.0LysAsp: 0.0 ± 0.0
0.504LysGlu: 0.504 ± 0.391
1.007LysPhe: 1.007 ± 0.364
1.511LysGly: 1.511 ± 0.485
1.007LysHis: 1.007 ± 0.364
2.014LysIle: 2.014 ± 1.131
0.0LysLys: 0.0 ± 0.0
1.007LysLeu: 1.007 ± 0.481
1.007LysMet: 1.007 ± 0.464
0.504LysAsn: 0.504 ± 0.408
1.007LysPro: 1.007 ± 0.817
0.504LysGln: 0.504 ± 0.391
2.014LysArg: 2.014 ± 0.594
2.518LysSer: 2.518 ± 0.502
1.007LysThr: 1.007 ± 0.364
3.021LysVal: 3.021 ± 0.325
0.504LysTrp: 0.504 ± 0.408
0.504LysTyr: 0.504 ± 0.391
0.0LysXaa: 0.0 ± 0.0
Leu
7.553LeuAla: 7.553 ± 1.348
3.021LeuCys: 3.021 ± 0.332
5.539LeuAsp: 5.539 ± 0.623
3.021LeuGlu: 3.021 ± 0.332
3.525LeuPhe: 3.525 ± 1.151
4.028LeuGly: 4.028 ± 0.951
1.511LeuHis: 1.511 ± 0.977
3.021LeuIle: 3.021 ± 1.243
2.518LeuLys: 2.518 ± 0.64
8.56LeuLeu: 8.56 ± 1.647
2.014LeuMet: 2.014 ± 0.837
4.532LeuAsn: 4.532 ± 1.328
6.042LeuPro: 6.042 ± 1.427
2.518LeuGln: 2.518 ± 1.23
11.581LeuArg: 11.581 ± 1.224
9.567LeuSer: 9.567 ± 0.65
4.532LeuThr: 4.532 ± 1.683
6.042LeuVal: 6.042 ± 1.844
0.504LeuTrp: 0.504 ± 0.391
3.021LeuTyr: 3.021 ± 0.909
0.0LeuXaa: 0.0 ± 0.0
Met
1.007MetAla: 1.007 ± 0.783
0.504MetCys: 0.504 ± 0.326
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
1.511MetGly: 1.511 ± 0.557
1.007MetHis: 1.007 ± 0.783
0.504MetIle: 0.504 ± 0.391
0.504MetLys: 0.504 ± 0.326
4.028MetLeu: 4.028 ± 1.092
0.504MetMet: 0.504 ± 0.326
0.504MetAsn: 0.504 ± 0.408
2.014MetPro: 2.014 ± 1.09
0.504MetGln: 0.504 ± 0.326
2.014MetArg: 2.014 ± 0.986
1.511MetSer: 1.511 ± 0.485
1.007MetThr: 1.007 ± 0.297
1.007MetVal: 1.007 ± 0.364
0.0MetTrp: 0.0 ± 0.0
1.007MetTyr: 1.007 ± 0.783
0.0MetXaa: 0.0 ± 0.0
Asn
1.007AsnAla: 1.007 ± 0.783
0.504AsnCys: 0.504 ± 0.408
0.504AsnAsp: 0.504 ± 0.408
1.007AsnGlu: 1.007 ± 0.651
2.518AsnPhe: 2.518 ± 0.406
1.511AsnGly: 1.511 ± 0.776
1.007AsnHis: 1.007 ± 0.651
1.007AsnIle: 1.007 ± 0.481
1.007AsnLys: 1.007 ± 0.364
2.518AsnLeu: 2.518 ± 0.856
0.0AsnMet: 0.0 ± 0.0
1.511AsnAsn: 1.511 ± 0.162
3.525AsnPro: 3.525 ± 1.333
0.0AsnGln: 0.0 ± 0.0
3.021AsnArg: 3.021 ± 0.325
1.511AsnSer: 1.511 ± 0.702
2.014AsnThr: 2.014 ± 0.729
3.525AsnVal: 3.525 ± 1.787
1.511AsnTrp: 1.511 ± 0.702
0.504AsnTyr: 0.504 ± 0.408
0.0AsnXaa: 0.0 ± 0.0
Pro
4.532ProAla: 4.532 ± 1.123
0.504ProCys: 0.504 ± 0.391
4.028ProAsp: 4.028 ± 0.365
0.504ProGlu: 0.504 ± 0.326
3.021ProPhe: 3.021 ± 0.631
2.014ProGly: 2.014 ± 0.546
4.532ProHis: 4.532 ± 0.196
2.014ProIle: 2.014 ± 1.565
1.511ProLys: 1.511 ± 0.702
7.049ProLeu: 7.049 ± 1.187
2.518ProMet: 2.518 ± 1.258
3.021ProAsn: 3.021 ± 1.358
3.525ProPro: 3.525 ± 0.695
1.511ProGln: 1.511 ± 0.802
6.042ProArg: 6.042 ± 2.082
4.028ProSer: 4.028 ± 1.019
8.056ProThr: 8.056 ± 1.697
4.028ProVal: 4.028 ± 0.365
1.511ProTrp: 1.511 ± 0.485
1.511ProTyr: 1.511 ± 0.614
0.0ProXaa: 0.0 ± 0.0
Gln
1.007GlnAla: 1.007 ± 0.481
0.504GlnCys: 0.504 ± 0.391
0.504GlnAsp: 0.504 ± 0.326
0.0GlnGlu: 0.0 ± 0.0
0.504GlnPhe: 0.504 ± 0.391
2.014GlnGly: 2.014 ± 0.476
0.504GlnHis: 0.504 ± 0.326
1.511GlnIle: 1.511 ± 0.162
1.007GlnLys: 1.007 ± 0.783
3.021GlnLeu: 3.021 ± 0.826
0.504GlnMet: 0.504 ± 0.391
0.504GlnAsn: 0.504 ± 0.326
2.014GlnPro: 2.014 ± 0.546
0.0GlnGln: 0.0 ± 0.0
2.518GlnArg: 2.518 ± 1.141
3.021GlnSer: 3.021 ± 1.358
2.518GlnThr: 2.518 ± 1.23
0.504GlnVal: 0.504 ± 0.326
0.0GlnTrp: 0.0 ± 0.0
3.525GlnTyr: 3.525 ± 0.695
0.0GlnXaa: 0.0 ± 0.0
Arg
4.532ArgAla: 4.532 ± 0.196
2.518ArgCys: 2.518 ± 0.736
6.042ArgAsp: 6.042 ± 1.261
2.518ArgGlu: 2.518 ± 0.502
5.539ArgPhe: 5.539 ± 1.954
4.028ArgGly: 4.028 ± 0.849
2.014ArgHis: 2.014 ± 0.476
2.014ArgIle: 2.014 ± 0.771
0.504ArgLys: 0.504 ± 0.326
8.56ArgLeu: 8.56 ± 0.386
1.511ArgMet: 1.511 ± 0.776
5.035ArgAsn: 5.035 ± 1.369
2.518ArgPro: 2.518 ± 0.25
2.014ArgGln: 2.014 ± 0.476
9.063ArgArg: 9.063 ± 2.388
8.056ArgSer: 8.056 ± 1.835
4.532ArgThr: 4.532 ± 0.196
5.539ArgVal: 5.539 ± 1.379
1.007ArgTrp: 1.007 ± 0.297
3.021ArgTyr: 3.021 ± 0.631
0.0ArgXaa: 0.0 ± 0.0
Ser
10.574SerAla: 10.574 ± 2.048
2.518SerCys: 2.518 ± 0.736
5.539SerAsp: 5.539 ± 0.259
2.014SerGlu: 2.014 ± 0.962
2.518SerPhe: 2.518 ± 0.406
4.532SerGly: 4.532 ± 1.235
1.511SerHis: 1.511 ± 0.977
5.539SerIle: 5.539 ± 0.844
2.014SerLys: 2.014 ± 0.183
6.042SerLeu: 6.042 ± 0.766
2.014SerMet: 2.014 ± 0.546
1.511SerAsn: 1.511 ± 0.557
5.035SerPro: 5.035 ± 1.127
5.035SerGln: 5.035 ± 0.499
7.049SerArg: 7.049 ± 0.595
14.602SerSer: 14.602 ± 2.078
4.532SerThr: 4.532 ± 1.297
10.574SerVal: 10.574 ± 2.333
2.518SerTrp: 2.518 ± 1.079
6.546SerTyr: 6.546 ± 1.113
0.0SerXaa: 0.0 ± 0.0
Thr
4.532ThrAla: 4.532 ± 0.815
0.504ThrCys: 0.504 ± 0.408
1.007ThrAsp: 1.007 ± 0.364
2.518ThrGlu: 2.518 ± 0.406
4.028ThrPhe: 4.028 ± 1.019
3.021ThrGly: 3.021 ± 1.358
2.014ThrHis: 2.014 ± 0.837
2.518ThrIle: 2.518 ± 0.406
1.511ThrLys: 1.511 ± 0.614
9.567ThrLeu: 9.567 ± 2.341
0.504ThrMet: 0.504 ± 0.326
0.0ThrAsn: 0.0 ± 0.0
6.546ThrPro: 6.546 ± 0.873
1.007ThrGln: 1.007 ± 0.364
5.035ThrArg: 5.035 ± 2.516
5.539ThrSer: 5.539 ± 0.756
7.049ThrThr: 7.049 ± 3.062
4.532ThrVal: 4.532 ± 1.491
0.504ThrTrp: 0.504 ± 0.391
2.518ThrTyr: 2.518 ± 0.406
0.0ThrXaa: 0.0 ± 0.0
Val
7.553ValAla: 7.553 ± 0.917
2.014ValCys: 2.014 ± 0.771
2.014ValAsp: 2.014 ± 0.729
2.014ValGlu: 2.014 ± 0.183
3.525ValPhe: 3.525 ± 0.609
3.021ValGly: 3.021 ± 0.826
2.014ValHis: 2.014 ± 0.771
2.014ValIle: 2.014 ± 0.476
2.518ValLys: 2.518 ± 0.856
5.539ValLeu: 5.539 ± 0.411
0.504ValMet: 0.504 ± 0.408
1.511ValAsn: 1.511 ± 0.557
4.532ValPro: 4.532 ± 1.847
1.007ValGln: 1.007 ± 0.481
5.035ValArg: 5.035 ± 1.085
6.546ValSer: 6.546 ± 1.208
7.553ValThr: 7.553 ± 0.812
5.539ValVal: 5.539 ± 1.954
0.504ValTrp: 0.504 ± 0.408
3.021ValTyr: 3.021 ± 1.604
0.0ValXaa: 0.0 ± 0.0
Trp
2.014TrpAla: 2.014 ± 0.771
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
1.007TrpGlu: 1.007 ± 0.783
1.007TrpPhe: 1.007 ± 0.364
0.504TrpGly: 0.504 ± 0.408
0.0TrpHis: 0.0 ± 0.0
1.007TrpIle: 1.007 ± 0.297
0.0TrpLys: 0.0 ± 0.0
3.021TrpLeu: 3.021 ± 0.794
0.0TrpMet: 0.0 ± 0.0
1.007TrpAsn: 1.007 ± 0.364
0.504TrpPro: 0.504 ± 0.391
0.504TrpGln: 0.504 ± 0.391
0.0TrpArg: 0.0 ± 0.0
2.518TrpSer: 2.518 ± 0.884
0.0TrpThr: 0.0 ± 0.0
0.504TrpVal: 0.504 ± 0.391
0.0TrpTrp: 0.0 ± 0.0
0.504TrpTyr: 0.504 ± 0.326
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.028TyrAla: 4.028 ± 0.603
3.021TyrCys: 3.021 ± 0.332
4.028TyrAsp: 4.028 ± 1.674
2.518TyrGlu: 2.518 ± 1.369
2.518TyrPhe: 2.518 ± 0.736
1.007TyrGly: 1.007 ± 0.481
2.014TyrHis: 2.014 ± 0.183
0.504TyrIle: 0.504 ± 0.408
0.0TyrLys: 0.0 ± 0.0
5.035TyrLeu: 5.035 ± 2.461
1.007TyrMet: 1.007 ± 0.364
2.014TyrAsn: 2.014 ± 0.729
2.014TyrPro: 2.014 ± 1.179
1.511TyrGln: 1.511 ± 0.557
3.021TyrArg: 3.021 ± 0.909
5.539TyrSer: 5.539 ± 0.876
1.007TyrThr: 1.007 ± 0.297
2.014TyrVal: 2.014 ± 0.546
0.504TyrTrp: 0.504 ± 0.326
4.532TyrTyr: 4.532 ± 1.455
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1987 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
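As a rough illustration of what bootstrapping at the protein level can look like, the sketch below resamples whole proteins with replacement 100 times and recomputes one dipeptide frequency for each resample. It is a minimal sketch under stated assumptions, not the procedure actually used for this table: the function names are hypothetical, dipeptides are assumed to be counted as overlapping pairs within each protein, and the reported error is assumed to be the sample standard deviation across resamples.

```python
import random
import statistics
from collections import Counter


def dipeptide_permille(proteins):
    """Dipeptide frequencies in per mille (overlapping pairs per protein)."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def bootstrap_error(proteins, dipeptide, rounds=100, seed=0):
    """Return (mean, standard deviation) of one dipeptide frequency.

    Whole proteins are drawn with replacement, so each resample has the
    same number of proteins as the input. The seed is fixed only to make
    the sketch reproducible.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(rounds):
        resampled = rng.choices(proteins, k=len(proteins))
        samples.append(dipeptide_permille(resampled).get(dipeptide, 0.0))
    return statistics.mean(samples), statistics.stdev(samples)


if __name__ == "__main__":
    # Toy sequences for demonstration only, not the Lake Sinai virus 2 proteome.
    toy = ["MAAALSSR", "MSSVALLR", "MKAASSLL"]
    mean, error = bootstrap_error(toy, "SS")
    print(f"SerSer: {mean:.3f} ± {error:.3f}")
```

With only 3 proteins in the proteome, each resample can contain only a few distinct combinations of proteins, so error estimates of this kind are necessarily coarse.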

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski