Amino acid dipeptide frequency for Salmonella virus VSe103

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
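As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, and the use of one-letter codes are assumptions made for this example; it is not the code used to build this page.

```python
from collections import Counter


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} pooled over all proteins.

    Assumption of this sketch: overlapping pairs are counted within each
    protein and never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences in one-letter code (not taken from VSe103).
toy = ["MAAK", "AAG"]
freqs = dipeptide_permille(toy)
```

If pairs are indeed counted within proteins only, 57 proteins with 13048 residues would give 13048 - 57 = 12991 dipeptides, so a single occurrence would correspond to about 0.077‰, which agrees with the smallest nonzero values listed on this page.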

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
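The expected value and the over/under marking can be sketched as follows. The exact rule used for the colouring is not stated on this page, so the plain comparison against the uniform expectation (ignoring the error estimate) is an assumption, as is the helper name `classify`. The three example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids; 21 if Xaa is included

# Uniform expectation: 1 / 400 = 0.25 % = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_permille > expected:
        return "over"   # would be shown in red
    if observed_permille < expected:
        return "under"  # would be shown in blue
    return "expected"


# Values (in per mille) copied from the table.
examples = {"AlaAla": 11.497, "CysCys": 0.153, "ProAla": 2.683}
labels = {name: classify(value) for name, value in examples.items()}
```

With Xaa counted as a 21st residue, the same expectation would be 1000 / 441, roughly 2.27‰.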

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions (for example, 11.497‰ = 0.011497).

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 11.497 ± 1.561
AlaCys: 1.226 ± 0.271
AlaAsp: 6.132 ± 0.799
AlaGlu: 6.515 ± 0.78
AlaPhe: 3.909 ± 0.571
AlaGly: 7.435 ± 0.781
AlaHis: 1.993 ± 0.4
AlaIle: 3.756 ± 0.567
AlaLys: 5.365 ± 0.729
AlaLeu: 7.665 ± 0.881
AlaMet: 2.376 ± 0.425
AlaAsn: 3.602 ± 0.508
AlaPro: 3.526 ± 0.545
AlaGln: 3.602 ± 0.729
AlaArg: 4.445 ± 0.574
AlaSer: 5.978 ± 0.896
AlaThr: 5.595 ± 0.741
AlaVal: 7.665 ± 0.853
AlaTrp: 1.303 ± 0.265
AlaTyr: 3.066 ± 0.482
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.766 ± 0.209
CysCys: 0.153 ± 0.113
CysAsp: 0.613 ± 0.208
CysGlu: 1.226 ± 0.366
CysPhe: 0.383 ± 0.188
CysGly: 0.537 ± 0.194
CysHis: 0.23 ± 0.123
CysIle: 0.307 ± 0.126
CysLys: 1.073 ± 0.321
CysLeu: 0.92 ± 0.264
CysMet: 0.383 ± 0.182
CysAsn: 0.46 ± 0.234
CysPro: 0.23 ± 0.126
CysGln: 0.153 ± 0.117
CysArg: 0.92 ± 0.283
CysSer: 0.307 ± 0.2
CysThr: 0.613 ± 0.216
CysVal: 0.69 ± 0.218
CysTrp: 0.23 ± 0.122
CysTyr: 0.23 ± 0.126
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.668 ± 0.684
AspCys: 0.613 ± 0.238
AspAsp: 3.986 ± 0.526
AspGlu: 3.832 ± 0.54
AspPhe: 3.066 ± 0.499
AspGly: 6.362 ± 0.921
AspHis: 0.766 ± 0.241
AspIle: 3.372 ± 0.373
AspLys: 3.372 ± 0.469
AspLeu: 5.059 ± 0.562
AspMet: 1.38 ± 0.245
AspAsn: 2.836 ± 0.502
AspPro: 1.763 ± 0.392
AspGln: 0.537 ± 0.185
AspArg: 2.989 ± 0.459
AspSer: 3.372 ± 0.578
AspThr: 4.292 ± 0.477
AspVal: 3.832 ± 0.44
AspTrp: 0.92 ± 0.234
AspTyr: 1.993 ± 0.423
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.745 ± 0.838
GluCys: 0.307 ± 0.152
GluAsp: 3.526 ± 0.512
GluGlu: 4.599 ± 0.804
GluPhe: 3.219 ± 0.631
GluGly: 4.599 ± 0.637
GluHis: 1.073 ± 0.278
GluIle: 3.449 ± 0.401
GluLys: 3.832 ± 0.617
GluLeu: 6.132 ± 0.891
GluMet: 2.529 ± 0.448
GluAsn: 2.453 ± 0.359
GluPro: 1.763 ± 0.525
GluGln: 3.219 ± 0.538
GluArg: 3.756 ± 0.601
GluSer: 3.449 ± 0.419
GluThr: 3.679 ± 0.513
GluVal: 4.599 ± 0.575
GluTrp: 0.766 ± 0.306
GluTyr: 1.993 ± 0.436
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.683 ± 0.552
PheCys: 0.46 ± 0.19
PheAsp: 3.296 ± 0.517
PheGlu: 2.989 ± 0.546
PhePhe: 0.46 ± 0.154
PheGly: 3.066 ± 0.478
PheHis: 0.69 ± 0.232
PheIle: 2.453 ± 0.507
PheLys: 1.686 ± 0.384
PheLeu: 2.376 ± 0.406
PheMet: 0.383 ± 0.16
PheAsn: 1.456 ± 0.393
PhePro: 1.763 ± 0.443
PheGln: 1.533 ± 0.331
PheArg: 2.146 ± 0.311
PheSer: 2.759 ± 0.618
PheThr: 3.372 ± 0.579
PheVal: 2.836 ± 0.508
PheTrp: 0.766 ± 0.254
PheTyr: 1.15 ± 0.337
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.588 ± 0.843
GlyCys: 0.843 ± 0.272
GlyAsp: 3.909 ± 0.654
GlyGlu: 5.672 ± 1.012
GlyPhe: 3.372 ± 0.549
GlyGly: 6.515 ± 0.934
GlyHis: 1.533 ± 0.431
GlyIle: 2.989 ± 0.459
GlyLys: 5.365 ± 0.583
GlyLeu: 5.748 ± 0.603
GlyMet: 2.299 ± 0.628
GlyAsn: 3.679 ± 0.529
GlyPro: 1.533 ± 0.323
GlyGln: 3.066 ± 0.498
GlyArg: 4.445 ± 0.502
GlySer: 5.135 ± 0.797
GlyThr: 3.909 ± 0.577
GlyVal: 5.595 ± 0.644
GlyTrp: 0.996 ± 0.278
GlyTyr: 2.989 ± 0.524
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.15 ± 0.333
HisCys: 0.383 ± 0.17
HisAsp: 0.92 ± 0.214
HisGlu: 0.843 ± 0.255
HisPhe: 0.766 ± 0.3
HisGly: 0.69 ± 0.206
HisHis: 0.766 ± 0.289
HisIle: 0.996 ± 0.301
HisLys: 0.92 ± 0.224
HisLeu: 1.15 ± 0.291
HisMet: 0.46 ± 0.197
HisAsn: 0.537 ± 0.207
HisPro: 1.073 ± 0.313
HisGln: 0.92 ± 0.257
HisArg: 1.073 ± 0.302
HisSer: 0.996 ± 0.24
HisThr: 0.996 ± 0.329
HisVal: 0.69 ± 0.222
HisTrp: 0.077 ± 0.069
HisTyr: 0.996 ± 0.308
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.292 ± 0.668
IleCys: 0.69 ± 0.232
IleAsp: 3.986 ± 0.457
IleGlu: 2.989 ± 0.564
IlePhe: 1.303 ± 0.362
IleGly: 3.296 ± 0.344
IleHis: 0.613 ± 0.176
IleIle: 2.223 ± 0.379
IleLys: 2.683 ± 0.536
IleLeu: 3.066 ± 0.524
IleMet: 1.15 ± 0.309
IleAsn: 1.993 ± 0.44
IlePro: 2.683 ± 0.427
IleGln: 1.916 ± 0.416
IleArg: 2.606 ± 0.353
IleSer: 2.913 ± 0.418
IleThr: 4.599 ± 0.614
IleVal: 3.219 ± 0.499
IleTrp: 0.843 ± 0.232
IleTyr: 1.303 ± 0.311
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.289 ± 0.789
LysCys: 0.46 ± 0.214
LysAsp: 3.756 ± 0.577
LysGlu: 4.139 ± 0.64
LysPhe: 2.299 ± 0.348
LysGly: 3.679 ± 0.448
LysHis: 1.073 ± 0.262
LysIle: 1.456 ± 0.352
LysLys: 3.066 ± 0.558
LysLeu: 5.365 ± 0.705
LysMet: 2.836 ± 0.507
LysAsn: 2.606 ± 0.408
LysPro: 2.299 ± 0.466
LysGln: 2.453 ± 0.495
LysArg: 3.986 ± 0.52
LysSer: 2.606 ± 0.49
LysThr: 4.139 ± 0.498
LysVal: 4.062 ± 0.594
LysTrp: 0.766 ± 0.247
LysTyr: 2.606 ± 0.477
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.898 ± 0.542
LeuCys: 0.843 ± 0.274
LeuAsp: 4.139 ± 0.578
LeuGlu: 4.522 ± 0.669
LeuPhe: 1.916 ± 0.421
LeuGly: 4.599 ± 0.578
LeuHis: 0.92 ± 0.26
LeuIle: 4.292 ± 0.578
LeuLys: 5.748 ± 0.773
LeuLeu: 6.055 ± 0.807
LeuMet: 2.069 ± 0.363
LeuAsn: 4.522 ± 0.596
LeuPro: 3.679 ± 0.515
LeuGln: 2.683 ± 0.362
LeuArg: 5.519 ± 0.785
LeuSer: 4.445 ± 0.527
LeuThr: 5.672 ± 0.569
LeuVal: 5.672 ± 0.508
LeuTrp: 1.38 ± 0.337
LeuTyr: 2.299 ± 0.348
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.376 ± 0.339
MetCys: 0.23 ± 0.123
MetAsp: 1.456 ± 0.379
MetGlu: 1.226 ± 0.238
MetPhe: 1.303 ± 0.322
MetGly: 1.686 ± 0.307
MetHis: 0.307 ± 0.143
MetIle: 1.073 ± 0.264
MetLys: 1.303 ± 0.371
MetLeu: 1.84 ± 0.406
MetMet: 0.69 ± 0.266
MetAsn: 1.15 ± 0.391
MetPro: 1.303 ± 0.326
MetGln: 0.766 ± 0.258
MetArg: 1.533 ± 0.304
MetSer: 1.993 ± 0.332
MetThr: 2.529 ± 0.376
MetVal: 1.84 ± 0.375
MetTrp: 0.46 ± 0.162
MetTyr: 0.766 ± 0.265
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.986 ± 0.568
AsnCys: 0.613 ± 0.223
AsnAsp: 2.376 ± 0.342
AsnGlu: 2.299 ± 0.376
AsnPhe: 1.763 ± 0.363
AsnGly: 4.599 ± 0.755
AsnHis: 0.613 ± 0.207
AsnIle: 3.219 ± 0.388
AsnLys: 1.916 ± 0.527
AsnLeu: 3.909 ± 0.395
AsnMet: 0.46 ± 0.186
AsnAsn: 2.376 ± 0.452
AsnPro: 1.84 ± 0.376
AsnGln: 1.303 ± 0.284
AsnArg: 2.683 ± 0.46
AsnSer: 2.146 ± 0.369
AsnThr: 1.84 ± 0.357
AsnVal: 3.909 ± 0.383
AsnTrp: 0.843 ± 0.28
AsnTyr: 1.456 ± 0.304
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.683 ± 0.459
ProCys: 0.46 ± 0.182
ProAsp: 2.683 ± 0.487
ProGlu: 3.679 ± 0.436
ProPhe: 1.61 ± 0.318
ProGly: 3.142 ± 0.465
ProHis: 0.537 ± 0.17
ProIle: 1.456 ± 0.41
ProLys: 2.836 ± 0.48
ProLeu: 3.296 ± 0.493
ProMet: 0.92 ± 0.265
ProAsn: 1.38 ± 0.449
ProPro: 1.456 ± 0.411
ProGln: 1.303 ± 0.283
ProArg: 1.916 ± 0.406
ProSer: 2.146 ± 0.389
ProThr: 1.686 ± 0.277
ProVal: 3.986 ± 0.511
ProTrp: 0.537 ± 0.268
ProTyr: 1.61 ± 0.378
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.292 ± 0.634
GlnCys: 0.23 ± 0.124
GlnAsp: 1.456 ± 0.306
GlnGlu: 2.376 ± 0.54
GlnPhe: 1.456 ± 0.379
GlnGly: 1.993 ± 0.426
GlnHis: 0.613 ± 0.185
GlnIle: 1.916 ± 0.449
GlnLys: 2.069 ± 0.365
GlnLeu: 2.759 ± 0.494
GlnMet: 1.073 ± 0.273
GlnAsn: 1.84 ± 0.342
GlnPro: 1.993 ± 0.295
GlnGln: 2.146 ± 0.489
GlnArg: 1.686 ± 0.332
GlnSer: 1.993 ± 0.462
GlnThr: 1.686 ± 0.345
GlnVal: 2.683 ± 0.417
GlnTrp: 0.613 ± 0.175
GlnTyr: 1.456 ± 0.306
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.829 ± 0.489
ArgCys: 0.383 ± 0.131
ArgAsp: 3.449 ± 0.476
ArgGlu: 3.756 ± 0.541
ArgPhe: 1.993 ± 0.421
ArgGly: 4.599 ± 0.593
ArgHis: 0.92 ± 0.258
ArgIle: 3.142 ± 0.459
ArgLys: 3.756 ± 0.624
ArgLeu: 3.986 ± 0.443
ArgMet: 1.916 ± 0.38
ArgAsn: 3.142 ± 0.535
ArgPro: 1.993 ± 0.369
ArgGln: 2.836 ± 0.467
ArgArg: 4.905 ± 0.781
ArgSer: 1.84 ± 0.26
ArgThr: 3.142 ± 0.522
ArgVal: 4.216 ± 0.508
ArgTrp: 0.92 ± 0.3
ArgTyr: 1.456 ± 0.383
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.902 ± 0.974
SerCys: 0.383 ± 0.194
SerAsp: 3.449 ± 0.542
SerGlu: 2.759 ± 0.441
SerPhe: 2.299 ± 0.314
SerGly: 6.745 ± 0.64
SerHis: 0.766 ± 0.209
SerIle: 2.376 ± 0.455
SerLys: 2.299 ± 0.481
SerLeu: 5.212 ± 0.616
SerMet: 0.996 ± 0.296
SerAsn: 2.683 ± 0.424
SerPro: 2.069 ± 0.486
SerGln: 1.916 ± 0.394
SerArg: 2.759 ± 0.416
SerSer: 3.142 ± 0.606
SerThr: 4.829 ± 0.844
SerVal: 5.289 ± 0.668
SerTrp: 0.766 ± 0.2
SerTyr: 1.84 ± 0.396
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.438 ± 0.678
ThrCys: 0.766 ± 0.306
ThrAsp: 4.752 ± 0.58
ThrGlu: 3.219 ± 0.543
ThrPhe: 2.989 ± 0.51
ThrGly: 5.902 ± 0.728
ThrHis: 0.92 ± 0.302
ThrIle: 3.142 ± 0.429
ThrLys: 3.449 ± 0.529
ThrLeu: 5.135 ± 0.678
ThrMet: 1.073 ± 0.349
ThrAsn: 1.916 ± 0.315
ThrPro: 4.216 ± 0.501
ThrGln: 1.916 ± 0.338
ThrArg: 3.219 ± 0.459
ThrSer: 4.292 ± 0.657
ThrThr: 4.062 ± 0.47
ThrVal: 4.599 ± 0.687
ThrTrp: 0.843 ± 0.252
ThrTyr: 2.529 ± 0.461
ThrXaa: 0.0 ± 0.0
Val
ValAla: 8.124 ± 0.814
ValCys: 0.766 ± 0.245
ValAsp: 4.062 ± 0.546
ValGlu: 6.055 ± 0.579
ValPhe: 2.376 ± 0.486
ValGly: 3.909 ± 0.59
ValHis: 0.92 ± 0.201
ValIle: 4.522 ± 0.606
ValLys: 5.135 ± 0.677
ValLeu: 4.369 ± 0.485
ValMet: 1.38 ± 0.329
ValAsn: 3.449 ± 0.562
ValPro: 2.376 ± 0.585
ValGln: 1.916 ± 0.362
ValArg: 3.296 ± 0.524
ValSer: 6.132 ± 0.844
ValThr: 5.902 ± 0.66
ValVal: 5.519 ± 0.79
ValTrp: 0.92 ± 0.244
ValTyr: 2.836 ± 0.413
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.303 ± 0.409
TrpCys: 0.077 ± 0.077
TrpAsp: 0.996 ± 0.29
TrpGlu: 0.383 ± 0.162
TrpPhe: 0.69 ± 0.27
TrpGly: 0.996 ± 0.216
TrpHis: 0.307 ± 0.227
TrpIle: 0.613 ± 0.205
TrpLys: 0.613 ± 0.243
TrpLeu: 1.763 ± 0.406
TrpMet: 0.383 ± 0.167
TrpAsn: 0.537 ± 0.186
TrpPro: 0.46 ± 0.2
TrpGln: 0.69 ± 0.211
TrpArg: 1.38 ± 0.394
TrpSer: 0.69 ± 0.213
TrpThr: 0.69 ± 0.203
TrpVal: 1.226 ± 0.256
TrpTrp: 0.153 ± 0.101
TrpTyr: 0.383 ± 0.208
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.142 ± 0.462
TyrCys: 0.537 ± 0.21
TyrAsp: 2.069 ± 0.532
TyrGlu: 2.606 ± 0.431
TyrPhe: 1.15 ± 0.343
TyrGly: 2.836 ± 0.435
TyrHis: 0.843 ± 0.242
TyrIle: 1.61 ± 0.343
TyrLys: 2.529 ± 0.434
TyrLeu: 2.069 ± 0.381
TyrMet: 0.996 ± 0.24
TyrAsn: 1.533 ± 0.348
TyrPro: 1.303 ± 0.324
TyrGln: 1.456 ± 0.352
TyrArg: 1.916 ± 0.445
TyrSer: 2.146 ± 0.397
TyrThr: 2.146 ± 0.368
TyrVal: 1.84 ± 0.34
TyrTrp: 0.23 ± 0.19
TyrTyr: 1.226 ± 0.249
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 57 proteins (13048 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
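A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the estimates is reported. The function names, the fixed seed, the toy sequences, and the choice of the sample standard deviation as the error measure are assumptions of this example; the page does not specify these details.

```python
import random
from statistics import stdev


def pair_permille(proteins, pair):
    """Frequency of one dipeptide (per mille) pooled over the given proteins."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the frequency over resamples of whole proteins."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return stdev(estimates)


# Hypothetical toy sequences in one-letter code (not taken from VSe103).
toy = ["MAAK", "AAG", "MKV"]
err_aa = bootstrap_error(toy, "AA")
err_ww = bootstrap_error(toy, "WW")  # pair absent everywhere
```

Resampling whole proteins rather than individual residue pairs keeps pairs from the same protein together, so the error reflects variation between proteins.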

The dipeptide statistics above (among other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski