Amino acid dipepetide frequency for Salmonella virus VSt472

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.754AlaAla: 7.754 ± 1.236
0.485AlaCys: 0.485 ± 0.174
4.362AlaAsp: 4.362 ± 0.65
7.408AlaGlu: 7.408 ± 0.962
1.662AlaPhe: 1.662 ± 0.383
7.062AlaGly: 7.062 ± 0.855
1.038AlaHis: 1.038 ± 0.308
5.677AlaIle: 5.677 ± 0.585
6.439AlaLys: 6.439 ± 0.755
6.854AlaLeu: 6.854 ± 0.758
2.423AlaMet: 2.423 ± 0.425
3.669AlaAsn: 3.669 ± 0.466
1.939AlaPro: 1.939 ± 0.312
2.977AlaGln: 2.977 ± 0.696
4.985AlaArg: 4.985 ± 0.636
5.262AlaSer: 5.262 ± 0.656
4.985AlaThr: 4.985 ± 0.783
5.4AlaVal: 5.4 ± 0.7
1.454AlaTrp: 1.454 ± 0.34
2.769AlaTyr: 2.769 ± 0.438
0.0AlaXaa: 0.0 ± 0.0
Cys
1.523CysAla: 1.523 ± 0.301
0.277CysCys: 0.277 ± 0.131
0.831CysAsp: 0.831 ± 0.229
1.315CysGlu: 1.315 ± 0.297
0.554CysPhe: 0.554 ± 0.202
1.869CysGly: 1.869 ± 0.436
0.692CysHis: 0.692 ± 0.211
0.969CysIle: 0.969 ± 0.287
1.385CysLys: 1.385 ± 0.326
0.831CysLeu: 0.831 ± 0.222
0.208CysMet: 0.208 ± 0.126
1.038CysAsn: 1.038 ± 0.317
0.623CysPro: 0.623 ± 0.192
0.415CysGln: 0.415 ± 0.17
1.108CysArg: 1.108 ± 0.327
1.038CysSer: 1.038 ± 0.287
0.692CysThr: 0.692 ± 0.198
0.277CysVal: 0.277 ± 0.145
0.277CysTrp: 0.277 ± 0.122
0.485CysTyr: 0.485 ± 0.202
0.0CysXaa: 0.0 ± 0.0
Asp
6.646AspAla: 6.646 ± 0.871
1.108AspCys: 1.108 ± 0.287
4.916AspAsp: 4.916 ± 0.824
4.5AspGlu: 4.5 ± 0.537
3.739AspPhe: 3.739 ± 0.537
6.439AspGly: 6.439 ± 0.925
1.108AspHis: 1.108 ± 0.277
4.154AspIle: 4.154 ± 0.5
4.5AspLys: 4.5 ± 0.541
2.354AspLeu: 2.354 ± 0.456
1.8AspMet: 1.8 ± 0.415
2.631AspAsn: 2.631 ± 0.393
1.731AspPro: 1.731 ± 0.33
1.177AspGln: 1.177 ± 0.344
2.769AspArg: 2.769 ± 0.444
3.115AspSer: 3.115 ± 0.615
2.839AspThr: 2.839 ± 0.469
4.085AspVal: 4.085 ± 0.545
0.831AspTrp: 0.831 ± 0.226
2.769AspTyr: 2.769 ± 0.431
0.0AspXaa: 0.0 ± 0.0
Glu
5.331GluAla: 5.331 ± 0.62
1.246GluCys: 1.246 ± 0.249
3.531GluAsp: 3.531 ± 0.445
5.192GluGlu: 5.192 ± 0.671
2.562GluPhe: 2.562 ± 0.478
2.769GluGly: 2.769 ± 0.396
0.692GluHis: 0.692 ± 0.229
5.331GluIle: 5.331 ± 0.526
4.085GluLys: 4.085 ± 0.605
5.816GluLeu: 5.816 ± 0.606
2.562GluMet: 2.562 ± 0.465
2.492GluAsn: 2.492 ± 0.381
2.492GluPro: 2.492 ± 0.355
4.223GluGln: 4.223 ± 0.578
3.392GluArg: 3.392 ± 0.518
3.877GluSer: 3.877 ± 0.471
2.631GluThr: 2.631 ± 0.435
3.115GluVal: 3.115 ± 0.511
1.662GluTrp: 1.662 ± 0.321
2.839GluTyr: 2.839 ± 0.443
0.0GluXaa: 0.0 ± 0.0
Phe
1.592PheAla: 1.592 ± 0.343
1.038PheCys: 1.038 ± 0.253
3.185PheAsp: 3.185 ± 0.43
2.008PheGlu: 2.008 ± 0.364
0.831PhePhe: 0.831 ± 0.236
2.7PheGly: 2.7 ± 0.291
0.692PheHis: 0.692 ± 0.195
2.839PheIle: 2.839 ± 0.508
1.523PheLys: 1.523 ± 0.282
2.354PheLeu: 2.354 ± 0.348
1.454PheMet: 1.454 ± 0.294
2.423PheAsn: 2.423 ± 0.471
1.177PhePro: 1.177 ± 0.26
1.246PheGln: 1.246 ± 0.241
1.454PheArg: 1.454 ± 0.316
2.423PheSer: 2.423 ± 0.501
1.939PheThr: 1.939 ± 0.343
2.008PheVal: 2.008 ± 0.395
0.485PheTrp: 0.485 ± 0.182
1.177PheTyr: 1.177 ± 0.265
0.0PheXaa: 0.0 ± 0.0
Gly
5.4GlyAla: 5.4 ± 0.771
1.662GlyCys: 1.662 ± 0.44
5.262GlyAsp: 5.262 ± 0.616
4.846GlyGlu: 4.846 ± 0.521
2.423GlyPhe: 2.423 ± 0.398
4.916GlyGly: 4.916 ± 0.801
0.969GlyHis: 0.969 ± 0.244
4.639GlyIle: 4.639 ± 0.515
6.092GlyLys: 6.092 ± 0.698
4.777GlyLeu: 4.777 ± 0.585
2.354GlyMet: 2.354 ± 0.337
2.769GlyAsn: 2.769 ± 0.372
0.623GlyPro: 0.623 ± 0.2
1.8GlyGln: 1.8 ± 0.395
4.016GlyArg: 4.016 ± 0.548
5.539GlySer: 5.539 ± 0.842
3.6GlyThr: 3.6 ± 0.585
5.262GlyVal: 5.262 ± 0.571
1.038GlyTrp: 1.038 ± 0.271
4.154GlyTyr: 4.154 ± 0.534
0.0GlyXaa: 0.0 ± 0.0
His
1.177HisAla: 1.177 ± 0.319
0.623HisCys: 0.623 ± 0.212
1.038HisAsp: 1.038 ± 0.217
1.038HisGlu: 1.038 ± 0.257
0.554HisPhe: 0.554 ± 0.167
2.839HisGly: 2.839 ± 0.687
0.762HisHis: 0.762 ± 0.244
0.969HisIle: 0.969 ± 0.348
1.523HisLys: 1.523 ± 0.316
1.662HisLeu: 1.662 ± 0.384
0.485HisMet: 0.485 ± 0.18
0.969HisAsn: 0.969 ± 0.23
0.692HisPro: 0.692 ± 0.185
0.554HisGln: 0.554 ± 0.178
1.177HisArg: 1.177 ± 0.283
0.692HisSer: 0.692 ± 0.207
0.692HisThr: 0.692 ± 0.241
1.246HisVal: 1.246 ± 0.326
0.069HisTrp: 0.069 ± 0.076
0.831HisTyr: 0.831 ± 0.245
0.0HisXaa: 0.0 ± 0.0
Ile
5.123IleAla: 5.123 ± 0.529
1.108IleCys: 1.108 ± 0.285
4.985IleAsp: 4.985 ± 0.493
4.708IleGlu: 4.708 ± 0.465
2.077IlePhe: 2.077 ± 0.356
4.777IleGly: 4.777 ± 0.525
1.385IleHis: 1.385 ± 0.305
4.362IleIle: 4.362 ± 0.581
3.877IleLys: 3.877 ± 0.557
4.5IleLeu: 4.5 ± 0.636
1.731IleMet: 1.731 ± 0.415
3.462IleAsn: 3.462 ± 0.46
2.285IlePro: 2.285 ± 0.358
1.8IleGln: 1.8 ± 0.327
3.6IleArg: 3.6 ± 0.493
4.985IleSer: 4.985 ± 0.632
4.431IleThr: 4.431 ± 0.488
4.846IleVal: 4.846 ± 0.447
0.554IleTrp: 0.554 ± 0.195
1.662IleTyr: 1.662 ± 0.354
0.0IleXaa: 0.0 ± 0.0
Lys
6.231LysAla: 6.231 ± 0.624
0.9LysCys: 0.9 ± 0.245
3.808LysAsp: 3.808 ± 0.544
3.254LysGlu: 3.254 ± 0.611
2.492LysPhe: 2.492 ± 0.406
3.392LysGly: 3.392 ± 0.604
2.008LysHis: 2.008 ± 0.508
4.708LysIle: 4.708 ± 0.472
5.331LysLys: 5.331 ± 0.832
5.192LysLeu: 5.192 ± 0.66
3.185LysMet: 3.185 ± 0.462
2.908LysAsn: 2.908 ± 0.427
2.354LysPro: 2.354 ± 0.487
3.185LysGln: 3.185 ± 0.596
3.739LysArg: 3.739 ± 0.573
4.846LysSer: 4.846 ± 0.64
3.877LysThr: 3.877 ± 0.556
3.946LysVal: 3.946 ± 0.485
1.385LysTrp: 1.385 ± 0.301
2.423LysTyr: 2.423 ± 0.377
0.0LysXaa: 0.0 ± 0.0
Leu
5.539LeuAla: 5.539 ± 0.877
0.692LeuCys: 0.692 ± 0.205
3.531LeuAsp: 3.531 ± 0.534
3.946LeuGlu: 3.946 ± 0.444
2.354LeuPhe: 2.354 ± 0.333
4.846LeuGly: 4.846 ± 0.581
1.385LeuHis: 1.385 ± 0.271
4.431LeuIle: 4.431 ± 0.54
4.916LeuLys: 4.916 ± 0.681
4.569LeuLeu: 4.569 ± 0.668
1.731LeuMet: 1.731 ± 0.323
3.323LeuAsn: 3.323 ± 0.598
3.254LeuPro: 3.254 ± 0.545
3.046LeuGln: 3.046 ± 0.522
4.639LeuArg: 4.639 ± 0.493
4.639LeuSer: 4.639 ± 0.5
4.362LeuThr: 4.362 ± 0.476
4.916LeuVal: 4.916 ± 0.662
1.731LeuTrp: 1.731 ± 0.323
2.492LeuTyr: 2.492 ± 0.45
0.0LeuXaa: 0.0 ± 0.0
Met
3.185MetAla: 3.185 ± 0.509
0.554MetCys: 0.554 ± 0.207
1.8MetAsp: 1.8 ± 0.349
1.662MetGlu: 1.662 ± 0.307
0.969MetPhe: 0.969 ± 0.228
1.523MetGly: 1.523 ± 0.404
0.623MetHis: 0.623 ± 0.249
1.8MetIle: 1.8 ± 0.425
2.7MetLys: 2.7 ± 0.415
2.008MetLeu: 2.008 ± 0.368
1.177MetMet: 1.177 ± 0.327
1.454MetAsn: 1.454 ± 0.257
1.731MetPro: 1.731 ± 0.404
1.246MetGln: 1.246 ± 0.328
1.662MetArg: 1.662 ± 0.275
2.492MetSer: 2.492 ± 0.46
1.592MetThr: 1.592 ± 0.364
0.9MetVal: 0.9 ± 0.262
0.415MetTrp: 0.415 ± 0.152
1.038MetTyr: 1.038 ± 0.269
0.0MetXaa: 0.0 ± 0.0
Asn
4.292AsnAla: 4.292 ± 0.636
0.346AsnCys: 0.346 ± 0.126
3.531AsnAsp: 3.531 ± 0.577
2.423AsnGlu: 2.423 ± 0.347
1.038AsnPhe: 1.038 ± 0.288
4.569AsnGly: 4.569 ± 0.476
1.662AsnHis: 1.662 ± 0.345
2.908AsnIle: 2.908 ± 0.386
3.323AsnLys: 3.323 ± 0.516
2.839AsnLeu: 2.839 ± 0.397
1.108AsnMet: 1.108 ± 0.263
2.423AsnAsn: 2.423 ± 0.435
1.592AsnPro: 1.592 ± 0.37
1.108AsnGln: 1.108 ± 0.242
2.562AsnArg: 2.562 ± 0.444
2.839AsnSer: 2.839 ± 0.414
2.285AsnThr: 2.285 ± 0.49
3.185AsnVal: 3.185 ± 0.585
0.554AsnTrp: 0.554 ± 0.211
1.592AsnTyr: 1.592 ± 0.319
0.0AsnXaa: 0.0 ± 0.0
Pro
3.115ProAla: 3.115 ± 0.567
0.692ProCys: 0.692 ± 0.19
2.354ProAsp: 2.354 ± 0.41
3.115ProGlu: 3.115 ± 0.425
1.108ProPhe: 1.108 ± 0.263
2.146ProGly: 2.146 ± 0.436
0.485ProHis: 0.485 ± 0.172
1.8ProIle: 1.8 ± 0.335
1.523ProLys: 1.523 ± 0.33
1.8ProLeu: 1.8 ± 0.424
0.692ProMet: 0.692 ± 0.213
1.315ProAsn: 1.315 ± 0.278
0.762ProPro: 0.762 ± 0.274
1.385ProGln: 1.385 ± 0.407
1.315ProArg: 1.315 ± 0.295
2.146ProSer: 2.146 ± 0.371
1.939ProThr: 1.939 ± 0.325
3.115ProVal: 3.115 ± 0.482
0.485ProTrp: 0.485 ± 0.217
1.454ProTyr: 1.454 ± 0.344
0.0ProXaa: 0.0 ± 0.0
Gln
2.7GlnAla: 2.7 ± 0.564
0.346GlnCys: 0.346 ± 0.166
1.592GlnAsp: 1.592 ± 0.381
2.7GlnGlu: 2.7 ± 0.391
1.8GlnPhe: 1.8 ± 0.356
0.9GlnGly: 0.9 ± 0.258
1.177GlnHis: 1.177 ± 0.318
2.423GlnIle: 2.423 ± 0.401
2.562GlnLys: 2.562 ± 0.467
3.669GlnLeu: 3.669 ± 0.589
1.662GlnMet: 1.662 ± 0.38
0.969GlnAsn: 0.969 ± 0.271
1.592GlnPro: 1.592 ± 0.34
2.631GlnGln: 2.631 ± 0.585
2.008GlnArg: 2.008 ± 0.412
2.839GlnSer: 2.839 ± 0.414
2.008GlnThr: 2.008 ± 0.501
2.215GlnVal: 2.215 ± 0.356
0.692GlnTrp: 0.692 ± 0.251
1.246GlnTyr: 1.246 ± 0.341
0.0GlnXaa: 0.0 ± 0.0
Arg
5.054ArgAla: 5.054 ± 0.595
1.177ArgCys: 1.177 ± 0.38
2.562ArgAsp: 2.562 ± 0.488
3.323ArgGlu: 3.323 ± 0.539
1.939ArgPhe: 1.939 ± 0.417
3.046ArgGly: 3.046 ± 0.463
0.623ArgHis: 0.623 ± 0.201
4.085ArgIle: 4.085 ± 0.5
4.292ArgLys: 4.292 ± 0.803
4.292ArgLeu: 4.292 ± 0.617
1.523ArgMet: 1.523 ± 0.357
3.323ArgAsn: 3.323 ± 0.441
1.177ArgPro: 1.177 ± 0.261
1.939ArgGln: 1.939 ± 0.513
3.046ArgArg: 3.046 ± 0.469
3.115ArgSer: 3.115 ± 0.43
1.939ArgThr: 1.939 ± 0.426
3.531ArgVal: 3.531 ± 0.53
0.692ArgTrp: 0.692 ± 0.226
2.839ArgTyr: 2.839 ± 0.336
0.0ArgXaa: 0.0 ± 0.0
Ser
5.954SerAla: 5.954 ± 0.656
1.454SerCys: 1.454 ± 0.26
4.708SerAsp: 4.708 ± 0.539
3.6SerGlu: 3.6 ± 0.52
2.423SerPhe: 2.423 ± 0.506
6.508SerGly: 6.508 ± 0.665
0.969SerHis: 0.969 ± 0.215
3.115SerIle: 3.115 ± 0.453
3.877SerLys: 3.877 ± 0.593
5.4SerLeu: 5.4 ± 0.724
1.8SerMet: 1.8 ± 0.312
2.631SerAsn: 2.631 ± 0.363
2.354SerPro: 2.354 ± 0.372
2.7SerGln: 2.7 ± 0.371
3.323SerArg: 3.323 ± 0.505
3.808SerSer: 3.808 ± 0.587
4.016SerThr: 4.016 ± 0.811
3.808SerVal: 3.808 ± 0.541
1.108SerTrp: 1.108 ± 0.231
2.492SerTyr: 2.492 ± 0.426
0.0SerXaa: 0.0 ± 0.0
Thr
4.5ThrAla: 4.5 ± 0.877
0.831ThrCys: 0.831 ± 0.259
3.115ThrAsp: 3.115 ± 0.414
4.085ThrGlu: 4.085 ± 0.473
1.662ThrPhe: 1.662 ± 0.453
4.985ThrGly: 4.985 ± 0.572
0.831ThrHis: 0.831 ± 0.211
3.115ThrIle: 3.115 ± 0.541
3.185ThrLys: 3.185 ± 0.472
3.877ThrLeu: 3.877 ± 0.529
1.385ThrMet: 1.385 ± 0.275
2.354ThrAsn: 2.354 ± 0.548
2.977ThrPro: 2.977 ± 0.518
1.869ThrGln: 1.869 ± 0.341
2.7ThrArg: 2.7 ± 0.557
4.016ThrSer: 4.016 ± 0.656
3.739ThrThr: 3.739 ± 0.816
2.977ThrVal: 2.977 ± 0.494
0.692ThrTrp: 0.692 ± 0.245
1.454ThrTyr: 1.454 ± 0.275
0.0ThrXaa: 0.0 ± 0.0
Val
5.469ValAla: 5.469 ± 0.763
1.038ValCys: 1.038 ± 0.306
4.292ValAsp: 4.292 ± 0.588
4.016ValGlu: 4.016 ± 0.485
2.215ValPhe: 2.215 ± 0.464
3.462ValGly: 3.462 ± 0.57
1.385ValHis: 1.385 ± 0.345
5.539ValIle: 5.539 ± 0.67
4.569ValLys: 4.569 ± 0.563
3.669ValLeu: 3.669 ± 0.508
2.077ValMet: 2.077 ± 0.349
3.739ValAsn: 3.739 ± 0.496
1.523ValPro: 1.523 ± 0.308
2.285ValGln: 2.285 ± 0.332
2.562ValArg: 2.562 ± 0.395
4.223ValSer: 4.223 ± 0.547
3.808ValThr: 3.808 ± 0.791
4.292ValVal: 4.292 ± 0.748
1.038ValTrp: 1.038 ± 0.277
2.215ValTyr: 2.215 ± 0.351
0.0ValXaa: 0.0 ± 0.0
Trp
1.108TrpAla: 1.108 ± 0.292
0.346TrpCys: 0.346 ± 0.185
0.692TrpAsp: 0.692 ± 0.227
0.623TrpGlu: 0.623 ± 0.179
0.831TrpPhe: 0.831 ± 0.253
0.415TrpGly: 0.415 ± 0.214
0.485TrpHis: 0.485 ± 0.172
1.038TrpIle: 1.038 ± 0.277
1.315TrpLys: 1.315 ± 0.35
1.523TrpLeu: 1.523 ± 0.341
0.277TrpMet: 0.277 ± 0.157
0.762TrpAsn: 0.762 ± 0.236
0.692TrpPro: 0.692 ± 0.21
0.831TrpGln: 0.831 ± 0.277
1.523TrpArg: 1.523 ± 0.303
0.831TrpSer: 0.831 ± 0.226
0.831TrpThr: 0.831 ± 0.238
1.246TrpVal: 1.246 ± 0.279
0.138TrpTrp: 0.138 ± 0.098
0.346TrpTyr: 0.346 ± 0.123
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.908TyrAla: 2.908 ± 0.448
0.485TyrCys: 0.485 ± 0.167
3.115TyrAsp: 3.115 ± 0.464
2.077TyrGlu: 2.077 ± 0.316
1.315TyrPhe: 1.315 ± 0.384
2.977TyrGly: 2.977 ± 0.512
0.623TyrHis: 0.623 ± 0.228
2.354TyrIle: 2.354 ± 0.427
2.146TyrLys: 2.146 ± 0.394
2.354TyrLeu: 2.354 ± 0.48
0.831TyrMet: 0.831 ± 0.236
1.523TyrAsn: 1.523 ± 0.329
1.177TyrPro: 1.177 ± 0.288
1.385TyrGln: 1.385 ± 0.288
1.939TyrArg: 1.939 ± 0.406
3.323TyrSer: 3.323 ± 0.45
2.146TyrThr: 2.146 ± 0.421
3.046TyrVal: 3.046 ± 0.454
0.554TyrTrp: 0.554 ± 0.178
1.246TyrTyr: 1.246 ± 0.363
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 82 proteins (14445 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski