Amino acid dipeptide frequency for Salmonella phage vB_SpuP_Spp16

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
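As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping dipeptides in a list of protein sequences and reports them in per mille, next to the uniform expectation of 1/400. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the toy sequences are assumptions made for this example; it is not necessarily how Proteome-pI computes its values.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping pairs.

    Any residue outside the 20 standard letters is mapped to 'X' (Xaa).
    Pairs are counted within each protein only, never across protein boundaries.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 400 dipeptides: 1/400 = 0.25 % = 2.5 per mille
uniform_per_mille = 1000.0 / len(STANDARD) ** 2

# Hypothetical toy sequences; 'U' is non-standard and is counted as Xaa
toy_proteins = ["MKALAL", "ALKU"]
freqs = dipeptide_frequencies(toy_proteins)
```

Comparing each entry of `freqs` with `uniform_per_mille` gives a simple over- or underrepresentation call of the kind the colour coding expresses.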

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue; within each group the second residue follows the order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.391 ± 0.729
AlaCys: 0.539 ± 0.199
AlaAsp: 4.775 ± 0.587
AlaGlu: 5.083 ± 1.024
AlaPhe: 2.234 ± 0.354
AlaGly: 4.775 ± 0.659
AlaHis: 1.078 ± 0.297
AlaIle: 4.313 ± 0.59
AlaLys: 5.468 ± 0.756
AlaLeu: 7.317 ± 0.688
AlaMet: 2.234 ± 0.391
AlaAsn: 4.467 ± 1.235
AlaPro: 2.002 ± 0.393
AlaGln: 3.543 ± 0.713
AlaArg: 4.39 ± 0.679
AlaSer: 4.313 ± 0.545
AlaThr: 4.775 ± 0.623
AlaVal: 4.621 ± 0.784
AlaTrp: 0.462 ± 0.137
AlaTyr: 3.158 ± 0.425
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.231 ± 0.149
CysCys: 0.231 ± 0.141
CysAsp: 0.693 ± 0.287
CysGlu: 0.308 ± 0.152
CysPhe: 0.462 ± 0.227
CysGly: 0.924 ± 0.268
CysHis: 0.308 ± 0.167
CysIle: 0.616 ± 0.352
CysLys: 0.462 ± 0.179
CysLeu: 0.924 ± 0.33
CysMet: 0.539 ± 0.227
CysAsn: 0.462 ± 0.187
CysPro: 0.847 ± 0.328
CysGln: 0.308 ± 0.183
CysArg: 0.693 ± 0.271
CysSer: 0.385 ± 0.145
CysThr: 0.539 ± 0.194
CysVal: 1.155 ± 0.347
CysTrp: 0.0 ± 0.0
CysTyr: 0.77 ± 0.214
CysXaa: 0.0 ± 0.0

Asp
AspAla: 4.082 ± 0.511
AspCys: 0.385 ± 0.178
AspAsp: 3.62 ± 0.474
AspGlu: 3.774 ± 0.459
AspPhe: 2.696 ± 0.407
AspGly: 4.467 ± 0.706
AspHis: 0.616 ± 0.257
AspIle: 4.544 ± 0.657
AspLys: 4.929 ± 0.557
AspLeu: 5.083 ± 0.696
AspMet: 1.925 ± 0.421
AspAsn: 3.004 ± 0.513
AspPro: 1.617 ± 0.465
AspGln: 1.771 ± 0.332
AspArg: 2.773 ± 0.439
AspSer: 3.697 ± 0.572
AspThr: 3.851 ± 0.559
AspVal: 4.236 ± 0.565
AspTrp: 0.616 ± 0.174
AspTyr: 2.696 ± 0.509
AspXaa: 0.0 ± 0.0

Glu
GluAla: 6.624 ± 0.777
GluCys: 0.847 ± 0.321
GluAsp: 4.929 ± 0.608
GluGlu: 6.007 ± 0.927
GluPhe: 2.311 ± 0.393
GluGly: 3.62 ± 0.542
GluHis: 2.079 ± 0.398
GluIle: 3.158 ± 0.53
GluLys: 3.697 ± 0.504
GluLeu: 6.161 ± 0.662
GluMet: 2.465 ± 0.423
GluAsn: 2.079 ± 0.36
GluPro: 2.696 ± 0.548
GluGln: 4.313 ± 0.697
GluArg: 3.543 ± 0.515
GluSer: 4.082 ± 0.464
GluThr: 3.235 ± 0.45
GluVal: 4.005 ± 0.65
GluTrp: 0.77 ± 0.221
GluTyr: 3.928 ± 0.558
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.771 ± 0.45
PheCys: 0.385 ± 0.209
PheAsp: 2.773 ± 0.501
PheGlu: 1.925 ± 0.37
PhePhe: 1.309 ± 0.331
PheGly: 2.85 ± 0.342
PheHis: 0.462 ± 0.214
PheIle: 2.619 ± 0.402
PheLys: 2.311 ± 0.448
PheLeu: 2.388 ± 0.374
PheMet: 0.77 ± 0.199
PheAsn: 2.85 ± 0.47
PhePro: 0.924 ± 0.225
PheGln: 1.54 ± 0.265
PheArg: 1.54 ± 0.389
PheSer: 1.771 ± 0.354
PheThr: 2.85 ± 0.356
PheVal: 2.157 ± 0.326
PheTrp: 0.462 ± 0.183
PheTyr: 0.77 ± 0.272
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.852 ± 0.652
GlyCys: 0.616 ± 0.242
GlyAsp: 4.698 ± 0.439
GlyGlu: 3.62 ± 0.401
GlyPhe: 1.925 ± 0.348
GlyGly: 4.236 ± 0.706
GlyHis: 1.232 ± 0.276
GlyIle: 4.467 ± 0.527
GlyLys: 5.776 ± 0.694
GlyLeu: 4.698 ± 0.72
GlyMet: 2.002 ± 0.425
GlyAsn: 3.774 ± 0.425
GlyPro: 1.54 ± 0.243
GlyGln: 1.54 ± 0.371
GlyArg: 4.159 ± 0.534
GlySer: 4.544 ± 0.451
GlyThr: 4.39 ± 0.549
GlyVal: 5.237 ± 0.658
GlyTrp: 0.847 ± 0.226
GlyTyr: 2.542 ± 0.458
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.001 ± 0.303
HisCys: 0.385 ± 0.15
HisAsp: 1.001 ± 0.257
HisGlu: 1.155 ± 0.363
HisPhe: 0.77 ± 0.174
HisGly: 1.771 ± 0.429
HisHis: 0.539 ± 0.231
HisIle: 0.616 ± 0.203
HisLys: 1.155 ± 0.388
HisLeu: 1.463 ± 0.368
HisMet: 0.693 ± 0.217
HisAsn: 1.386 ± 0.32
HisPro: 0.462 ± 0.201
HisGln: 0.77 ± 0.179
HisArg: 1.078 ± 0.29
HisSer: 0.924 ± 0.245
HisThr: 0.693 ± 0.187
HisVal: 1.155 ± 0.28
HisTrp: 0.308 ± 0.18
HisTyr: 1.078 ± 0.29
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.159 ± 0.537
IleCys: 0.308 ± 0.171
IleAsp: 3.004 ± 0.451
IleGlu: 3.774 ± 0.691
IlePhe: 1.155 ± 0.241
IleGly: 4.544 ± 0.664
IleHis: 1.463 ± 0.351
IleIle: 2.619 ± 0.658
IleLys: 4.544 ± 0.505
IleLeu: 4.236 ± 0.709
IleMet: 1.309 ± 0.297
IleAsn: 3.466 ± 0.594
IlePro: 2.619 ± 0.468
IleGln: 2.465 ± 0.434
IleArg: 3.158 ± 0.402
IleSer: 3.62 ± 0.41
IleThr: 2.85 ± 0.595
IleVal: 3.389 ± 0.692
IleTrp: 1.001 ± 0.296
IleTyr: 2.619 ± 0.444
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.547 ± 0.886
LysCys: 0.693 ± 0.285
LysAsp: 5.314 ± 0.453
LysGlu: 6.238 ± 0.722
LysPhe: 2.388 ± 0.395
LysGly: 4.698 ± 0.67
LysHis: 1.309 ± 0.274
LysIle: 2.696 ± 0.487
LysLys: 3.081 ± 0.448
LysLeu: 6.701 ± 0.716
LysMet: 1.617 ± 0.305
LysAsn: 2.234 ± 0.392
LysPro: 3.466 ± 0.532
LysGln: 4.082 ± 0.694
LysArg: 4.39 ± 0.655
LysSer: 2.542 ± 0.445
LysThr: 2.85 ± 0.549
LysVal: 5.006 ± 0.587
LysTrp: 0.693 ± 0.227
LysTyr: 2.927 ± 0.435
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.392 ± 0.95
LeuCys: 1.001 ± 0.284
LeuAsp: 5.391 ± 0.654
LeuGlu: 6.778 ± 0.763
LeuPhe: 2.079 ± 0.381
LeuGly: 5.083 ± 0.598
LeuHis: 1.925 ± 0.396
LeuIle: 5.083 ± 0.568
LeuLys: 4.852 ± 0.735
LeuLeu: 4.852 ± 0.623
LeuMet: 2.388 ± 0.431
LeuAsn: 4.082 ± 0.584
LeuPro: 4.082 ± 0.646
LeuGln: 4.005 ± 0.414
LeuArg: 4.005 ± 0.603
LeuSer: 5.468 ± 0.555
LeuThr: 5.622 ± 0.571
LeuVal: 5.006 ± 0.754
LeuTrp: 1.309 ± 0.266
LeuTyr: 3.466 ± 0.473
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.773 ± 0.45
MetCys: 0.385 ± 0.157
MetAsp: 1.54 ± 0.358
MetGlu: 1.386 ± 0.275
MetPhe: 1.617 ± 0.29
MetGly: 1.463 ± 0.406
MetHis: 0.462 ± 0.159
MetIle: 1.771 ± 0.381
MetLys: 1.848 ± 0.449
MetLeu: 3.158 ± 0.558
MetMet: 0.693 ± 0.22
MetAsn: 1.155 ± 0.28
MetPro: 0.924 ± 0.28
MetGln: 0.847 ± 0.255
MetArg: 1.771 ± 0.303
MetSer: 2.311 ± 0.455
MetThr: 1.155 ± 0.367
MetVal: 1.386 ± 0.399
MetTrp: 0.385 ± 0.163
MetTyr: 0.693 ± 0.181
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.389 ± 0.616
AsnCys: 0.77 ± 0.277
AsnAsp: 3.312 ± 0.531
AsnGlu: 3.389 ± 0.577
AsnPhe: 1.617 ± 0.32
AsnGly: 5.006 ± 0.744
AsnHis: 1.001 ± 0.25
AsnIle: 3.389 ± 0.603
AsnLys: 3.774 ± 0.503
AsnLeu: 3.312 ± 0.445
AsnMet: 1.54 ± 0.329
AsnAsn: 3.928 ± 0.775
AsnPro: 2.157 ± 0.345
AsnGln: 1.155 ± 0.258
AsnArg: 2.234 ± 0.377
AsnSer: 3.158 ± 0.451
AsnThr: 2.85 ± 0.444
AsnVal: 2.927 ± 0.433
AsnTrp: 0.77 ± 0.224
AsnTyr: 2.465 ± 0.352
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.925 ± 0.556
ProCys: 0.077 ± 0.073
ProAsp: 2.234 ± 0.368
ProGlu: 3.312 ± 0.503
ProPhe: 1.001 ± 0.266
ProGly: 1.386 ± 0.276
ProHis: 0.385 ± 0.159
ProIle: 2.157 ± 0.391
ProLys: 2.773 ± 0.445
ProLeu: 3.466 ± 0.434
ProMet: 0.385 ± 0.22
ProAsn: 2.157 ± 0.357
ProPro: 1.001 ± 0.306
ProGln: 2.465 ± 0.376
ProArg: 1.694 ± 0.357
ProSer: 2.388 ± 0.463
ProThr: 2.157 ± 0.285
ProVal: 3.851 ± 0.436
ProTrp: 0.231 ± 0.12
ProTyr: 2.002 ± 0.358
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 4.159 ± 0.705
GlnCys: 0.616 ± 0.3
GlnAsp: 1.54 ± 0.35
GlnGlu: 3.543 ± 0.697
GlnPhe: 1.617 ± 0.332
GlnGly: 3.081 ± 0.495
GlnHis: 0.693 ± 0.199
GlnIle: 1.617 ± 0.406
GlnLys: 2.85 ± 0.45
GlnLeu: 4.005 ± 0.63
GlnMet: 1.54 ± 0.378
GlnAsn: 1.771 ± 0.398
GlnPro: 1.309 ± 0.267
GlnGln: 3.081 ± 0.619
GlnArg: 2.079 ± 0.274
GlnSer: 2.619 ± 0.486
GlnThr: 1.848 ± 0.325
GlnVal: 2.773 ± 0.431
GlnTrp: 0.462 ± 0.149
GlnTyr: 1.694 ± 0.335
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.082 ± 0.709
ArgCys: 0.539 ± 0.248
ArgAsp: 3.081 ± 0.586
ArgGlu: 4.236 ± 0.58
ArgPhe: 2.157 ± 0.431
ArgGly: 3.389 ± 0.5
ArgHis: 1.309 ± 0.231
ArgIle: 2.927 ± 0.514
ArgLys: 4.082 ± 0.631
ArgLeu: 5.622 ± 0.655
ArgMet: 1.694 ± 0.364
ArgAsn: 2.927 ± 0.397
ArgPro: 1.54 ± 0.34
ArgGln: 2.079 ± 0.402
ArgArg: 2.927 ± 0.432
ArgSer: 1.925 ± 0.407
ArgThr: 3.004 ± 0.459
ArgVal: 3.158 ± 0.446
ArgTrp: 0.924 ± 0.337
ArgTyr: 2.388 ± 0.399
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.698 ± 0.558
SerCys: 0.77 ± 0.29
SerAsp: 3.312 ± 0.494
SerGlu: 3.928 ± 0.521
SerPhe: 1.386 ± 0.264
SerGly: 4.005 ± 0.598
SerHis: 0.693 ± 0.186
SerIle: 4.082 ± 0.579
SerLys: 4.852 ± 0.589
SerLeu: 4.236 ± 0.729
SerMet: 1.925 ± 0.445
SerAsn: 2.85 ± 0.52
SerPro: 2.465 ± 0.42
SerGln: 2.542 ± 0.427
SerArg: 3.235 ± 0.406
SerSer: 4.005 ± 0.652
SerThr: 4.159 ± 0.552
SerVal: 4.159 ± 0.534
SerTrp: 0.847 ± 0.277
SerTyr: 2.157 ± 0.347
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.159 ± 0.638
ThrCys: 0.231 ± 0.146
ThrAsp: 3.158 ± 0.487
ThrGlu: 3.928 ± 0.469
ThrPhe: 2.311 ± 0.475
ThrGly: 4.159 ± 0.635
ThrHis: 1.155 ± 0.315
ThrIle: 3.543 ± 0.787
ThrLys: 3.851 ± 0.512
ThrLeu: 4.621 ± 0.572
ThrMet: 1.232 ± 0.245
ThrAsn: 2.619 ± 0.55
ThrPro: 2.927 ± 0.408
ThrGln: 2.079 ± 0.339
ThrArg: 2.773 ± 0.361
ThrSer: 3.62 ± 0.658
ThrThr: 3.312 ± 0.548
ThrVal: 4.621 ± 0.527
ThrTrp: 0.77 ± 0.245
ThrTyr: 2.311 ± 0.393
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.929 ± 0.525
ValCys: 1.309 ± 0.313
ValAsp: 3.158 ± 0.538
ValGlu: 4.467 ± 0.585
ValPhe: 2.465 ± 0.375
ValGly: 4.082 ± 0.558
ValHis: 1.078 ± 0.325
ValIle: 3.774 ± 0.629
ValLys: 4.698 ± 0.545
ValLeu: 5.622 ± 0.635
ValMet: 1.54 ± 0.39
ValAsn: 3.389 ± 0.421
ValPro: 2.465 ± 0.408
ValGln: 2.079 ± 0.401
ValArg: 3.928 ± 0.492
ValSer: 5.083 ± 0.52
ValThr: 4.236 ± 0.721
ValVal: 5.699 ± 0.764
ValTrp: 0.231 ± 0.106
ValTyr: 3.543 ± 0.729
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.616 ± 0.186
TrpCys: 0.154 ± 0.114
TrpAsp: 0.847 ± 0.201
TrpGlu: 0.924 ± 0.269
TrpPhe: 1.078 ± 0.212
TrpGly: 0.539 ± 0.209
TrpHis: 0.154 ± 0.094
TrpIle: 0.308 ± 0.119
TrpLys: 0.77 ± 0.243
TrpLeu: 1.617 ± 0.352
TrpMet: 0.0 ± 0.0
TrpAsn: 0.616 ± 0.212
TrpPro: 0.385 ± 0.154
TrpGln: 0.462 ± 0.205
TrpArg: 0.693 ± 0.21
TrpSer: 0.847 ± 0.182
TrpThr: 0.539 ± 0.212
TrpVal: 0.847 ± 0.298
TrpTrp: 0.462 ± 0.195
TrpTyr: 0.385 ± 0.159
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.158 ± 0.485
TyrCys: 0.616 ± 0.164
TyrAsp: 2.002 ± 0.422
TyrGlu: 2.773 ± 0.42
TyrPhe: 1.848 ± 0.291
TyrGly: 2.619 ± 0.514
TyrHis: 0.462 ± 0.193
TyrIle: 2.002 ± 0.46
TyrLys: 3.697 ± 0.627
TyrLeu: 3.389 ± 0.491
TyrMet: 1.155 ± 0.338
TyrAsn: 3.004 ± 0.486
TyrPro: 1.617 ± 0.265
TyrGln: 1.771 ± 0.381
TyrArg: 2.927 ± 0.463
TyrSer: 3.004 ± 0.569
TyrThr: 2.465 ± 0.561
TyrVal: 2.388 ± 0.378
TyrTrp: 0.693 ± 0.238
TyrTyr: 2.002 ± 0.413
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 53 proteins (12985 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
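
A minimal sketch of a protein-level bootstrap follows. It assumes the reported error is the standard deviation of the dipeptide frequency across proteomes resampled with replacement; the page does not state the exact estimator, so this is an illustration only. The names `pair_frequency` and `bootstrap_error`, the fixed seed, and the toy sequences are hypothetical.

```python
import random
from statistics import pstdev


def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide over all overlapping pairs."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample at the protein level: draw as many proteins as the proteome has
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return pstdev(estimates)


# Hypothetical toy proteome (one-letter codes), not the phage proteome
toy = ["MKALAL", "ALKA", "GGGG"]
value = pair_frequency(toy, "AL")
error = bootstrap_error(toy, "AL")
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the spread reflects variation between proteins.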

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski