Amino acid dipeptide frequency for Staphylococcus phage phiSP44-1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
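As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping adjacent pairs within each protein and reports them in per mille. The function names, the one-letter input sequences, and the choice to normalize by the total number of counted pairs are assumptions made for this example; the database may use a different procedure or denominator.

```python
from collections import Counter

# Standard one-letter to three-letter codes; anything else is treated as Xaa.
THREE_LETTER = {
    "A": "Ala", "C": "Cys", "D": "Asp", "E": "Glu", "F": "Phe",
    "G": "Gly", "H": "His", "I": "Ile", "K": "Lys", "L": "Leu",
    "M": "Met", "N": "Asn", "P": "Pro", "Q": "Gln", "R": "Arg",
    "S": "Ser", "T": "Thr", "V": "Val", "W": "Trp", "Y": "Tyr",
}


def to_three(residue):
    """Map a one-letter residue to its three-letter code (Xaa if unknown)."""
    return THREE_LETTER.get(residue, "Xaa")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: pairs are counted within each protein only (never across
    protein boundaries) and normalized by the total number of pairs.
    """
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[to_three(first) + to_three(second)] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real phage proteins): pairs are MK, KK, KL, AK, KK.
freqs = dipeptide_per_mille(["MKKL", "AKK"])
print(freqs["LysLys"])  # 400.0
```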

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
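The expected value under the equal-likelihood assumption is simple arithmetic, sketched below. The function names are hypothetical, and which baseline (400 or 441 dipeptides) the table's colouring actually uses is not stated here, so the alphabet size is left as a parameter.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Per-mille frequency of each dipeptide if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def representation(observed_per_mille, alphabet_size=20):
    """Label a value as over- or underrepresented relative to the uniform baseline."""
    expected = uniform_expectation_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_per_mille(20))            # 2.5
print(round(uniform_expectation_per_mille(21), 3))  # 2.268
print(representation(8.042))  # LeuLys from the table: over
print(representation(0.306))  # AlaCys from the table: under
```

Values close to the baseline can flip category depending on the alphabet size: AlaAla at 2.451‰ is below 2.5‰ but above 2.268‰.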

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
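For readers who prefer fractions or percentages, the conversion can be sketched as follows (the helper names are made up for this example):

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (divide by 10)."""
    return value / 10.0


# The uniform baseline of 2.5 per mille corresponds to 0.25 percent.
print(per_mille_to_percent(2.5))  # 0.25
```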

For more information, see the sequence space article on Wikipedia.

Column order (second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.451 ± 0.393
AlaCys: 0.306 ± 0.131
AlaAsp: 3.6 ± 0.547
AlaGlu: 4.825 ± 0.555
AlaPhe: 2.681 ± 0.393
AlaGly: 2.681 ± 0.577
AlaHis: 1.072 ± 0.238
AlaIle: 6.893 ± 1.034
AlaLys: 5.438 ± 0.737
AlaLeu: 5.208 ± 0.641
AlaMet: 1.149 ± 0.342
AlaAsn: 4.365 ± 0.594
AlaPro: 1.225 ± 0.275
AlaGln: 2.144 ± 0.405
AlaArg: 2.374 ± 0.351
AlaSer: 2.221 ± 0.558
AlaThr: 4.672 ± 0.561
AlaVal: 4.059 ± 0.628
AlaTrp: 0.613 ± 0.209
AlaTyr: 1.838 ± 0.333
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.153 ± 0.119
CysCys: 0.0 ± 0.0
CysAsp: 0.23 ± 0.114
CysGlu: 0.077 ± 0.075
CysPhe: 0.077 ± 0.073
CysGly: 0.153 ± 0.106
CysHis: 0.306 ± 0.145
CysIle: 0.153 ± 0.092
CysLys: 0.383 ± 0.158
CysLeu: 0.077 ± 0.073
CysMet: 0.23 ± 0.136
CysAsn: 0.46 ± 0.182
CysPro: 0.0 ± 0.0
CysGln: 0.23 ± 0.123
CysArg: 0.383 ± 0.176
CysSer: 0.306 ± 0.155
CysThr: 0.23 ± 0.135
CysVal: 0.23 ± 0.127
CysTrp: 0.23 ± 0.119
CysTyr: 0.077 ± 0.082
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.676 ± 0.465
AspCys: 0.077 ± 0.087
AspAsp: 5.514 ± 0.886
AspGlu: 6.587 ± 0.732
AspPhe: 3.829 ± 0.573
AspGly: 3.753 ± 0.501
AspHis: 0.766 ± 0.23
AspIle: 4.902 ± 0.665
AspLys: 5.285 ± 0.757
AspLeu: 5.591 ± 0.778
AspMet: 1.455 ± 0.333
AspAsn: 3.676 ± 0.439
AspPro: 1.379 ± 0.273
AspGln: 1.072 ± 0.288
AspArg: 1.991 ± 0.383
AspSer: 4.059 ± 0.596
AspThr: 3.37 ± 0.567
AspVal: 4.442 ± 0.48
AspTrp: 0.383 ± 0.151
AspTyr: 3.446 ± 0.516
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.438 ± 0.661
GluCys: 0.689 ± 0.218
GluAsp: 3.523 ± 0.564
GluGlu: 7.276 ± 0.958
GluPhe: 3.983 ± 0.445
GluGly: 3.6 ± 0.557
GluHis: 1.302 ± 0.34
GluIle: 4.672 ± 0.72
GluLys: 5.208 ± 0.703
GluLeu: 7.123 ± 0.842
GluMet: 2.987 ± 0.449
GluAsn: 4.212 ± 0.758
GluPro: 1.379 ± 0.343
GluGln: 3.217 ± 0.5
GluArg: 4.442 ± 0.627
GluSer: 4.212 ± 0.616
GluThr: 2.757 ± 0.45
GluVal: 5.208 ± 0.701
GluTrp: 0.996 ± 0.326
GluTyr: 2.91 ± 0.47
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.374 ± 0.485
PheCys: 0.23 ± 0.11
PheAsp: 2.757 ± 0.535
PheGlu: 3.6 ± 0.599
PhePhe: 1.838 ± 0.341
PheGly: 3.14 ± 0.472
PheHis: 0.46 ± 0.185
PheIle: 3.14 ± 0.402
PheLys: 4.825 ± 0.515
PheLeu: 2.374 ± 0.594
PheMet: 1.149 ± 0.291
PheAsn: 2.987 ± 0.45
PhePro: 1.149 ± 0.339
PheGln: 1.072 ± 0.282
PheArg: 1.608 ± 0.302
PheSer: 2.221 ± 0.608
PheThr: 2.298 ± 0.404
PheVal: 3.983 ± 0.548
PheTrp: 0.536 ± 0.19
PheTyr: 1.685 ± 0.309
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.834 ± 0.707
GlyCys: 0.306 ± 0.15
GlyAsp: 2.374 ± 0.433
GlyGlu: 3.14 ± 0.534
GlyPhe: 2.834 ± 0.465
GlyGly: 2.834 ± 0.45
GlyHis: 0.842 ± 0.243
GlyIle: 4.059 ± 0.587
GlyLys: 4.902 ± 0.71
GlyLeu: 5.208 ± 0.783
GlyMet: 1.149 ± 0.314
GlyAsn: 3.14 ± 0.422
GlyPro: 0.689 ± 0.322
GlyGln: 1.991 ± 0.421
GlyArg: 2.298 ± 0.418
GlySer: 2.987 ± 0.394
GlyThr: 3.217 ± 0.386
GlyVal: 4.902 ± 0.651
GlyTrp: 0.383 ± 0.162
GlyTyr: 3.217 ± 0.526
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.613 ± 0.232
HisCys: 0.153 ± 0.096
HisAsp: 1.302 ± 0.296
HisGlu: 0.919 ± 0.279
HisPhe: 0.842 ± 0.252
HisGly: 1.149 ± 0.303
HisHis: 0.536 ± 0.204
HisIle: 1.608 ± 0.425
HisLys: 1.991 ± 0.472
HisLeu: 1.608 ± 0.343
HisMet: 0.23 ± 0.117
HisAsn: 1.072 ± 0.288
HisPro: 0.613 ± 0.196
HisGln: 1.455 ± 0.375
HisArg: 0.46 ± 0.201
HisSer: 1.072 ± 0.21
HisThr: 1.072 ± 0.305
HisVal: 0.842 ± 0.254
HisTrp: 0.153 ± 0.098
HisTyr: 1.225 ± 0.305
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.672 ± 0.759
IleCys: 0.23 ± 0.128
IleAsp: 6.51 ± 0.724
IleGlu: 5.514 ± 0.756
IlePhe: 2.604 ± 0.415
IleGly: 3.983 ± 0.831
IleHis: 1.455 ± 0.315
IleIle: 4.595 ± 0.561
IleLys: 7.046 ± 0.67
IleLeu: 4.365 ± 0.555
IleMet: 1.225 ± 0.385
IleAsn: 5.208 ± 0.597
IlePro: 2.834 ± 0.415
IleGln: 2.298 ± 0.46
IleArg: 2.91 ± 0.572
IleSer: 4.442 ± 0.49
IleThr: 4.059 ± 0.588
IleVal: 5.285 ± 0.587
IleTrp: 0.613 ± 0.304
IleTyr: 3.829 ± 0.668
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.28 ± 0.735
LysCys: 0.23 ± 0.135
LysAsp: 5.514 ± 0.732
LysGlu: 6.05 ± 0.96
LysPhe: 3.523 ± 0.567
LysGly: 4.672 ± 0.537
LysHis: 1.685 ± 0.372
LysIle: 6.51 ± 0.704
LysLys: 7.582 ± 1.088
LysLeu: 7.582 ± 0.704
LysMet: 2.374 ± 0.367
LysAsn: 4.289 ± 0.578
LysPro: 3.753 ± 1.006
LysGln: 5.285 ± 0.432
LysArg: 3.753 ± 0.597
LysSer: 5.285 ± 0.634
LysThr: 6.357 ± 0.753
LysVal: 5.131 ± 0.531
LysTrp: 0.919 ± 0.249
LysTyr: 3.983 ± 0.626
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.825 ± 0.597
LeuCys: 0.383 ± 0.198
LeuAsp: 6.204 ± 0.666
LeuGlu: 6.433 ± 0.748
LeuPhe: 3.293 ± 0.441
LeuGly: 3.753 ± 0.711
LeuHis: 1.225 ± 0.285
LeuIle: 5.897 ± 0.731
LeuLys: 8.042 ± 0.912
LeuLeu: 6.433 ± 0.77
LeuMet: 2.451 ± 0.437
LeuAsn: 5.744 ± 0.755
LeuPro: 2.144 ± 0.445
LeuGln: 2.91 ± 0.421
LeuArg: 3.217 ± 0.566
LeuSer: 5.667 ± 0.878
LeuThr: 5.361 ± 0.649
LeuVal: 3.983 ± 0.578
LeuTrp: 0.919 ± 0.394
LeuTyr: 1.991 ± 0.38
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.608 ± 0.323
MetCys: 0.0 ± 0.0
MetAsp: 1.991 ± 0.366
MetGlu: 2.221 ± 0.501
MetPhe: 0.919 ± 0.268
MetGly: 1.072 ± 0.287
MetHis: 0.383 ± 0.139
MetIle: 2.298 ± 0.413
MetLys: 1.762 ± 0.308
MetLeu: 1.915 ± 0.312
MetMet: 0.766 ± 0.208
MetAsn: 1.379 ± 0.392
MetPro: 0.689 ± 0.217
MetGln: 0.766 ± 0.317
MetArg: 1.149 ± 0.267
MetSer: 1.685 ± 0.355
MetThr: 1.379 ± 0.318
MetVal: 0.766 ± 0.232
MetTrp: 0.383 ± 0.144
MetTyr: 1.149 ± 0.297
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.519 ± 0.759
AsnCys: 0.306 ± 0.168
AsnAsp: 4.519 ± 0.775
AsnGlu: 4.902 ± 0.664
AsnPhe: 1.991 ± 0.389
AsnGly: 4.902 ± 0.718
AsnHis: 0.919 ± 0.23
AsnIle: 3.37 ± 0.43
AsnLys: 6.433 ± 0.831
AsnLeu: 4.289 ± 0.721
AsnMet: 1.838 ± 0.358
AsnAsn: 4.212 ± 0.845
AsnPro: 2.144 ± 0.383
AsnGln: 1.685 ± 0.335
AsnArg: 2.374 ± 0.353
AsnSer: 4.672 ± 0.533
AsnThr: 3.676 ± 0.574
AsnVal: 4.672 ± 0.573
AsnTrp: 0.842 ± 0.248
AsnTyr: 2.681 ± 0.552
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.302 ± 0.339
ProCys: 0.153 ± 0.102
ProAsp: 1.302 ± 0.305
ProGlu: 1.532 ± 0.429
ProPhe: 1.072 ± 0.306
ProGly: 0.766 ± 0.25
ProHis: 0.689 ± 0.233
ProIle: 2.144 ± 0.503
ProLys: 3.523 ± 0.827
ProLeu: 1.455 ± 0.313
ProMet: 0.766 ± 0.267
ProAsn: 2.374 ± 0.414
ProPro: 0.842 ± 0.387
ProGln: 1.379 ± 0.434
ProArg: 1.225 ± 0.289
ProSer: 2.144 ± 0.379
ProThr: 1.762 ± 0.364
ProVal: 1.532 ± 0.358
ProTrp: 0.153 ± 0.103
ProTyr: 0.996 ± 0.254
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.757 ± 0.393
GlnCys: 0.306 ± 0.161
GlnAsp: 2.068 ± 0.296
GlnGlu: 2.221 ± 0.583
GlnPhe: 1.915 ± 0.342
GlnGly: 1.532 ± 0.325
GlnHis: 0.996 ± 0.259
GlnIle: 2.374 ± 0.386
GlnLys: 3.37 ± 0.528
GlnLeu: 3.446 ± 0.598
GlnMet: 0.306 ± 0.14
GlnAsn: 3.983 ± 0.666
GlnPro: 0.766 ± 0.406
GlnGln: 2.221 ± 0.738
GlnArg: 1.608 ± 0.322
GlnSer: 1.608 ± 0.33
GlnThr: 2.681 ± 0.522
GlnVal: 2.374 ± 0.433
GlnTrp: 0.153 ± 0.096
GlnTyr: 1.379 ± 0.351
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.068 ± 0.391
ArgCys: 0.077 ± 0.084
ArgAsp: 1.991 ± 0.449
ArgGlu: 2.451 ± 0.526
ArgPhe: 1.915 ± 0.355
ArgGly: 2.298 ± 0.465
ArgHis: 1.379 ± 0.378
ArgIle: 3.217 ± 0.428
ArgLys: 3.6 ± 0.505
ArgLeu: 4.902 ± 0.606
ArgMet: 1.302 ± 0.268
ArgAsn: 2.144 ± 0.384
ArgPro: 1.072 ± 0.382
ArgGln: 1.608 ± 0.373
ArgArg: 1.072 ± 0.257
ArgSer: 2.451 ± 0.452
ArgThr: 2.298 ± 0.407
ArgVal: 2.298 ± 0.324
ArgTrp: 0.383 ± 0.183
ArgTyr: 2.681 ± 0.456
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.6 ± 0.671
SerCys: 0.153 ± 0.106
SerAsp: 3.6 ± 0.599
SerGlu: 4.825 ± 0.747
SerPhe: 2.374 ± 0.589
SerGly: 3.753 ± 0.539
SerHis: 1.379 ± 0.372
SerIle: 5.361 ± 0.736
SerLys: 5.667 ± 0.593
SerLeu: 5.821 ± 0.709
SerMet: 0.996 ± 0.273
SerAsn: 4.289 ± 0.518
SerPro: 1.149 ± 0.301
SerGln: 1.532 ± 0.345
SerArg: 3.293 ± 0.499
SerSer: 2.527 ± 0.655
SerThr: 3.37 ± 0.559
SerVal: 4.212 ± 0.507
SerTrp: 0.536 ± 0.256
SerTyr: 1.915 ± 0.408
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.293 ± 0.476
ThrCys: 0.0 ± 0.0
ThrAsp: 3.6 ± 0.546
ThrGlu: 3.829 ± 0.575
ThrPhe: 2.374 ± 0.448
ThrGly: 3.293 ± 0.539
ThrHis: 1.149 ± 0.235
ThrIle: 5.055 ± 0.697
ThrLys: 5.821 ± 0.797
ThrLeu: 5.361 ± 0.635
ThrMet: 1.302 ± 0.326
ThrAsn: 3.37 ± 0.575
ThrPro: 1.762 ± 0.322
ThrGln: 2.144 ± 0.394
ThrArg: 2.221 ± 0.434
ThrSer: 3.523 ± 0.524
ThrThr: 3.906 ± 0.809
ThrVal: 4.519 ± 0.896
ThrTrp: 0.46 ± 0.315
ThrTyr: 2.527 ± 0.431
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.059 ± 0.594
ValCys: 0.23 ± 0.123
ValAsp: 4.978 ± 0.471
ValGlu: 5.591 ± 0.757
ValPhe: 2.604 ± 0.522
ValGly: 3.14 ± 0.548
ValHis: 1.149 ± 0.281
ValIle: 4.519 ± 0.542
ValLys: 5.055 ± 0.567
ValLeu: 5.131 ± 0.595
ValMet: 1.379 ± 0.294
ValAsn: 4.059 ± 0.563
ValPro: 2.298 ± 0.397
ValGln: 2.221 ± 0.322
ValArg: 1.915 ± 0.29
ValSer: 5.514 ± 0.638
ValThr: 3.983 ± 0.693
ValVal: 5.055 ± 0.695
ValTrp: 0.689 ± 0.152
ValTyr: 3.063 ± 0.554
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.996 ± 0.299
TrpCys: 0.077 ± 0.077
TrpAsp: 0.383 ± 0.144
TrpGlu: 0.23 ± 0.152
TrpPhe: 0.613 ± 0.253
TrpGly: 0.46 ± 0.203
TrpHis: 0.077 ± 0.077
TrpIle: 0.383 ± 0.152
TrpLys: 0.306 ± 0.147
TrpLeu: 0.766 ± 0.255
TrpMet: 0.153 ± 0.101
TrpAsn: 1.225 ± 0.6
TrpPro: 0.077 ± 0.069
TrpGln: 0.383 ± 0.245
TrpArg: 0.46 ± 0.153
TrpSer: 1.455 ± 0.307
TrpThr: 0.766 ± 0.217
TrpVal: 0.766 ± 0.253
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.46 ± 0.19
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.298 ± 0.357
TyrCys: 0.077 ± 0.074
TyrAsp: 3.293 ± 0.564
TyrGlu: 2.91 ± 0.404
TyrPhe: 2.221 ± 0.547
TyrGly: 2.374 ± 0.56
TyrHis: 1.149 ± 0.328
TyrIle: 2.527 ± 0.483
TyrLys: 4.519 ± 0.574
TyrLeu: 2.221 ± 0.42
TyrMet: 0.996 ± 0.313
TyrAsn: 2.757 ± 0.42
TyrPro: 1.149 ± 0.271
TyrGln: 2.451 ± 0.664
TyrArg: 2.451 ± 0.491
TyrSer: 2.298 ± 0.456
TyrThr: 2.298 ± 0.352
TyrVal: 2.374 ± 0.388
TyrTrp: 0.689 ± 0.254
TyrTyr: 1.685 ± 0.49
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (13058 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
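A protein-level bootstrap of this kind might look like the following sketch: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicate values is reported. The function names, the fixed seed, the one-letter dipeptide keys, and the use of the sample standard deviation as the error are assumptions for illustration; the exact procedure behind the table is not described here.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins):
    """Per-mille frequency of each adjacent residue pair (one-letter codes)."""
    counts = Counter()
    for seq in proteins:
        for first, second in zip(seq, seq[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Estimate the error of one dipeptide frequency by resampling proteins.

    Each replicate draws len(proteins) proteins with replacement, so whole
    proteins (not individual residues) are the resampling unit.
    """
    rng = random.Random(seed)  # fixed seed keeps the sketch reproducible
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample).get(pair, 0.0))
    return statistics.stdev(estimates)


# Toy sequences, not the 66 proteins of this proteome.
toy_proteins = ["MKKLAA", "AKKLE", "MLLKKA", "EEKLA"]
print(bootstrap_error(toy_proteins, "KK"))
```

A dipeptide that never occurs has the same value (zero) in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.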

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski