Amino acid dipepetide frequency for Staphylococcus phage 66

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.515AlaAla: 0.515 ± 0.56
0.172AlaCys: 0.172 ± 0.235
1.889AlaAsp: 1.889 ± 0.468
2.06AlaGlu: 2.06 ± 0.713
1.717AlaPhe: 1.717 ± 0.428
2.919AlaGly: 2.919 ± 0.871
0.343AlaHis: 0.343 ± 0.226
2.747AlaIle: 2.747 ± 0.559
4.293AlaLys: 4.293 ± 0.896
3.606AlaLeu: 3.606 ± 0.629
1.03AlaMet: 1.03 ± 0.405
2.06AlaAsn: 2.06 ± 0.531
1.202AlaPro: 1.202 ± 0.447
0.687AlaGln: 0.687 ± 0.34
2.06AlaArg: 2.06 ± 0.477
1.545AlaSer: 1.545 ± 0.4
2.576AlaThr: 2.576 ± 0.759
2.576AlaVal: 2.576 ± 0.647
0.343AlaTrp: 0.343 ± 0.215
3.606AlaTyr: 3.606 ± 0.743
0.0AlaXaa: 0.0 ± 0.0
Cys
0.343CysAla: 0.343 ± 0.244
0.172CysCys: 0.172 ± 0.194
0.515CysAsp: 0.515 ± 0.305
0.172CysGlu: 0.172 ± 0.138
0.859CysPhe: 0.859 ± 0.333
0.343CysGly: 0.343 ± 0.234
0.172CysHis: 0.172 ± 0.138
0.687CysIle: 0.687 ± 0.327
0.0CysLys: 0.0 ± 0.0
0.687CysLeu: 0.687 ± 0.452
0.859CysMet: 0.859 ± 0.354
0.343CysAsn: 0.343 ± 0.188
0.0CysPro: 0.0 ± 0.0
0.515CysGln: 0.515 ± 0.495
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
0.859CysThr: 0.859 ± 0.291
0.343CysVal: 0.343 ± 0.269
0.0CysTrp: 0.0 ± 0.0
0.172CysTyr: 0.172 ± 0.19
0.0CysXaa: 0.0 ± 0.0
Asp
2.747AspAla: 2.747 ± 0.612
0.343AspCys: 0.343 ± 0.193
6.868AspAsp: 6.868 ± 1.205
5.323AspGlu: 5.323 ± 1.137
3.606AspPhe: 3.606 ± 0.601
3.777AspGly: 3.777 ± 1.487
0.859AspHis: 0.859 ± 0.252
7.383AspIle: 7.383 ± 1.641
5.323AspLys: 5.323 ± 1.018
6.525AspLeu: 6.525 ± 0.89
1.03AspMet: 1.03 ± 0.572
6.696AspAsn: 6.696 ± 1.234
0.859AspPro: 0.859 ± 0.392
1.374AspGln: 1.374 ± 0.396
1.889AspArg: 1.889 ± 0.595
3.777AspSer: 3.777 ± 1.113
4.293AspThr: 4.293 ± 1.074
3.949AspVal: 3.949 ± 0.584
0.343AspTrp: 0.343 ± 0.204
5.495AspTyr: 5.495 ± 1.011
0.0AspXaa: 0.0 ± 0.0
Glu
2.576GluAla: 2.576 ± 0.804
0.515GluCys: 0.515 ± 0.316
3.091GluAsp: 3.091 ± 0.982
4.636GluGlu: 4.636 ± 1.76
3.777GluPhe: 3.777 ± 0.972
1.202GluGly: 1.202 ± 0.415
1.889GluHis: 1.889 ± 0.595
5.151GluIle: 5.151 ± 0.902
4.293GluLys: 4.293 ± 0.82
4.808GluLeu: 4.808 ± 0.855
3.606GluMet: 3.606 ± 0.778
4.293GluAsn: 4.293 ± 0.916
1.545GluPro: 1.545 ± 0.461
2.919GluGln: 2.919 ± 0.758
2.576GluArg: 2.576 ± 0.622
4.636GluSer: 4.636 ± 1.323
3.091GluThr: 3.091 ± 0.956
2.747GluVal: 2.747 ± 0.635
0.515GluTrp: 0.515 ± 0.259
3.949GluTyr: 3.949 ± 0.85
0.0GluXaa: 0.0 ± 0.0
Phe
1.202PheAla: 1.202 ± 0.449
0.343PheCys: 0.343 ± 0.23
4.121PheAsp: 4.121 ± 1.085
2.576PheGlu: 2.576 ± 0.612
1.717PhePhe: 1.717 ± 0.475
1.889PheGly: 1.889 ± 0.575
1.03PheHis: 1.03 ± 0.351
3.606PheIle: 3.606 ± 0.586
4.293PheLys: 4.293 ± 1.087
5.151PheLeu: 5.151 ± 0.802
1.202PheMet: 1.202 ± 0.428
4.636PheAsn: 4.636 ± 0.999
1.545PhePro: 1.545 ± 0.441
2.576PheGln: 2.576 ± 0.625
1.374PheArg: 1.374 ± 0.409
3.777PheSer: 3.777 ± 1.124
3.949PheThr: 3.949 ± 0.905
3.262PheVal: 3.262 ± 0.669
0.515PheTrp: 0.515 ± 0.28
3.091PheTyr: 3.091 ± 0.721
0.0PheXaa: 0.0 ± 0.0
Gly
1.717GlyAla: 1.717 ± 0.527
0.343GlyCys: 0.343 ± 0.215
3.091GlyAsp: 3.091 ± 0.782
1.717GlyGlu: 1.717 ± 0.978
3.262GlyPhe: 3.262 ± 0.95
3.262GlyGly: 3.262 ± 1.001
1.03GlyHis: 1.03 ± 0.361
3.091GlyIle: 3.091 ± 0.618
4.121GlyLys: 4.121 ± 0.89
3.262GlyLeu: 3.262 ± 0.844
1.374GlyMet: 1.374 ± 0.507
4.636GlyAsn: 4.636 ± 1.255
0.0GlyPro: 0.0 ± 0.0
1.889GlyGln: 1.889 ± 0.72
1.717GlyArg: 1.717 ± 0.56
2.747GlySer: 2.747 ± 1.171
2.576GlyThr: 2.576 ± 0.78
3.606GlyVal: 3.606 ± 0.991
0.687GlyTrp: 0.687 ± 0.251
2.747GlyTyr: 2.747 ± 0.928
0.0GlyXaa: 0.0 ± 0.0
His
0.172HisAla: 0.172 ± 0.138
0.0HisCys: 0.0 ± 0.0
1.545HisAsp: 1.545 ± 0.461
1.202HisGlu: 1.202 ± 0.537
2.232HisPhe: 2.232 ± 0.396
0.859HisGly: 0.859 ± 0.458
0.687HisHis: 0.687 ± 0.318
1.717HisIle: 1.717 ± 0.597
1.889HisLys: 1.889 ± 0.571
1.202HisLeu: 1.202 ± 0.527
0.687HisMet: 0.687 ± 0.332
1.889HisAsn: 1.889 ± 0.509
0.172HisPro: 0.172 ± 0.138
1.03HisGln: 1.03 ± 0.503
0.859HisArg: 0.859 ± 0.382
1.03HisSer: 1.03 ± 0.455
1.374HisThr: 1.374 ± 0.555
1.202HisVal: 1.202 ± 0.448
0.172HisTrp: 0.172 ± 0.165
2.06HisTyr: 2.06 ± 0.547
0.0HisXaa: 0.0 ± 0.0
Ile
3.434IleAla: 3.434 ± 0.596
0.343IleCys: 0.343 ± 0.255
9.272IleAsp: 9.272 ± 1.137
4.979IleGlu: 4.979 ± 1.076
1.545IlePhe: 1.545 ± 0.571
3.091IleGly: 3.091 ± 1.193
1.889IleHis: 1.889 ± 0.471
4.808IleIle: 4.808 ± 1.362
6.353IleLys: 6.353 ± 1.169
5.838IleLeu: 5.838 ± 1.183
1.202IleMet: 1.202 ± 0.455
8.07IleAsn: 8.07 ± 1.216
2.232IlePro: 2.232 ± 0.65
1.717IleGln: 1.717 ± 0.521
1.717IleArg: 1.717 ± 0.551
4.121IleSer: 4.121 ± 0.6
4.293IleThr: 4.293 ± 0.91
3.091IleVal: 3.091 ± 0.551
0.515IleTrp: 0.515 ± 0.354
5.151IleTyr: 5.151 ± 1.206
0.0IleXaa: 0.0 ± 0.0
Lys
2.576LysAla: 2.576 ± 0.808
0.859LysCys: 0.859 ± 0.339
4.979LysAsp: 4.979 ± 0.949
6.868LysGlu: 6.868 ± 1.362
3.777LysPhe: 3.777 ± 0.539
3.949LysGly: 3.949 ± 1.104
2.232LysHis: 2.232 ± 0.645
6.01LysIle: 6.01 ± 0.993
7.383LysLys: 7.383 ± 1.068
7.383LysLeu: 7.383 ± 0.704
2.404LysMet: 2.404 ± 0.497
5.666LysAsn: 5.666 ± 0.924
2.404LysPro: 2.404 ± 0.535
3.091LysGln: 3.091 ± 0.699
4.808LysArg: 4.808 ± 0.999
6.868LysSer: 6.868 ± 0.873
4.636LysThr: 4.636 ± 0.672
3.777LysVal: 3.777 ± 0.754
0.687LysTrp: 0.687 ± 0.28
4.464LysTyr: 4.464 ± 0.932
0.0LysXaa: 0.0 ± 0.0
Leu
4.293LeuAla: 4.293 ± 0.665
0.172LeuCys: 0.172 ± 0.17
4.464LeuAsp: 4.464 ± 0.568
4.636LeuGlu: 4.636 ± 1.039
4.293LeuPhe: 4.293 ± 0.576
3.434LeuGly: 3.434 ± 0.775
1.545LeuHis: 1.545 ± 0.456
5.151LeuIle: 5.151 ± 0.68
8.929LeuLys: 8.929 ± 1.165
6.696LeuLeu: 6.696 ± 1.389
2.232LeuMet: 2.232 ± 0.731
7.555LeuAsn: 7.555 ± 1.059
1.717LeuPro: 1.717 ± 0.603
4.121LeuGln: 4.121 ± 0.901
3.606LeuArg: 3.606 ± 0.639
6.181LeuSer: 6.181 ± 1.212
5.666LeuThr: 5.666 ± 1.058
3.091LeuVal: 3.091 ± 0.65
0.343LeuTrp: 0.343 ± 0.234
4.636LeuTyr: 4.636 ± 1.122
0.0LeuXaa: 0.0 ± 0.0
Met
0.859MetAla: 0.859 ± 0.403
0.172MetCys: 0.172 ± 0.138
1.202MetAsp: 1.202 ± 0.474
1.545MetGlu: 1.545 ± 0.46
1.717MetPhe: 1.717 ± 0.602
0.859MetGly: 0.859 ± 0.286
0.515MetHis: 0.515 ± 0.405
1.717MetIle: 1.717 ± 0.559
3.091MetLys: 3.091 ± 0.669
2.747MetLeu: 2.747 ± 1.283
0.343MetMet: 0.343 ± 0.237
1.717MetAsn: 1.717 ± 0.596
0.172MetPro: 0.172 ± 0.158
2.576MetGln: 2.576 ± 0.65
1.545MetArg: 1.545 ± 0.408
1.374MetSer: 1.374 ± 0.399
2.747MetThr: 2.747 ± 0.61
1.717MetVal: 1.717 ± 0.756
0.343MetTrp: 0.343 ± 0.27
1.202MetTyr: 1.202 ± 0.375
0.0MetXaa: 0.0 ± 0.0
Asn
4.464AsnAla: 4.464 ± 0.936
0.859AsnCys: 0.859 ± 0.387
7.212AsnAsp: 7.212 ± 0.928
6.868AsnGlu: 6.868 ± 0.991
4.121AsnPhe: 4.121 ± 0.959
5.838AsnGly: 5.838 ± 1.062
2.576AsnHis: 2.576 ± 0.746
5.495AsnIle: 5.495 ± 0.995
6.353AsnLys: 6.353 ± 1.123
4.464AsnLeu: 4.464 ± 0.961
2.06AsnMet: 2.06 ± 0.636
5.151AsnAsn: 5.151 ± 0.888
2.404AsnPro: 2.404 ± 0.602
3.606AsnGln: 3.606 ± 0.844
2.404AsnArg: 2.404 ± 0.758
4.979AsnSer: 4.979 ± 0.976
5.495AsnThr: 5.495 ± 0.834
4.808AsnVal: 4.808 ± 0.704
1.202AsnTrp: 1.202 ± 0.691
3.777AsnTyr: 3.777 ± 0.589
0.0AsnXaa: 0.0 ± 0.0
Pro
0.687ProAla: 0.687 ± 0.316
0.172ProCys: 0.172 ± 0.138
1.545ProAsp: 1.545 ± 0.577
1.717ProGlu: 1.717 ± 0.582
1.545ProPhe: 1.545 ± 0.467
0.0ProGly: 0.0 ± 0.0
0.343ProHis: 0.343 ± 0.27
1.889ProIle: 1.889 ± 0.53
2.404ProLys: 2.404 ± 0.607
1.545ProLeu: 1.545 ± 0.511
1.03ProMet: 1.03 ± 0.52
1.202ProAsn: 1.202 ± 0.485
0.515ProPro: 0.515 ± 0.378
1.374ProGln: 1.374 ± 0.484
0.515ProArg: 0.515 ± 0.293
1.717ProSer: 1.717 ± 0.585
2.06ProThr: 2.06 ± 0.819
1.202ProVal: 1.202 ± 0.411
0.343ProTrp: 0.343 ± 0.244
2.06ProTyr: 2.06 ± 0.533
0.0ProXaa: 0.0 ± 0.0
Gln
2.06GlnAla: 2.06 ± 0.64
0.859GlnCys: 0.859 ± 0.502
2.232GlnAsp: 2.232 ± 0.668
1.545GlnGlu: 1.545 ± 0.49
1.374GlnPhe: 1.374 ± 0.583
1.889GlnGly: 1.889 ± 0.463
0.687GlnHis: 0.687 ± 0.364
3.262GlnIle: 3.262 ± 0.641
3.262GlnLys: 3.262 ± 0.776
4.121GlnLeu: 4.121 ± 0.76
1.03GlnMet: 1.03 ± 0.422
4.979GlnAsn: 4.979 ± 0.78
1.545GlnPro: 1.545 ± 0.559
2.747GlnGln: 2.747 ± 1.135
0.515GlnArg: 0.515 ± 0.379
3.091GlnSer: 3.091 ± 0.787
1.202GlnThr: 1.202 ± 0.405
1.889GlnVal: 1.889 ± 0.542
0.859GlnTrp: 0.859 ± 0.525
2.747GlnTyr: 2.747 ± 0.546
0.0GlnXaa: 0.0 ± 0.0
Arg
1.889ArgAla: 1.889 ± 0.784
0.172ArgCys: 0.172 ± 0.19
2.576ArgAsp: 2.576 ± 1.103
2.747ArgGlu: 2.747 ± 0.657
3.091ArgPhe: 3.091 ± 0.766
1.545ArgGly: 1.545 ± 0.538
1.202ArgHis: 1.202 ± 0.591
1.545ArgIle: 1.545 ± 0.572
2.06ArgLys: 2.06 ± 0.443
2.232ArgLeu: 2.232 ± 0.698
1.545ArgMet: 1.545 ± 0.501
2.919ArgAsn: 2.919 ± 0.754
0.687ArgPro: 0.687 ± 0.305
1.889ArgGln: 1.889 ± 0.431
1.03ArgArg: 1.03 ± 0.329
1.545ArgSer: 1.545 ± 0.509
0.859ArgThr: 0.859 ± 0.422
2.576ArgVal: 2.576 ± 0.759
0.343ArgTrp: 0.343 ± 0.214
1.889ArgTyr: 1.889 ± 0.432
0.0ArgXaa: 0.0 ± 0.0
Ser
2.919SerAla: 2.919 ± 0.56
0.172SerCys: 0.172 ± 0.194
4.464SerAsp: 4.464 ± 0.579
4.464SerGlu: 4.464 ± 0.9
3.434SerPhe: 3.434 ± 0.74
3.777SerGly: 3.777 ± 1.444
0.687SerHis: 0.687 ± 0.285
3.949SerIle: 3.949 ± 0.585
7.212SerLys: 7.212 ± 1.119
5.666SerLeu: 5.666 ± 0.983
1.374SerMet: 1.374 ± 0.495
4.979SerAsn: 4.979 ± 1.305
1.717SerPro: 1.717 ± 0.494
2.747SerGln: 2.747 ± 0.587
2.404SerArg: 2.404 ± 0.442
4.293SerSer: 4.293 ± 1.2
2.576SerThr: 2.576 ± 0.989
2.919SerVal: 2.919 ± 0.591
0.343SerTrp: 0.343 ± 0.218
3.091SerTyr: 3.091 ± 0.891
0.0SerXaa: 0.0 ± 0.0
Thr
1.202ThrAla: 1.202 ± 0.414
0.343ThrCys: 0.343 ± 0.417
4.464ThrAsp: 4.464 ± 0.635
3.777ThrGlu: 3.777 ± 1.386
4.121ThrPhe: 4.121 ± 0.539
3.262ThrGly: 3.262 ± 0.686
1.545ThrHis: 1.545 ± 0.591
5.666ThrIle: 5.666 ± 0.814
5.323ThrLys: 5.323 ± 0.998
5.666ThrLeu: 5.666 ± 0.75
1.717ThrMet: 1.717 ± 0.475
4.636ThrAsn: 4.636 ± 0.793
1.374ThrPro: 1.374 ± 0.699
2.232ThrGln: 2.232 ± 0.713
1.545ThrArg: 1.545 ± 0.394
4.464ThrSer: 4.464 ± 0.827
3.777ThrThr: 3.777 ± 1.027
2.404ThrVal: 2.404 ± 0.695
0.687ThrTrp: 0.687 ± 0.437
2.576ThrTyr: 2.576 ± 0.546
0.0ThrXaa: 0.0 ± 0.0
Val
2.06ValAla: 2.06 ± 0.661
0.687ValCys: 0.687 ± 0.3
3.091ValAsp: 3.091 ± 0.547
2.232ValGlu: 2.232 ± 0.621
2.404ValPhe: 2.404 ± 0.701
1.374ValGly: 1.374 ± 0.451
0.859ValHis: 0.859 ± 0.402
3.949ValIle: 3.949 ± 0.715
3.949ValLys: 3.949 ± 0.612
3.777ValLeu: 3.777 ± 0.662
1.202ValMet: 1.202 ± 0.384
4.979ValAsn: 4.979 ± 0.856
2.06ValPro: 2.06 ± 0.843
2.747ValGln: 2.747 ± 0.971
3.091ValArg: 3.091 ± 0.793
3.091ValSer: 3.091 ± 0.636
3.949ValThr: 3.949 ± 0.904
3.434ValVal: 3.434 ± 0.855
0.515ValTrp: 0.515 ± 0.434
3.091ValTyr: 3.091 ± 0.815
0.0ValXaa: 0.0 ± 0.0
Trp
0.343TrpAla: 0.343 ± 0.373
0.0TrpCys: 0.0 ± 0.0
1.202TrpAsp: 1.202 ± 0.377
0.343TrpGlu: 0.343 ± 0.255
0.515TrpPhe: 0.515 ± 0.293
0.515TrpGly: 0.515 ± 0.462
0.343TrpHis: 0.343 ± 0.3
1.202TrpIle: 1.202 ± 0.558
0.515TrpLys: 0.515 ± 0.437
1.545TrpLeu: 1.545 ± 0.628
0.343TrpMet: 0.343 ± 0.228
0.687TrpAsn: 0.687 ± 0.521
0.0TrpPro: 0.0 ± 0.0
0.343TrpGln: 0.343 ± 0.258
0.0TrpArg: 0.0 ± 0.0
0.343TrpSer: 0.343 ± 0.228
0.515TrpThr: 0.515 ± 0.226
0.172TrpVal: 0.172 ± 0.158
0.0TrpTrp: 0.0 ± 0.0
0.343TrpTyr: 0.343 ± 0.373
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.889TyrAla: 1.889 ± 0.518
0.343TyrCys: 0.343 ± 0.244
4.808TyrAsp: 4.808 ± 0.823
2.404TyrGlu: 2.404 ± 0.858
2.747TyrPhe: 2.747 ± 0.818
3.091TyrGly: 3.091 ± 0.824
1.545TyrHis: 1.545 ± 0.645
4.636TyrIle: 4.636 ± 1.248
3.777TyrLys: 3.777 ± 0.573
5.838TyrLeu: 5.838 ± 1.292
1.717TyrMet: 1.717 ± 0.668
7.04TyrAsn: 7.04 ± 1.059
1.717TyrPro: 1.717 ± 0.377
1.717TyrGln: 1.717 ± 0.481
0.687TyrArg: 0.687 ± 0.271
3.606TyrSer: 3.606 ± 0.801
4.293TyrThr: 4.293 ± 0.627
3.777TyrVal: 3.777 ± 0.507
0.515TyrTrp: 0.515 ± 0.257
4.464TyrTyr: 4.464 ± 1.082
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 27 proteins (5825 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski