Amino acid dipepetide frequency for Staphylococcus phage phiSa119

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.912AlaAla: 1.912 ± 0.634
0.153AlaCys: 0.153 ± 0.107
2.371AlaAsp: 2.371 ± 0.387
4.206AlaGlu: 4.206 ± 0.507
2.676AlaPhe: 2.676 ± 0.665
3.594AlaGly: 3.594 ± 0.707
0.765AlaHis: 0.765 ± 0.239
4.435AlaIle: 4.435 ± 0.777
4.435AlaLys: 4.435 ± 0.509
4.741AlaLeu: 4.741 ± 0.548
1.453AlaMet: 1.453 ± 0.294
3.212AlaAsn: 3.212 ± 0.528
1.606AlaPro: 1.606 ± 0.342
2.524AlaGln: 2.524 ± 0.468
2.753AlaArg: 2.753 ± 0.408
3.212AlaSer: 3.212 ± 0.559
3.441AlaThr: 3.441 ± 0.559
3.135AlaVal: 3.135 ± 0.458
0.382AlaTrp: 0.382 ± 0.159
2.524AlaTyr: 2.524 ± 0.441
0.0AlaXaa: 0.0 ± 0.0
Cys
0.229CysAla: 0.229 ± 0.139
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.459CysGlu: 0.459 ± 0.199
0.076CysPhe: 0.076 ± 0.066
0.229CysGly: 0.229 ± 0.115
0.153CysHis: 0.153 ± 0.107
0.688CysIle: 0.688 ± 0.215
0.382CysLys: 0.382 ± 0.167
0.076CysLeu: 0.076 ± 0.075
0.076CysMet: 0.076 ± 0.08
0.229CysAsn: 0.229 ± 0.125
0.153CysPro: 0.153 ± 0.103
0.153CysGln: 0.153 ± 0.094
0.306CysArg: 0.306 ± 0.251
0.229CysSer: 0.229 ± 0.148
0.076CysThr: 0.076 ± 0.066
0.306CysVal: 0.306 ± 0.169
0.076CysTrp: 0.076 ± 0.076
0.229CysTyr: 0.229 ± 0.128
0.0CysXaa: 0.0 ± 0.0
Asp
2.753AspAla: 2.753 ± 0.401
0.306AspCys: 0.306 ± 0.138
4.129AspAsp: 4.129 ± 0.763
5.353AspGlu: 5.353 ± 0.883
3.059AspPhe: 3.059 ± 0.467
4.665AspGly: 4.665 ± 0.75
0.612AspHis: 0.612 ± 0.205
5.429AspIle: 5.429 ± 0.513
5.429AspLys: 5.429 ± 0.706
5.735AspLeu: 5.735 ± 0.722
2.141AspMet: 2.141 ± 0.497
3.671AspAsn: 3.671 ± 0.496
1.529AspPro: 1.529 ± 0.35
0.765AspGln: 0.765 ± 0.231
2.141AspArg: 2.141 ± 0.437
3.518AspSer: 3.518 ± 0.531
3.365AspThr: 3.365 ± 0.507
3.594AspVal: 3.594 ± 0.668
0.535AspTrp: 0.535 ± 0.181
2.829AspTyr: 2.829 ± 0.437
0.0AspXaa: 0.0 ± 0.0
Glu
3.976GluAla: 3.976 ± 0.724
0.612GluCys: 0.612 ± 0.19
3.594GluAsp: 3.594 ± 0.519
6.959GluGlu: 6.959 ± 0.857
3.059GluPhe: 3.059 ± 0.48
3.059GluGly: 3.059 ± 0.595
1.453GluHis: 1.453 ± 0.357
6.118GluIle: 6.118 ± 0.675
8.106GluLys: 8.106 ± 0.93
7.112GluLeu: 7.112 ± 0.983
2.447GluMet: 2.447 ± 0.424
5.276GluAsn: 5.276 ± 0.57
1.376GluPro: 1.376 ± 0.337
3.441GluGln: 3.441 ± 0.573
3.824GluArg: 3.824 ± 0.519
4.741GluSer: 4.741 ± 0.625
3.747GluThr: 3.747 ± 0.766
4.435GluVal: 4.435 ± 0.508
0.918GluTrp: 0.918 ± 0.232
4.359GluTyr: 4.359 ± 0.666
0.0GluXaa: 0.0 ± 0.0
Phe
2.218PheAla: 2.218 ± 0.555
0.153PheCys: 0.153 ± 0.095
2.6PheAsp: 2.6 ± 0.42
2.906PheGlu: 2.906 ± 0.488
1.071PhePhe: 1.071 ± 0.256
2.6PheGly: 2.6 ± 0.427
0.612PheHis: 0.612 ± 0.214
3.976PheIle: 3.976 ± 0.559
4.053PheLys: 4.053 ± 0.589
2.753PheLeu: 2.753 ± 0.461
1.453PheMet: 1.453 ± 0.369
3.212PheAsn: 3.212 ± 0.459
0.688PhePro: 0.688 ± 0.24
0.688PheGln: 0.688 ± 0.212
1.682PheArg: 1.682 ± 0.338
2.524PheSer: 2.524 ± 0.434
3.059PheThr: 3.059 ± 0.566
2.447PheVal: 2.447 ± 0.525
0.229PheTrp: 0.229 ± 0.13
1.376PheTyr: 1.376 ± 0.256
0.0PheXaa: 0.0 ± 0.0
Gly
2.676GlyAla: 2.676 ± 0.545
0.306GlyCys: 0.306 ± 0.147
3.824GlyAsp: 3.824 ± 0.442
3.594GlyGlu: 3.594 ± 0.671
2.6GlyPhe: 2.6 ± 0.393
4.435GlyGly: 4.435 ± 0.856
1.3GlyHis: 1.3 ± 0.4
4.053GlyIle: 4.053 ± 0.675
7.571GlyLys: 7.571 ± 0.856
5.123GlyLeu: 5.123 ± 0.792
1.224GlyMet: 1.224 ± 0.315
3.671GlyAsn: 3.671 ± 0.592
1.224GlyPro: 1.224 ± 0.304
2.141GlyGln: 2.141 ± 0.469
2.141GlyArg: 2.141 ± 0.357
3.518GlySer: 3.518 ± 0.553
2.829GlyThr: 2.829 ± 0.572
3.9GlyVal: 3.9 ± 0.785
0.918GlyTrp: 0.918 ± 0.305
3.212GlyTyr: 3.212 ± 0.617
0.0GlyXaa: 0.0 ± 0.0
His
1.3HisAla: 1.3 ± 0.348
0.076HisCys: 0.076 ± 0.09
1.147HisAsp: 1.147 ± 0.346
0.841HisGlu: 0.841 ± 0.281
1.3HisPhe: 1.3 ± 0.315
0.841HisGly: 0.841 ± 0.27
0.459HisHis: 0.459 ± 0.209
1.988HisIle: 1.988 ± 0.346
1.529HisLys: 1.529 ± 0.359
1.3HisLeu: 1.3 ± 0.255
0.153HisMet: 0.153 ± 0.113
1.224HisAsn: 1.224 ± 0.336
0.306HisPro: 0.306 ± 0.162
0.688HisGln: 0.688 ± 0.202
0.229HisArg: 0.229 ± 0.164
0.918HisSer: 0.918 ± 0.189
1.453HisThr: 1.453 ± 0.315
1.224HisVal: 1.224 ± 0.366
0.229HisTrp: 0.229 ± 0.114
1.071HisTyr: 1.071 ± 0.325
0.0HisXaa: 0.0 ± 0.0
Ile
4.894IleAla: 4.894 ± 0.649
0.306IleCys: 0.306 ± 0.145
4.818IleAsp: 4.818 ± 0.683
7.035IleGlu: 7.035 ± 0.729
3.135IlePhe: 3.135 ± 0.503
3.9IleGly: 3.9 ± 0.503
1.682IleHis: 1.682 ± 0.429
4.206IleIle: 4.206 ± 0.713
9.1IleLys: 9.1 ± 0.93
4.512IleLeu: 4.512 ± 0.593
1.529IleMet: 1.529 ± 0.298
5.582IleAsn: 5.582 ± 0.799
2.294IlePro: 2.294 ± 0.46
2.753IleGln: 2.753 ± 0.374
3.059IleArg: 3.059 ± 0.55
4.741IleSer: 4.741 ± 0.618
5.582IleThr: 5.582 ± 0.661
5.047IleVal: 5.047 ± 0.523
0.841IleTrp: 0.841 ± 0.407
3.212IleTyr: 3.212 ± 0.526
0.0IleXaa: 0.0 ± 0.0
Lys
6.118LysAla: 6.118 ± 0.741
0.306LysCys: 0.306 ± 0.176
7.035LysAsp: 7.035 ± 0.548
7.723LysGlu: 7.723 ± 0.828
3.976LysPhe: 3.976 ± 0.581
6.118LysGly: 6.118 ± 0.807
1.606LysHis: 1.606 ± 0.377
6.806LysIle: 6.806 ± 0.788
8.259LysLys: 8.259 ± 0.804
7.8LysLeu: 7.8 ± 0.674
2.371LysMet: 2.371 ± 0.395
6.118LysAsn: 6.118 ± 0.784
2.906LysPro: 2.906 ± 0.53
4.282LysGln: 4.282 ± 0.525
4.282LysArg: 4.282 ± 0.82
5.812LysSer: 5.812 ± 0.83
5.812LysThr: 5.812 ± 0.805
5.965LysVal: 5.965 ± 0.647
1.3LysTrp: 1.3 ± 0.268
4.741LysTyr: 4.741 ± 0.691
0.0LysXaa: 0.0 ± 0.0
Leu
3.594LeuAla: 3.594 ± 0.541
0.459LeuCys: 0.459 ± 0.186
3.671LeuAsp: 3.671 ± 0.443
6.5LeuGlu: 6.5 ± 0.864
3.365LeuPhe: 3.365 ± 0.531
3.365LeuGly: 3.365 ± 0.555
1.376LeuHis: 1.376 ± 0.297
6.271LeuIle: 6.271 ± 0.727
8.182LeuLys: 8.182 ± 0.765
6.423LeuLeu: 6.423 ± 0.842
1.912LeuMet: 1.912 ± 0.429
5.735LeuAsn: 5.735 ± 0.75
2.218LeuPro: 2.218 ± 0.421
3.059LeuGln: 3.059 ± 0.477
3.059LeuArg: 3.059 ± 0.477
5.123LeuSer: 5.123 ± 0.649
3.824LeuThr: 3.824 ± 0.545
4.053LeuVal: 4.053 ± 0.572
0.918LeuTrp: 0.918 ± 0.277
3.976LeuTyr: 3.976 ± 0.672
0.0LeuXaa: 0.0 ± 0.0
Met
1.147MetAla: 1.147 ± 0.34
0.153MetCys: 0.153 ± 0.103
1.759MetAsp: 1.759 ± 0.379
0.841MetGlu: 0.841 ± 0.301
1.071MetPhe: 1.071 ± 0.196
1.147MetGly: 1.147 ± 0.388
0.382MetHis: 0.382 ± 0.139
2.218MetIle: 2.218 ± 0.411
2.218MetLys: 2.218 ± 0.404
1.606MetLeu: 1.606 ± 0.34
0.612MetMet: 0.612 ± 0.226
2.218MetAsn: 2.218 ± 0.486
0.765MetPro: 0.765 ± 0.254
0.841MetGln: 0.841 ± 0.234
1.224MetArg: 1.224 ± 0.295
1.988MetSer: 1.988 ± 0.313
1.453MetThr: 1.453 ± 0.313
1.3MetVal: 1.3 ± 0.276
0.306MetTrp: 0.306 ± 0.164
0.765MetTyr: 0.765 ± 0.281
0.0MetXaa: 0.0 ± 0.0
Asn
3.976AsnAla: 3.976 ± 0.549
0.076AsnCys: 0.076 ± 0.074
4.129AsnAsp: 4.129 ± 0.526
4.971AsnGlu: 4.971 ± 0.533
1.835AsnPhe: 1.835 ± 0.623
5.812AsnGly: 5.812 ± 0.599
0.918AsnHis: 0.918 ± 0.282
4.894AsnIle: 4.894 ± 0.629
7.647AsnLys: 7.647 ± 0.867
3.441AsnLeu: 3.441 ± 0.493
1.529AsnMet: 1.529 ± 0.302
4.971AsnAsn: 4.971 ± 0.838
2.829AsnPro: 2.829 ± 0.392
2.906AsnGln: 2.906 ± 0.464
2.829AsnArg: 2.829 ± 0.44
4.053AsnSer: 4.053 ± 0.597
3.212AsnThr: 3.212 ± 0.43
3.747AsnVal: 3.747 ± 0.623
1.071AsnTrp: 1.071 ± 0.447
3.288AsnTyr: 3.288 ± 0.498
0.0AsnXaa: 0.0 ± 0.0
Pro
0.841ProAla: 0.841 ± 0.209
0.0ProCys: 0.0 ± 0.0
1.606ProAsp: 1.606 ± 0.341
2.218ProGlu: 2.218 ± 0.371
0.994ProPhe: 0.994 ± 0.229
1.071ProGly: 1.071 ± 0.243
0.382ProHis: 0.382 ± 0.152
2.6ProIle: 2.6 ± 0.442
2.982ProLys: 2.982 ± 0.566
1.912ProLeu: 1.912 ± 0.332
0.765ProMet: 0.765 ± 0.215
1.529ProAsn: 1.529 ± 0.299
1.071ProPro: 1.071 ± 0.212
0.765ProGln: 0.765 ± 0.244
0.994ProArg: 0.994 ± 0.271
1.988ProSer: 1.988 ± 0.437
1.682ProThr: 1.682 ± 0.38
1.376ProVal: 1.376 ± 0.316
0.229ProTrp: 0.229 ± 0.105
1.376ProTyr: 1.376 ± 0.338
0.0ProXaa: 0.0 ± 0.0
Gln
2.982GlnAla: 2.982 ± 0.401
0.382GlnCys: 0.382 ± 0.196
1.606GlnAsp: 1.606 ± 0.374
2.906GlnGlu: 2.906 ± 0.596
1.529GlnPhe: 1.529 ± 0.32
2.294GlnGly: 2.294 ± 0.405
1.071GlnHis: 1.071 ± 0.275
2.753GlnIle: 2.753 ± 0.391
2.829GlnLys: 2.829 ± 0.412
2.6GlnLeu: 2.6 ± 0.331
0.459GlnMet: 0.459 ± 0.167
3.212GlnAsn: 3.212 ± 0.563
0.994GlnPro: 0.994 ± 0.258
1.606GlnGln: 1.606 ± 0.379
1.759GlnArg: 1.759 ± 0.282
2.141GlnSer: 2.141 ± 0.349
1.759GlnThr: 1.759 ± 0.375
2.294GlnVal: 2.294 ± 0.358
0.459GlnTrp: 0.459 ± 0.17
1.224GlnTyr: 1.224 ± 0.252
0.0GlnXaa: 0.0 ± 0.0
Arg
2.218ArgAla: 2.218 ± 0.424
0.153ArgCys: 0.153 ± 0.103
2.829ArgAsp: 2.829 ± 0.383
3.671ArgGlu: 3.671 ± 0.546
1.453ArgPhe: 1.453 ± 0.349
1.682ArgGly: 1.682 ± 0.34
0.841ArgHis: 0.841 ± 0.24
3.212ArgIle: 3.212 ± 0.568
3.594ArgLys: 3.594 ± 0.5
3.9ArgLeu: 3.9 ± 0.503
0.994ArgMet: 0.994 ± 0.248
2.829ArgAsn: 2.829 ± 0.395
0.765ArgPro: 0.765 ± 0.297
1.071ArgGln: 1.071 ± 0.264
2.218ArgArg: 2.218 ± 0.425
1.453ArgSer: 1.453 ± 0.266
2.447ArgThr: 2.447 ± 0.557
2.371ArgVal: 2.371 ± 0.346
0.688ArgTrp: 0.688 ± 0.199
2.753ArgTyr: 2.753 ± 0.45
0.0ArgXaa: 0.0 ± 0.0
Ser
3.212SerAla: 3.212 ± 0.501
0.153SerCys: 0.153 ± 0.104
4.512SerAsp: 4.512 ± 0.768
5.353SerGlu: 5.353 ± 0.649
2.524SerPhe: 2.524 ± 0.595
3.594SerGly: 3.594 ± 0.564
1.147SerHis: 1.147 ± 0.241
5.2SerIle: 5.2 ± 0.556
5.276SerLys: 5.276 ± 0.647
4.282SerLeu: 4.282 ± 0.591
1.3SerMet: 1.3 ± 0.255
3.9SerAsn: 3.9 ± 0.636
0.688SerPro: 0.688 ± 0.194
2.447SerGln: 2.447 ± 0.42
1.912SerArg: 1.912 ± 0.356
3.518SerSer: 3.518 ± 0.525
3.135SerThr: 3.135 ± 0.403
3.135SerVal: 3.135 ± 0.58
0.612SerTrp: 0.612 ± 0.246
2.6SerTyr: 2.6 ± 0.598
0.0SerXaa: 0.0 ± 0.0
Thr
3.441ThrAla: 3.441 ± 0.56
0.076ThrCys: 0.076 ± 0.084
3.594ThrAsp: 3.594 ± 0.741
3.824ThrGlu: 3.824 ± 0.485
2.065ThrPhe: 2.065 ± 0.417
4.206ThrGly: 4.206 ± 0.833
1.682ThrHis: 1.682 ± 0.383
4.512ThrIle: 4.512 ± 0.755
4.435ThrLys: 4.435 ± 0.693
3.824ThrLeu: 3.824 ± 0.411
0.688ThrMet: 0.688 ± 0.182
3.212ThrAsn: 3.212 ± 0.689
2.6ThrPro: 2.6 ± 0.483
1.529ThrGln: 1.529 ± 0.309
2.141ThrArg: 2.141 ± 0.425
3.518ThrSer: 3.518 ± 0.75
3.288ThrThr: 3.288 ± 0.665
4.435ThrVal: 4.435 ± 0.727
0.841ThrTrp: 0.841 ± 0.212
2.524ThrTyr: 2.524 ± 0.478
0.0ThrXaa: 0.0 ± 0.0
Val
3.135ValAla: 3.135 ± 0.5
0.153ValCys: 0.153 ± 0.129
4.282ValAsp: 4.282 ± 0.592
4.359ValGlu: 4.359 ± 0.754
1.759ValPhe: 1.759 ± 0.42
4.053ValGly: 4.053 ± 0.58
1.071ValHis: 1.071 ± 0.213
4.665ValIle: 4.665 ± 0.584
6.729ValLys: 6.729 ± 0.693
4.741ValLeu: 4.741 ± 0.533
1.453ValMet: 1.453 ± 0.296
4.129ValAsn: 4.129 ± 0.522
1.453ValPro: 1.453 ± 0.392
2.6ValGln: 2.6 ± 0.508
1.912ValArg: 1.912 ± 0.332
2.906ValSer: 2.906 ± 0.42
3.288ValThr: 3.288 ± 0.481
3.976ValVal: 3.976 ± 0.638
0.688ValTrp: 0.688 ± 0.245
2.065ValTyr: 2.065 ± 0.411
0.0ValXaa: 0.0 ± 0.0
Trp
0.535TrpAla: 0.535 ± 0.184
0.0TrpCys: 0.0 ± 0.0
0.841TrpAsp: 0.841 ± 0.229
0.841TrpGlu: 0.841 ± 0.232
0.841TrpPhe: 0.841 ± 0.229
0.918TrpGly: 0.918 ± 0.26
0.0TrpHis: 0.0 ± 0.0
0.994TrpIle: 0.994 ± 0.238
1.147TrpLys: 1.147 ± 0.239
1.3TrpLeu: 1.3 ± 0.284
0.459TrpMet: 0.459 ± 0.17
0.918TrpAsn: 0.918 ± 0.326
0.153TrpPro: 0.153 ± 0.099
0.612TrpGln: 0.612 ± 0.183
0.306TrpArg: 0.306 ± 0.139
0.229TrpSer: 0.229 ± 0.115
0.535TrpThr: 0.535 ± 0.14
0.765TrpVal: 0.765 ± 0.189
0.076TrpTrp: 0.076 ± 0.066
0.459TrpTyr: 0.459 ± 0.151
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.371TyrAla: 2.371 ± 0.327
0.306TyrCys: 0.306 ± 0.159
3.212TyrAsp: 3.212 ± 0.581
4.053TyrGlu: 4.053 ± 0.616
1.912TyrPhe: 1.912 ± 0.395
2.676TyrGly: 2.676 ± 0.555
0.765TyrHis: 0.765 ± 0.243
3.212TyrIle: 3.212 ± 0.676
5.429TyrLys: 5.429 ± 0.625
3.9TyrLeu: 3.9 ± 0.604
0.918TyrMet: 0.918 ± 0.298
3.365TyrAsn: 3.365 ± 0.515
0.765TyrPro: 0.765 ± 0.194
1.988TyrGln: 1.988 ± 0.381
2.371TyrArg: 2.371 ± 0.452
2.371TyrSer: 2.371 ± 0.465
2.447TyrThr: 2.447 ± 0.413
1.912TyrVal: 1.912 ± 0.414
0.612TyrTrp: 0.612 ± 0.194
1.3TyrTyr: 1.3 ± 0.413
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 68 proteins (13078 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski