Amino acid dipepetide frequency for Staphylococcus phage SA345ruMSSAST8

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.757AlaAla: 1.757 ± 0.721
0.319AlaCys: 0.319 ± 0.156
2.395AlaAsp: 2.395 ± 0.467
3.833AlaGlu: 3.833 ± 0.679
2.874AlaPhe: 2.874 ± 0.682
3.354AlaGly: 3.354 ± 0.531
0.479AlaHis: 0.479 ± 0.188
5.19AlaIle: 5.19 ± 0.759
4.95AlaLys: 4.95 ± 0.613
5.03AlaLeu: 5.03 ± 0.626
1.597AlaMet: 1.597 ± 0.449
3.513AlaAsn: 3.513 ± 0.543
1.677AlaPro: 1.677 ± 0.354
2.874AlaGln: 2.874 ± 0.618
2.874AlaArg: 2.874 ± 0.484
3.433AlaSer: 3.433 ± 0.459
3.194AlaThr: 3.194 ± 0.737
2.555AlaVal: 2.555 ± 0.526
0.559AlaTrp: 0.559 ± 0.243
2.795AlaTyr: 2.795 ± 0.382
0.0AlaXaa: 0.0 ± 0.0
Cys
0.479CysAla: 0.479 ± 0.182
0.08CysCys: 0.08 ± 0.079
0.08CysAsp: 0.08 ± 0.091
0.24CysGlu: 0.24 ± 0.133
0.319CysPhe: 0.319 ± 0.203
0.24CysGly: 0.24 ± 0.138
0.08CysHis: 0.08 ± 0.078
0.958CysIle: 0.958 ± 0.271
0.319CysLys: 0.319 ± 0.163
0.24CysLeu: 0.24 ± 0.148
0.0CysMet: 0.0 ± 0.0
0.319CysAsn: 0.319 ± 0.15
0.16CysPro: 0.16 ± 0.11
0.08CysGln: 0.08 ± 0.078
0.399CysArg: 0.399 ± 0.185
0.319CysSer: 0.319 ± 0.194
0.479CysThr: 0.479 ± 0.179
0.24CysVal: 0.24 ± 0.14
0.08CysTrp: 0.08 ± 0.085
0.399CysTyr: 0.399 ± 0.185
0.0CysXaa: 0.0 ± 0.0
Asp
3.513AspAla: 3.513 ± 0.482
0.479AspCys: 0.479 ± 0.207
3.912AspAsp: 3.912 ± 0.807
5.35AspGlu: 5.35 ± 1.054
3.274AspPhe: 3.274 ± 0.517
5.669AspGly: 5.669 ± 0.744
0.798AspHis: 0.798 ± 0.268
4.791AspIle: 4.791 ± 0.65
5.19AspLys: 5.19 ± 0.699
4.551AspLeu: 4.551 ± 0.535
1.677AspMet: 1.677 ± 0.303
4.95AspAsn: 4.95 ± 0.94
1.437AspPro: 1.437 ± 0.389
1.118AspGln: 1.118 ± 0.332
2.156AspArg: 2.156 ± 0.553
3.513AspSer: 3.513 ± 0.728
2.635AspThr: 2.635 ± 0.622
3.912AspVal: 3.912 ± 0.565
0.559AspTrp: 0.559 ± 0.241
3.194AspTyr: 3.194 ± 0.591
0.0AspXaa: 0.0 ± 0.0
Glu
4.791GluAla: 4.791 ± 0.755
0.479GluCys: 0.479 ± 0.207
3.513GluAsp: 3.513 ± 0.619
7.186GluGlu: 7.186 ± 1.018
3.753GluPhe: 3.753 ± 0.595
2.475GluGly: 2.475 ± 0.386
1.517GluHis: 1.517 ± 0.39
7.027GluIle: 7.027 ± 0.915
7.027GluLys: 7.027 ± 1.14
8.304GluLeu: 8.304 ± 1.12
2.395GluMet: 2.395 ± 0.479
5.829GluAsn: 5.829 ± 0.806
1.757GluPro: 1.757 ± 0.344
3.593GluGln: 3.593 ± 0.578
4.312GluArg: 4.312 ± 0.569
4.791GluSer: 4.791 ± 0.616
3.673GluThr: 3.673 ± 0.782
4.631GluVal: 4.631 ± 0.601
1.118GluTrp: 1.118 ± 0.269
4.232GluTyr: 4.232 ± 0.646
0.0GluXaa: 0.0 ± 0.0
Phe
2.475PheAla: 2.475 ± 0.563
0.16PheCys: 0.16 ± 0.119
3.114PheAsp: 3.114 ± 0.452
3.992PheGlu: 3.992 ± 0.536
1.118PhePhe: 1.118 ± 0.316
2.236PheGly: 2.236 ± 0.4
0.479PheHis: 0.479 ± 0.187
3.992PheIle: 3.992 ± 0.638
5.11PheLys: 5.11 ± 0.619
2.954PheLeu: 2.954 ± 0.477
1.198PheMet: 1.198 ± 0.397
4.152PheAsn: 4.152 ± 0.577
0.878PhePro: 0.878 ± 0.33
0.639PheGln: 0.639 ± 0.222
1.517PheArg: 1.517 ± 0.332
2.954PheSer: 2.954 ± 0.823
2.475PheThr: 2.475 ± 0.507
1.996PheVal: 1.996 ± 0.313
0.16PheTrp: 0.16 ± 0.107
1.517PheTyr: 1.517 ± 0.434
0.0PheXaa: 0.0 ± 0.0
Gly
2.555GlyAla: 2.555 ± 0.688
0.399GlyCys: 0.399 ± 0.162
3.593GlyAsp: 3.593 ± 0.639
3.433GlyGlu: 3.433 ± 0.75
2.795GlyPhe: 2.795 ± 0.579
3.753GlyGly: 3.753 ± 1.108
1.278GlyHis: 1.278 ± 0.317
4.232GlyIle: 4.232 ± 0.883
5.989GlyLys: 5.989 ± 0.813
5.43GlyLeu: 5.43 ± 0.914
0.798GlyMet: 0.798 ± 0.279
3.194GlyAsn: 3.194 ± 0.549
1.278GlyPro: 1.278 ± 0.4
1.836GlyGln: 1.836 ± 0.474
2.954GlyArg: 2.954 ± 0.46
2.236GlySer: 2.236 ± 0.449
2.715GlyThr: 2.715 ± 0.48
3.274GlyVal: 3.274 ± 0.707
1.038GlyTrp: 1.038 ± 0.286
2.555GlyTyr: 2.555 ± 0.481
0.0GlyXaa: 0.0 ± 0.0
His
1.118HisAla: 1.118 ± 0.358
0.08HisCys: 0.08 ± 0.074
0.958HisAsp: 0.958 ± 0.263
1.357HisGlu: 1.357 ± 0.328
1.198HisPhe: 1.198 ± 0.276
0.719HisGly: 0.719 ± 0.307
0.16HisHis: 0.16 ± 0.112
1.996HisIle: 1.996 ± 0.389
1.278HisLys: 1.278 ± 0.321
1.597HisLeu: 1.597 ± 0.284
0.16HisMet: 0.16 ± 0.11
0.798HisAsn: 0.798 ± 0.268
0.639HisPro: 0.639 ± 0.21
0.399HisGln: 0.399 ± 0.16
0.479HisArg: 0.479 ± 0.227
0.479HisSer: 0.479 ± 0.172
0.798HisThr: 0.798 ± 0.255
0.878HisVal: 0.878 ± 0.257
0.16HisTrp: 0.16 ± 0.112
1.198HisTyr: 1.198 ± 0.268
0.0HisXaa: 0.0 ± 0.0
Ile
5.35IleAla: 5.35 ± 0.853
0.399IleCys: 0.399 ± 0.165
6.468IleAsp: 6.468 ± 0.788
7.027IleGlu: 7.027 ± 0.724
3.194IlePhe: 3.194 ± 0.585
3.833IleGly: 3.833 ± 0.735
1.757IleHis: 1.757 ± 0.39
4.152IleIle: 4.152 ± 0.536
9.182IleLys: 9.182 ± 0.721
4.152IleLeu: 4.152 ± 0.509
1.437IleMet: 1.437 ± 0.311
6.547IleAsn: 6.547 ± 1.129
2.316IlePro: 2.316 ± 0.544
3.593IleGln: 3.593 ± 0.568
2.954IleArg: 2.954 ± 0.488
5.03IleSer: 5.03 ± 0.779
4.551IleThr: 4.551 ± 0.571
4.95IleVal: 4.95 ± 0.53
0.958IleTrp: 0.958 ± 0.458
3.114IleTyr: 3.114 ± 0.613
0.0IleXaa: 0.0 ± 0.0
Lys
5.19LysAla: 5.19 ± 0.701
0.16LysCys: 0.16 ± 0.114
6.547LysAsp: 6.547 ± 0.701
9.422LysGlu: 9.422 ± 1.24
3.753LysPhe: 3.753 ± 0.505
5.589LysGly: 5.589 ± 0.957
1.278LysHis: 1.278 ± 0.378
7.266LysIle: 7.266 ± 0.909
7.905LysLys: 7.905 ± 0.788
7.506LysLeu: 7.506 ± 0.667
2.395LysMet: 2.395 ± 0.445
5.909LysAsn: 5.909 ± 0.739
2.874LysPro: 2.874 ± 0.508
4.152LysGln: 4.152 ± 0.586
4.631LysArg: 4.631 ± 0.556
5.589LysSer: 5.589 ± 0.831
5.509LysThr: 5.509 ± 0.708
5.669LysVal: 5.669 ± 0.649
1.118LysTrp: 1.118 ± 0.355
4.072LysTyr: 4.072 ± 0.635
0.0LysXaa: 0.0 ± 0.0
Leu
3.673LeuAla: 3.673 ± 0.697
0.479LeuCys: 0.479 ± 0.207
5.829LeuAsp: 5.829 ± 0.65
7.745LeuGlu: 7.745 ± 0.883
3.513LeuPhe: 3.513 ± 0.466
3.354LeuGly: 3.354 ± 0.721
1.118LeuHis: 1.118 ± 0.262
5.669LeuIle: 5.669 ± 0.774
8.384LeuLys: 8.384 ± 0.826
6.707LeuLeu: 6.707 ± 0.804
2.236LeuMet: 2.236 ± 0.529
5.509LeuAsn: 5.509 ± 0.7
2.156LeuPro: 2.156 ± 0.439
3.114LeuGln: 3.114 ± 0.447
3.354LeuArg: 3.354 ± 0.517
5.43LeuSer: 5.43 ± 0.755
4.471LeuThr: 4.471 ± 0.602
3.673LeuVal: 3.673 ± 0.531
0.559LeuTrp: 0.559 ± 0.221
2.475LeuTyr: 2.475 ± 0.48
0.0LeuXaa: 0.0 ± 0.0
Met
0.958MetAla: 0.958 ± 0.329
0.16MetCys: 0.16 ± 0.113
1.597MetAsp: 1.597 ± 0.397
1.278MetGlu: 1.278 ± 0.32
0.719MetPhe: 0.719 ± 0.222
1.437MetGly: 1.437 ± 0.555
0.16MetHis: 0.16 ± 0.127
2.395MetIle: 2.395 ± 0.437
1.996MetLys: 1.996 ± 0.439
1.836MetLeu: 1.836 ± 0.317
1.118MetMet: 1.118 ± 0.302
1.278MetAsn: 1.278 ± 0.283
0.958MetPro: 0.958 ± 0.323
1.198MetGln: 1.198 ± 0.304
1.198MetArg: 1.198 ± 0.31
1.757MetSer: 1.757 ± 0.328
1.677MetThr: 1.677 ± 0.402
1.517MetVal: 1.517 ± 0.32
0.479MetTrp: 0.479 ± 0.161
0.559MetTyr: 0.559 ± 0.21
0.0MetXaa: 0.0 ± 0.0
Asn
4.471AsnAla: 4.471 ± 0.578
0.16AsnCys: 0.16 ± 0.118
4.631AsnAsp: 4.631 ± 0.667
5.27AsnGlu: 5.27 ± 1.051
1.836AsnPhe: 1.836 ± 0.546
5.11AsnGly: 5.11 ± 0.716
1.118AsnHis: 1.118 ± 0.282
4.711AsnIle: 4.711 ± 0.65
7.665AsnLys: 7.665 ± 1.069
4.471AsnLeu: 4.471 ± 0.608
1.357AsnMet: 1.357 ± 0.313
4.392AsnAsn: 4.392 ± 0.711
2.236AsnPro: 2.236 ± 0.331
2.874AsnGln: 2.874 ± 0.584
2.954AsnArg: 2.954 ± 0.484
3.274AsnSer: 3.274 ± 0.453
3.753AsnThr: 3.753 ± 0.439
3.673AsnVal: 3.673 ± 0.608
0.878AsnTrp: 0.878 ± 0.408
2.954AsnTyr: 2.954 ± 0.493
0.0AsnXaa: 0.0 ± 0.0
Pro
1.198ProAla: 1.198 ± 0.328
0.0ProCys: 0.0 ± 0.0
1.437ProAsp: 1.437 ± 0.285
2.395ProGlu: 2.395 ± 0.393
1.517ProPhe: 1.517 ± 0.365
1.038ProGly: 1.038 ± 0.291
0.639ProHis: 0.639 ± 0.222
2.795ProIle: 2.795 ± 0.453
2.635ProLys: 2.635 ± 0.568
1.677ProLeu: 1.677 ± 0.37
0.639ProMet: 0.639 ± 0.18
0.878ProAsn: 0.878 ± 0.265
0.798ProPro: 0.798 ± 0.202
0.719ProGln: 0.719 ± 0.242
1.357ProArg: 1.357 ± 0.285
1.517ProSer: 1.517 ± 0.338
1.118ProThr: 1.118 ± 0.293
1.677ProVal: 1.677 ± 0.306
0.16ProTrp: 0.16 ± 0.113
1.198ProTyr: 1.198 ± 0.258
0.0ProXaa: 0.0 ± 0.0
Gln
2.874GlnAla: 2.874 ± 0.424
0.479GlnCys: 0.479 ± 0.215
1.996GlnAsp: 1.996 ± 0.457
2.715GlnGlu: 2.715 ± 0.519
0.878GlnPhe: 0.878 ± 0.217
1.916GlnGly: 1.916 ± 0.399
0.479GlnHis: 0.479 ± 0.171
2.954GlnIle: 2.954 ± 0.471
3.433GlnLys: 3.433 ± 0.501
3.513GlnLeu: 3.513 ± 0.553
1.038GlnMet: 1.038 ± 0.271
2.795GlnAsn: 2.795 ± 0.48
0.719GlnPro: 0.719 ± 0.205
2.236GlnGln: 2.236 ± 0.613
1.996GlnArg: 1.996 ± 0.423
2.156GlnSer: 2.156 ± 0.415
1.677GlnThr: 1.677 ± 0.358
2.236GlnVal: 2.236 ± 0.47
0.399GlnTrp: 0.399 ± 0.166
1.597GlnTyr: 1.597 ± 0.344
0.0GlnXaa: 0.0 ± 0.0
Arg
2.795ArgAla: 2.795 ± 0.476
0.24ArgCys: 0.24 ± 0.121
2.555ArgAsp: 2.555 ± 0.46
3.593ArgGlu: 3.593 ± 0.597
2.076ArgPhe: 2.076 ± 0.381
1.677ArgGly: 1.677 ± 0.471
0.559ArgHis: 0.559 ± 0.215
3.513ArgIle: 3.513 ± 0.65
4.072ArgLys: 4.072 ± 0.741
4.392ArgLeu: 4.392 ± 0.56
1.198ArgMet: 1.198 ± 0.245
2.715ArgAsn: 2.715 ± 0.475
0.719ArgPro: 0.719 ± 0.189
1.278ArgGln: 1.278 ± 0.339
2.635ArgArg: 2.635 ± 0.413
1.677ArgSer: 1.677 ± 0.401
2.635ArgThr: 2.635 ± 0.542
2.555ArgVal: 2.555 ± 0.41
0.639ArgTrp: 0.639 ± 0.267
2.874ArgTyr: 2.874 ± 0.607
0.0ArgXaa: 0.0 ± 0.0
Ser
3.433SerAla: 3.433 ± 0.595
0.559SerCys: 0.559 ± 0.254
4.312SerAsp: 4.312 ± 0.811
5.829SerGlu: 5.829 ± 0.732
2.635SerPhe: 2.635 ± 0.541
2.715SerGly: 2.715 ± 0.824
1.757SerHis: 1.757 ± 0.348
5.27SerIle: 5.27 ± 0.92
5.19SerLys: 5.19 ± 0.731
3.593SerLeu: 3.593 ± 0.507
1.038SerMet: 1.038 ± 0.24
4.312SerAsn: 4.312 ± 0.587
1.038SerPro: 1.038 ± 0.299
1.916SerGln: 1.916 ± 0.37
1.677SerArg: 1.677 ± 0.376
3.194SerSer: 3.194 ± 0.588
3.433SerThr: 3.433 ± 0.588
2.715SerVal: 2.715 ± 0.536
0.16SerTrp: 0.16 ± 0.104
2.156SerTyr: 2.156 ± 0.455
0.0SerXaa: 0.0 ± 0.0
Thr
3.194ThrAla: 3.194 ± 0.537
0.24ThrCys: 0.24 ± 0.134
3.114ThrAsp: 3.114 ± 0.818
3.513ThrGlu: 3.513 ± 0.472
1.916ThrPhe: 1.916 ± 0.445
3.833ThrGly: 3.833 ± 0.822
1.198ThrHis: 1.198 ± 0.278
4.471ThrIle: 4.471 ± 0.486
5.19ThrLys: 5.19 ± 0.596
4.471ThrLeu: 4.471 ± 0.463
0.958ThrMet: 0.958 ± 0.237
3.194ThrAsn: 3.194 ± 0.555
1.836ThrPro: 1.836 ± 0.441
1.836ThrGln: 1.836 ± 0.399
2.555ThrArg: 2.555 ± 0.444
3.354ThrSer: 3.354 ± 0.66
3.194ThrThr: 3.194 ± 0.616
3.114ThrVal: 3.114 ± 0.444
0.958ThrTrp: 0.958 ± 0.299
2.475ThrTyr: 2.475 ± 0.539
0.0ThrXaa: 0.0 ± 0.0
Val
2.954ValAla: 2.954 ± 0.481
0.24ValCys: 0.24 ± 0.148
3.194ValAsp: 3.194 ± 0.538
3.274ValGlu: 3.274 ± 0.591
2.635ValPhe: 2.635 ± 0.546
3.274ValGly: 3.274 ± 0.592
0.878ValHis: 0.878 ± 0.267
4.551ValIle: 4.551 ± 0.539
5.749ValLys: 5.749 ± 0.662
4.392ValLeu: 4.392 ± 0.629
1.597ValMet: 1.597 ± 0.341
4.392ValAsn: 4.392 ± 0.469
1.118ValPro: 1.118 ± 0.252
2.076ValGln: 2.076 ± 0.512
2.076ValArg: 2.076 ± 0.421
2.954ValSer: 2.954 ± 0.424
3.833ValThr: 3.833 ± 0.635
3.753ValVal: 3.753 ± 0.693
0.719ValTrp: 0.719 ± 0.22
2.395ValTyr: 2.395 ± 0.349
0.0ValXaa: 0.0 ± 0.0
Trp
0.24TrpAla: 0.24 ± 0.186
0.08TrpCys: 0.08 ± 0.084
0.719TrpAsp: 0.719 ± 0.245
1.118TrpGlu: 1.118 ± 0.251
0.719TrpPhe: 0.719 ± 0.238
0.559TrpGly: 0.559 ± 0.216
0.08TrpHis: 0.08 ± 0.084
1.118TrpIle: 1.118 ± 0.266
1.038TrpLys: 1.038 ± 0.267
1.038TrpLeu: 1.038 ± 0.295
0.399TrpMet: 0.399 ± 0.152
0.719TrpAsn: 0.719 ± 0.27
0.08TrpPro: 0.08 ± 0.07
0.479TrpGln: 0.479 ± 0.181
0.479TrpArg: 0.479 ± 0.178
0.798TrpSer: 0.798 ± 0.272
0.319TrpThr: 0.319 ± 0.159
0.559TrpVal: 0.559 ± 0.178
0.08TrpTrp: 0.08 ± 0.069
0.559TrpTyr: 0.559 ± 0.209
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.236TyrAla: 2.236 ± 0.32
0.319TyrCys: 0.319 ± 0.165
2.715TyrAsp: 2.715 ± 0.476
3.833TyrGlu: 3.833 ± 0.678
2.395TyrPhe: 2.395 ± 0.542
2.555TyrGly: 2.555 ± 0.43
0.639TyrHis: 0.639 ± 0.222
3.912TyrIle: 3.912 ± 0.618
4.312TyrLys: 4.312 ± 0.633
3.593TyrLeu: 3.593 ± 0.582
0.958TyrMet: 0.958 ± 0.295
2.395TyrAsn: 2.395 ± 0.338
0.719TyrPro: 0.719 ± 0.264
2.076TyrGln: 2.076 ± 0.443
1.836TyrArg: 1.836 ± 0.446
2.475TyrSer: 2.475 ± 0.522
2.475TyrThr: 2.475 ± 0.4
2.475TyrVal: 2.475 ± 0.527
0.399TyrTrp: 0.399 ± 0.2
1.278TyrTyr: 1.278 ± 0.459
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 64 proteins (12525 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski