Amino acid dipeptide frequency for Staphylococcus phage SP197

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue.

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
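The counting behind such a table can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify` are hypothetical, overlapping pairs are counted within each protein, frequencies are normalized by the total number of dipeptides counted (the database may normalize differently), and a dipeptide is labelled over- or underrepresented by a plain comparison with the uniform expectation of 2.5‰.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumptions of this sketch: overlapping pairs are counted, pairs never
    span two proteins, and every non-standard letter is mapped to 'X' (Xaa).
    """
    alphabet = STANDARD + "X"
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freqs, expected=1000.0 / 400):
    """Label each dipeptide relative to the uniform expectation (2.5 per mille)."""
    return {
        dp: "over" if f > expected else "under" if f < expected else "expected"
        for dp, f in freqs.items()
    }
```

Calling `dipeptide_per_mille` on the protein sequences of a proteome returns all 441 combinations, including the Xaa ones, so the result maps directly onto a 21 x 21 table; multiplying a value by 10^-3 gives the corresponding fraction.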

For more information, see the sequence space article on Wikipedia.

1st\2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 2.225 ± 0.368 | 0.307 ± 0.136 | 3.99 ± 0.431 | 5.141 ± 0.752 | 2.992 ± 0.476 | 2.762 ± 0.544 | 0.921 ± 0.22 | 5.831 ± 1.011 | 6.138 ± 1.059 | 5.755 ± 0.704 | 1.304 ± 0.276 | 4.45 ± 0.64 | 0.691 ± 0.207 | 2.379 ± 0.427 | 2.379 ± 0.436 | 2.839 ± 0.514 | 4.22 ± 0.613 | 3.76 ± 0.553 | 0.537 ± 0.218 | 2.225 ± 0.482 | 0.0 ± 0.0
Cys | 0.153 ± 0.108 | 0.0 ± 0.0 | 0.307 ± 0.151 | 0.23 ± 0.155 | 0.0 ± 0.0 | 0.23 ± 0.146 | 0.077 ± 0.078 | 0.384 ± 0.159 | 0.46 ± 0.177 | 0.307 ± 0.173 | 0.23 ± 0.141 | 0.537 ± 0.215 | 0.0 ± 0.0 | 0.23 ± 0.117 | 0.537 ± 0.198 | 0.23 ± 0.144 | 0.307 ± 0.15 | 0.077 ± 0.077 | 0.153 ± 0.093 | 0.23 ± 0.134 | 0.0 ± 0.0
Asp | 3.299 ± 0.561 | 0.077 ± 0.088 | 6.062 ± 0.918 | 6.445 ± 0.926 | 3.146 ± 0.566 | 3.683 ± 0.578 | 0.997 ± 0.277 | 4.911 ± 0.528 | 5.678 ± 0.764 | 5.141 ± 0.768 | 1.995 ± 0.418 | 4.297 ± 0.623 | 1.228 ± 0.271 | 1.074 ± 0.239 | 1.918 ± 0.439 | 4.374 ± 0.633 | 3.069 ± 0.436 | 4.987 ± 0.524 | 0.537 ± 0.201 | 3.453 ± 0.547 | 0.0 ± 0.0
Glu | 4.987 ± 0.575 | 0.537 ± 0.188 | 3.53 ± 0.676 | 6.752 ± 1.003 | 3.99 ± 0.504 | 2.916 ± 0.49 | 0.844 ± 0.279 | 5.141 ± 0.715 | 6.215 ± 1.058 | 6.522 ± 0.726 | 2.762 ± 0.421 | 4.757 ± 0.676 | 1.381 ± 0.388 | 3.223 ± 0.404 | 4.834 ± 0.773 | 3.99 ± 0.66 | 2.839 ± 0.563 | 6.292 ± 1.085 | 0.844 ± 0.248 | 3.146 ± 0.479 | 0.0 ± 0.0
Phe | 2.225 ± 0.429 | 0.153 ± 0.113 | 2.762 ± 0.552 | 3.453 ± 0.589 | 1.918 ± 0.377 | 2.762 ± 0.447 | 0.46 ± 0.199 | 3.223 ± 0.525 | 4.45 ± 0.463 | 2.685 ± 0.535 | 1.074 ± 0.269 | 3.146 ± 0.483 | 0.844 ± 0.207 | 0.844 ± 0.257 | 1.304 ± 0.286 | 2.532 ± 0.443 | 2.379 ± 0.461 | 3.99 ± 0.685 | 0.23 ± 0.149 | 1.458 ± 0.406 | 0.0 ± 0.0
Gly | 2.916 ± 0.709 | 0.46 ± 0.157 | 2.762 ± 0.484 | 3.299 ± 0.565 | 2.762 ± 0.545 | 1.918 ± 0.386 | 0.767 ± 0.192 | 3.836 ± 0.52 | 4.987 ± 0.58 | 4.527 ± 0.659 | 1.228 ± 0.312 | 2.609 ± 0.533 | 0.307 ± 0.21 | 1.688 ± 0.374 | 2.072 ± 0.368 | 2.532 ± 0.344 | 3.299 ± 0.459 | 4.757 ± 0.71 | 0.537 ± 0.182 | 3.53 ± 0.481 | 0.0 ± 0.0
His | 0.537 ± 0.193 | 0.23 ± 0.138 | 0.921 ± 0.27 | 0.844 ± 0.34 | 0.844 ± 0.331 | 1.304 ± 0.373 | 0.153 ± 0.114 | 1.074 ± 0.373 | 1.228 ± 0.345 | 1.918 ± 0.447 | 0.23 ± 0.116 | 0.844 ± 0.26 | 0.46 ± 0.198 | 0.997 ± 0.276 | 0.307 ± 0.148 | 0.844 ± 0.248 | 0.921 ± 0.316 | 0.691 ± 0.225 | 0.23 ± 0.122 | 1.151 ± 0.336 | 0.0 ± 0.0
Ile | 4.834 ± 0.611 | 0.384 ± 0.151 | 6.599 ± 0.603 | 5.524 ± 0.761 | 2.685 ± 0.422 | 3.836 ± 0.759 | 0.921 ± 0.3 | 4.374 ± 0.525 | 8.517 ± 0.848 | 4.527 ± 0.592 | 0.997 ± 0.361 | 4.987 ± 0.477 | 2.992 ± 0.426 | 2.302 ± 0.457 | 2.379 ± 0.405 | 4.757 ± 0.55 | 4.757 ± 0.65 | 4.834 ± 0.444 | 0.997 ± 0.272 | 3.453 ± 0.576 | 0.0 ± 0.0
Lys | 6.982 ± 0.562 | 0.077 ± 0.068 | 6.752 ± 0.836 | 7.059 ± 1.347 | 3.299 ± 0.444 | 4.757 ± 0.683 | 1.611 ± 0.357 | 7.366 ± 0.592 | 8.287 ± 1.208 | 7.366 ± 0.801 | 2.839 ± 0.491 | 4.834 ± 0.539 | 2.992 ± 0.849 | 5.141 ± 0.562 | 3.836 ± 0.654 | 5.371 ± 0.797 | 5.755 ± 0.828 | 5.601 ± 0.593 | 0.844 ± 0.242 | 4.143 ± 0.754 | 0.0 ± 0.0
Leu | 4.911 ± 0.489 | 0.23 ± 0.141 | 5.908 ± 0.72 | 6.522 ± 0.707 | 3.146 ± 0.575 | 3.99 ± 0.617 | 1.228 ± 0.246 | 6.062 ± 0.624 | 7.366 ± 0.685 | 6.675 ± 0.78 | 2.302 ± 0.444 | 6.215 ± 0.801 | 2.225 ± 0.466 | 2.685 ± 0.418 | 3.453 ± 0.597 | 5.141 ± 0.806 | 6.292 ± 0.719 | 3.913 ± 0.544 | 1.151 ± 0.333 | 2.379 ± 0.476 | 0.0 ± 0.0
Met | 1.995 ± 0.367 | 0.077 ± 0.078 | 1.458 ± 0.365 | 1.841 ± 0.395 | 0.844 ± 0.258 | 0.921 ± 0.242 | 0.384 ± 0.162 | 1.841 ± 0.371 | 2.302 ± 0.426 | 2.148 ± 0.349 | 0.614 ± 0.2 | 0.997 ± 0.296 | 0.691 ± 0.277 | 0.844 ± 0.259 | 0.844 ± 0.283 | 1.688 ± 0.366 | 1.688 ± 0.442 | 1.151 ± 0.294 | 0.307 ± 0.123 | 0.997 ± 0.28 | 0.0 ± 0.0
Asn | 4.68 ± 0.712 | 0.46 ± 0.178 | 4.527 ± 0.8 | 4.527 ± 0.643 | 2.148 ± 0.333 | 4.987 ± 0.67 | 0.767 ± 0.191 | 2.839 ± 0.434 | 6.675 ± 0.758 | 5.831 ± 0.698 | 1.458 ± 0.285 | 4.067 ± 0.688 | 2.072 ± 0.413 | 1.688 ± 0.341 | 2.455 ± 0.409 | 4.22 ± 0.583 | 4.067 ± 0.557 | 3.913 ± 0.545 | 1.074 ± 0.334 | 2.455 ± 0.468 | 0.0 ± 0.0
Pro | 1.381 ± 0.348 | 0.307 ± 0.151 | 1.458 ± 0.365 | 1.381 ± 0.353 | 1.074 ± 0.307 | 0.614 ± 0.247 | 0.537 ± 0.19 | 1.841 ± 0.524 | 3.376 ± 0.709 | 1.458 ± 0.292 | 0.844 ± 0.232 | 2.072 ± 0.329 | 0.691 ± 0.217 | 0.997 ± 0.28 | 0.691 ± 0.234 | 1.918 ± 0.454 | 1.304 ± 0.304 | 1.918 ± 0.421 | 0.0 ± 0.0 | 0.997 ± 0.295 | 0.0 ± 0.0
Gln | 2.762 ± 0.455 | 0.23 ± 0.121 | 1.765 ± 0.393 | 1.535 ± 0.383 | 1.688 ± 0.353 | 1.304 ± 0.321 | 0.614 ± 0.218 | 2.455 ± 0.428 | 3.836 ± 0.476 | 3.299 ± 0.567 | 0.23 ± 0.114 | 3.146 ± 0.424 | 0.307 ± 0.171 | 1.611 ± 0.445 | 1.995 ± 0.343 | 2.225 ± 0.4 | 2.992 ± 0.497 | 2.379 ± 0.46 | 0.077 ± 0.088 | 1.381 ± 0.295 | 0.0 ± 0.0
Arg | 2.225 ± 0.425 | 0.23 ± 0.143 | 2.532 ± 0.462 | 2.302 ± 0.391 | 1.765 ± 0.336 | 1.765 ± 0.303 | 1.228 ± 0.342 | 3.76 ± 0.536 | 3.146 ± 0.561 | 4.604 ± 0.589 | 0.921 ± 0.215 | 1.995 ± 0.336 | 0.921 ± 0.309 | 1.611 ± 0.356 | 1.841 ± 0.359 | 2.148 ± 0.438 | 2.225 ± 0.496 | 2.302 ± 0.39 | 0.46 ± 0.184 | 2.609 ± 0.379 | 0.0 ± 0.0
Ser | 3.683 ± 0.545 | 0.307 ± 0.165 | 3.069 ± 0.493 | 4.604 ± 0.561 | 1.918 ± 0.526 | 3.683 ± 0.546 | 0.997 ± 0.293 | 5.831 ± 0.916 | 5.601 ± 0.705 | 5.678 ± 0.692 | 1.458 ± 0.287 | 4.143 ± 0.553 | 1.688 ± 0.343 | 1.841 ± 0.38 | 2.992 ± 0.504 | 3.299 ± 0.632 | 3.606 ± 0.606 | 3.453 ± 0.516 | 0.46 ± 0.176 | 2.302 ± 0.445 | 0.0 ± 0.0
Thr | 3.53 ± 0.487 | 0.153 ± 0.113 | 3.146 ± 0.53 | 3.683 ± 0.647 | 2.379 ± 0.465 | 3.376 ± 0.543 | 1.535 ± 0.263 | 5.448 ± 0.673 | 5.831 ± 0.677 | 5.218 ± 0.594 | 0.921 ± 0.268 | 3.069 ± 0.585 | 2.072 ± 0.398 | 2.148 ± 0.376 | 2.685 ± 0.483 | 3.453 ± 0.495 | 4.45 ± 0.66 | 4.527 ± 0.729 | 0.844 ± 0.277 | 2.762 ± 0.485 | 0.0 ± 0.0
Val | 4.297 ± 0.601 | 0.153 ± 0.125 | 5.064 ± 0.543 | 6.368 ± 1.005 | 2.532 ± 0.539 | 3.223 ± 0.549 | 0.844 ± 0.195 | 4.987 ± 0.51 | 4.834 ± 0.515 | 4.987 ± 0.538 | 1.151 ± 0.342 | 4.68 ± 0.494 | 2.148 ± 0.399 | 2.148 ± 0.308 | 1.765 ± 0.312 | 5.448 ± 0.528 | 3.99 ± 0.622 | 4.604 ± 0.704 | 0.767 ± 0.199 | 2.532 ± 0.446 | 0.0 ± 0.0
Trp | 0.844 ± 0.276 | 0.23 ± 0.129 | 0.614 ± 0.238 | 0.537 ± 0.184 | 0.614 ± 0.237 | 0.384 ± 0.183 | 0.077 ± 0.07 | 0.691 ± 0.217 | 0.844 ± 0.299 | 0.614 ± 0.166 | 0.077 ± 0.076 | 1.228 ± 0.549 | 0.077 ± 0.072 | 0.384 ± 0.228 | 0.384 ± 0.142 | 1.228 ± 0.32 | 0.691 ± 0.229 | 0.844 ± 0.231 | 0.0 ± 0.0 | 0.537 ± 0.204 | 0.0 ± 0.0
Tyr | 2.379 ± 0.426 | 0.077 ± 0.086 | 3.146 ± 0.54 | 3.453 ± 0.579 | 2.148 ± 0.486 | 2.379 ± 0.455 | 0.767 ± 0.343 | 2.762 ± 0.548 | 5.218 ± 0.7 | 2.532 ± 0.412 | 0.921 ± 0.297 | 2.839 ± 0.438 | 1.074 ± 0.283 | 1.995 ± 0.388 | 1.918 ± 0.345 | 2.379 ± 0.426 | 2.379 ± 0.43 | 2.532 ± 0.365 | 0.844 ± 0.28 | 1.995 ± 0.434 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0
Statistics based on 70 proteins (13034 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
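A protein-level bootstrap of this kind could look roughly like the sketch below. It is an illustration under stated assumptions, not the procedure used to produce the values above: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation across replicates is reported as the error. The function names `dipeptide_counts` and `bootstrap_error`, the fixed random seed, and the choice of the standard deviation as the error measure are all hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate draws len(proteins) proteins with replacement, so the
    resampling unit is the protein, not the individual dipeptide.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)
```

Resampling whole proteins rather than single dipeptides keeps the correlation between dipeptides within a protein intact, which is one reason an error estimated this way can be large for a small proteome.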

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski