Amino acid dipepetide frequency for Pseudomonas phage KPP23

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.693AlaAla: 11.693 ± 1.302
1.382AlaCys: 1.382 ± 0.313
5.687AlaAsp: 5.687 ± 0.589
9.886AlaGlu: 9.886 ± 0.889
5.102AlaPhe: 5.102 ± 0.572
7.122AlaGly: 7.122 ± 0.818
1.594AlaHis: 1.594 ± 0.318
4.943AlaIle: 4.943 ± 0.551
6.59AlaLys: 6.59 ± 0.803
8.929AlaLeu: 8.929 ± 0.736
2.498AlaMet: 2.498 ± 0.342
3.508AlaAsn: 3.508 ± 0.495
4.996AlaPro: 4.996 ± 0.514
4.358AlaGln: 4.358 ± 0.523
7.388AlaArg: 7.388 ± 0.703
4.411AlaSer: 4.411 ± 0.475
5.315AlaThr: 5.315 ± 0.622
6.165AlaVal: 6.165 ± 0.51
1.169AlaTrp: 1.169 ± 0.253
2.87AlaTyr: 2.87 ± 0.352
0.0AlaXaa: 0.0 ± 0.0
Cys
0.85CysAla: 0.85 ± 0.243
0.478CysCys: 0.478 ± 0.19
0.691CysAsp: 0.691 ± 0.187
1.222CysGlu: 1.222 ± 0.314
0.319CysPhe: 0.319 ± 0.105
0.744CysGly: 0.744 ± 0.226
0.319CysHis: 0.319 ± 0.139
0.159CysIle: 0.159 ± 0.097
0.585CysLys: 0.585 ± 0.187
0.638CysLeu: 0.638 ± 0.198
0.159CysMet: 0.159 ± 0.078
0.372CysAsn: 0.372 ± 0.127
0.691CysPro: 0.691 ± 0.184
0.372CysGln: 0.372 ± 0.125
1.116CysArg: 1.116 ± 0.269
1.116CysSer: 1.116 ± 0.315
0.372CysThr: 0.372 ± 0.13
0.585CysVal: 0.585 ± 0.191
0.159CysTrp: 0.159 ± 0.085
0.159CysTyr: 0.159 ± 0.103
0.0CysXaa: 0.0 ± 0.0
Asp
6.697AspAla: 6.697 ± 0.477
0.691AspCys: 0.691 ± 0.214
4.092AspAsp: 4.092 ± 1.054
4.624AspGlu: 4.624 ± 0.575
2.445AspPhe: 2.445 ± 0.387
4.837AspGly: 4.837 ± 0.448
0.691AspHis: 0.691 ± 0.189
2.817AspIle: 2.817 ± 0.338
2.285AspLys: 2.285 ± 0.413
5.315AspLeu: 5.315 ± 0.535
0.904AspMet: 0.904 ± 0.198
1.913AspAsn: 1.913 ± 0.308
4.146AspPro: 4.146 ± 0.486
1.541AspGln: 1.541 ± 0.273
4.146AspArg: 4.146 ± 0.538
3.295AspSer: 3.295 ± 0.464
2.657AspThr: 2.657 ± 0.372
2.392AspVal: 2.392 ± 0.401
0.531AspTrp: 0.531 ± 0.18
2.126AspTyr: 2.126 ± 0.323
0.0AspXaa: 0.0 ± 0.0
Glu
8.185GluAla: 8.185 ± 0.935
0.478GluCys: 0.478 ± 0.186
3.986GluAsp: 3.986 ± 0.426
5.421GluGlu: 5.421 ± 0.713
2.657GluPhe: 2.657 ± 0.341
4.677GluGly: 4.677 ± 0.534
1.063GluHis: 1.063 ± 0.251
4.358GluIle: 4.358 ± 0.547
4.89GluLys: 4.89 ± 0.634
5.953GluLeu: 5.953 ± 0.523
1.967GluMet: 1.967 ± 0.301
2.657GluAsn: 2.657 ± 0.365
2.551GluPro: 2.551 ± 0.396
3.083GluGln: 3.083 ± 0.426
5.953GluArg: 5.953 ± 0.614
3.72GluSer: 3.72 ± 0.446
3.402GluThr: 3.402 ± 0.398
4.624GluVal: 4.624 ± 0.465
1.116GluTrp: 1.116 ± 0.29
2.339GluTyr: 2.339 ± 0.43
0.0GluXaa: 0.0 ± 0.0
Phe
4.571PheAla: 4.571 ± 0.636
0.744PheCys: 0.744 ± 0.211
2.498PheAsp: 2.498 ± 0.27
2.232PheGlu: 2.232 ± 0.36
1.435PhePhe: 1.435 ± 0.259
3.561PheGly: 3.561 ± 0.493
0.691PheHis: 0.691 ± 0.27
2.392PheIle: 2.392 ± 0.319
1.807PheLys: 1.807 ± 0.358
2.657PheLeu: 2.657 ± 0.328
0.744PheMet: 0.744 ± 0.248
2.179PheAsn: 2.179 ± 0.31
2.126PhePro: 2.126 ± 0.296
1.488PheGln: 1.488 ± 0.305
2.657PheArg: 2.657 ± 0.343
3.029PheSer: 3.029 ± 0.444
2.498PheThr: 2.498 ± 0.417
2.711PheVal: 2.711 ± 0.44
0.531PheTrp: 0.531 ± 0.168
1.648PheTyr: 1.648 ± 0.34
0.0PheXaa: 0.0 ± 0.0
Gly
7.016GlyAla: 7.016 ± 0.641
0.797GlyCys: 0.797 ± 0.264
3.933GlyAsp: 3.933 ± 0.447
4.837GlyGlu: 4.837 ± 0.525
3.614GlyPhe: 3.614 ± 0.519
6.856GlyGly: 6.856 ± 0.817
1.382GlyHis: 1.382 ± 0.252
3.029GlyIle: 3.029 ± 0.412
4.624GlyLys: 4.624 ± 0.611
5.155GlyLeu: 5.155 ± 0.58
1.648GlyMet: 1.648 ± 0.315
2.285GlyAsn: 2.285 ± 0.398
2.392GlyPro: 2.392 ± 0.401
3.083GlyGln: 3.083 ± 0.472
5.846GlyArg: 5.846 ± 0.543
5.262GlySer: 5.262 ± 0.5
4.252GlyThr: 4.252 ± 0.649
5.474GlyVal: 5.474 ± 0.608
1.01GlyTrp: 1.01 ± 0.221
2.711GlyTyr: 2.711 ± 0.331
0.0GlyXaa: 0.0 ± 0.0
His
1.594HisAla: 1.594 ± 0.249
0.266HisCys: 0.266 ± 0.134
1.01HisAsp: 1.01 ± 0.239
0.585HisGlu: 0.585 ± 0.158
1.01HisPhe: 1.01 ± 0.215
1.01HisGly: 1.01 ± 0.259
0.213HisHis: 0.213 ± 0.117
0.797HisIle: 0.797 ± 0.23
1.116HisLys: 1.116 ± 0.257
0.957HisLeu: 0.957 ± 0.234
0.319HisMet: 0.319 ± 0.136
0.531HisAsn: 0.531 ± 0.186
0.957HisPro: 0.957 ± 0.237
0.266HisGln: 0.266 ± 0.109
0.957HisArg: 0.957 ± 0.291
0.744HisSer: 0.744 ± 0.178
0.85HisThr: 0.85 ± 0.219
0.691HisVal: 0.691 ± 0.194
0.0HisTrp: 0.0 ± 0.0
0.372HisTyr: 0.372 ± 0.148
0.0HisXaa: 0.0 ± 0.0
Ile
4.783IleAla: 4.783 ± 0.636
0.638IleCys: 0.638 ± 0.197
3.295IleAsp: 3.295 ± 0.365
3.561IleGlu: 3.561 ± 0.498
1.807IlePhe: 1.807 ± 0.298
3.348IleGly: 3.348 ± 0.506
0.744IleHis: 0.744 ± 0.199
2.232IleIle: 2.232 ± 0.327
2.339IleLys: 2.339 ± 0.387
3.614IleLeu: 3.614 ± 0.456
1.01IleMet: 1.01 ± 0.194
2.02IleAsn: 2.02 ± 0.347
2.764IlePro: 2.764 ± 0.473
2.551IleGln: 2.551 ± 0.382
3.508IleArg: 3.508 ± 0.585
2.551IleSer: 2.551 ± 0.322
3.136IleThr: 3.136 ± 0.376
2.498IleVal: 2.498 ± 0.407
0.904IleTrp: 0.904 ± 0.233
1.276IleTyr: 1.276 ± 0.244
0.0IleXaa: 0.0 ± 0.0
Lys
7.228LysAla: 7.228 ± 1.008
0.425LysCys: 0.425 ± 0.144
2.551LysAsp: 2.551 ± 0.378
3.189LysGlu: 3.189 ± 0.485
2.392LysPhe: 2.392 ± 0.348
3.88LysGly: 3.88 ± 0.645
0.85LysHis: 0.85 ± 0.204
2.445LysIle: 2.445 ± 0.413
3.295LysLys: 3.295 ± 0.499
4.039LysLeu: 4.039 ± 0.592
1.116LysMet: 1.116 ± 0.247
1.967LysAsn: 1.967 ± 0.343
2.657LysPro: 2.657 ± 0.423
1.86LysGln: 1.86 ± 0.329
3.508LysArg: 3.508 ± 0.548
3.455LysSer: 3.455 ± 0.422
3.402LysThr: 3.402 ± 0.407
3.029LysVal: 3.029 ± 0.384
0.531LysTrp: 0.531 ± 0.175
1.435LysTyr: 1.435 ± 0.344
0.0LysXaa: 0.0 ± 0.0
Leu
9.088LeuAla: 9.088 ± 0.671
0.957LeuCys: 0.957 ± 0.257
6.006LeuAsp: 6.006 ± 0.573
5.9LeuGlu: 5.9 ± 0.522
3.667LeuPhe: 3.667 ± 0.441
5.953LeuGly: 5.953 ± 0.668
1.01LeuHis: 1.01 ± 0.238
3.402LeuIle: 3.402 ± 0.306
3.029LeuLys: 3.029 ± 0.358
6.484LeuLeu: 6.484 ± 0.629
1.86LeuMet: 1.86 ± 0.347
3.242LeuAsn: 3.242 ± 0.512
3.614LeuPro: 3.614 ± 0.436
3.242LeuGln: 3.242 ± 0.418
6.803LeuArg: 6.803 ± 0.669
5.581LeuSer: 5.581 ± 0.484
4.943LeuThr: 4.943 ± 0.55
4.465LeuVal: 4.465 ± 0.45
0.797LeuTrp: 0.797 ± 0.231
1.86LeuTyr: 1.86 ± 0.31
0.0LeuXaa: 0.0 ± 0.0
Met
2.285MetAla: 2.285 ± 0.289
0.106MetCys: 0.106 ± 0.072
1.382MetAsp: 1.382 ± 0.242
1.01MetGlu: 1.01 ± 0.212
0.691MetPhe: 0.691 ± 0.174
1.86MetGly: 1.86 ± 0.309
0.372MetHis: 0.372 ± 0.147
0.797MetIle: 0.797 ± 0.19
1.329MetLys: 1.329 ± 0.283
1.967MetLeu: 1.967 ± 0.379
0.319MetMet: 0.319 ± 0.136
1.541MetAsn: 1.541 ± 0.286
1.063MetPro: 1.063 ± 0.255
1.116MetGln: 1.116 ± 0.224
1.01MetArg: 1.01 ± 0.225
1.382MetSer: 1.382 ± 0.258
1.488MetThr: 1.488 ± 0.255
0.85MetVal: 0.85 ± 0.204
0.106MetTrp: 0.106 ± 0.075
0.425MetTyr: 0.425 ± 0.169
0.0MetXaa: 0.0 ± 0.0
Asn
3.986AsnAla: 3.986 ± 0.577
0.319AsnCys: 0.319 ± 0.125
0.904AsnAsp: 0.904 ± 0.231
2.126AsnGlu: 2.126 ± 0.264
1.276AsnPhe: 1.276 ± 0.264
4.465AsnGly: 4.465 ± 0.549
0.585AsnHis: 0.585 ± 0.232
1.382AsnIle: 1.382 ± 0.303
1.701AsnLys: 1.701 ± 0.274
2.657AsnLeu: 2.657 ± 0.395
1.116AsnMet: 1.116 ± 0.296
1.807AsnAsn: 1.807 ± 0.339
2.604AsnPro: 2.604 ± 0.301
2.073AsnGln: 2.073 ± 0.475
2.498AsnArg: 2.498 ± 0.382
2.764AsnSer: 2.764 ± 0.3
2.126AsnThr: 2.126 ± 0.347
2.498AsnVal: 2.498 ± 0.323
0.691AsnTrp: 0.691 ± 0.159
1.063AsnTyr: 1.063 ± 0.267
0.0AsnXaa: 0.0 ± 0.0
Pro
4.518ProAla: 4.518 ± 0.476
0.425ProCys: 0.425 ± 0.149
3.189ProAsp: 3.189 ± 0.43
4.89ProGlu: 4.89 ± 0.616
1.967ProPhe: 1.967 ± 0.359
4.039ProGly: 4.039 ± 0.431
0.213ProHis: 0.213 ± 0.087
3.136ProIle: 3.136 ± 0.497
3.029ProLys: 3.029 ± 0.553
3.189ProLeu: 3.189 ± 0.373
0.585ProMet: 0.585 ± 0.166
1.594ProAsn: 1.594 ± 0.267
1.648ProPro: 1.648 ± 0.333
1.276ProGln: 1.276 ± 0.318
2.657ProArg: 2.657 ± 0.43
2.817ProSer: 2.817 ± 0.311
2.232ProThr: 2.232 ± 0.284
3.986ProVal: 3.986 ± 0.446
0.531ProTrp: 0.531 ± 0.148
1.594ProTyr: 1.594 ± 0.313
0.0ProXaa: 0.0 ± 0.0
Gln
4.305GlnAla: 4.305 ± 0.557
0.425GlnCys: 0.425 ± 0.128
1.754GlnAsp: 1.754 ± 0.365
2.445GlnGlu: 2.445 ± 0.374
1.488GlnPhe: 1.488 ± 0.222
2.604GlnGly: 2.604 ± 0.504
0.478GlnHis: 0.478 ± 0.169
2.285GlnIle: 2.285 ± 0.347
2.073GlnLys: 2.073 ± 0.381
3.508GlnLeu: 3.508 ± 0.456
1.169GlnMet: 1.169 ± 0.265
1.967GlnAsn: 1.967 ± 0.384
1.488GlnPro: 1.488 ± 0.493
1.648GlnGln: 1.648 ± 0.507
2.179GlnArg: 2.179 ± 0.287
2.339GlnSer: 2.339 ± 0.341
2.87GlnThr: 2.87 ± 0.391
2.232GlnVal: 2.232 ± 0.355
0.213GlnTrp: 0.213 ± 0.096
0.904GlnTyr: 0.904 ± 0.213
0.0GlnXaa: 0.0 ± 0.0
Arg
6.697ArgAla: 6.697 ± 0.723
0.531ArgCys: 0.531 ± 0.184
4.146ArgAsp: 4.146 ± 0.508
6.165ArgGlu: 6.165 ± 0.683
3.348ArgPhe: 3.348 ± 0.394
4.305ArgGly: 4.305 ± 0.686
0.638ArgHis: 0.638 ± 0.177
4.411ArgIle: 4.411 ± 0.417
4.358ArgLys: 4.358 ± 0.634
6.537ArgLeu: 6.537 ± 0.633
1.86ArgMet: 1.86 ± 0.349
2.976ArgAsn: 2.976 ± 0.415
2.551ArgPro: 2.551 ± 0.386
2.604ArgGln: 2.604 ± 0.403
5.262ArgArg: 5.262 ± 1.091
3.72ArgSer: 3.72 ± 0.505
3.508ArgThr: 3.508 ± 0.486
4.89ArgVal: 4.89 ± 0.536
0.904ArgTrp: 0.904 ± 0.204
2.126ArgTyr: 2.126 ± 0.359
0.0ArgXaa: 0.0 ± 0.0
Ser
6.909SerAla: 6.909 ± 0.622
0.744SerCys: 0.744 ± 0.182
3.402SerAsp: 3.402 ± 0.446
3.667SerGlu: 3.667 ± 0.521
1.967SerPhe: 1.967 ± 0.319
5.953SerGly: 5.953 ± 0.691
0.744SerHis: 0.744 ± 0.202
2.339SerIle: 2.339 ± 0.353
3.029SerLys: 3.029 ± 0.374
4.783SerLeu: 4.783 ± 0.454
0.478SerMet: 0.478 ± 0.19
2.339SerAsn: 2.339 ± 0.366
3.455SerPro: 3.455 ± 0.419
2.498SerGln: 2.498 ± 0.335
4.465SerArg: 4.465 ± 0.595
3.455SerSer: 3.455 ± 0.502
2.817SerThr: 2.817 ± 0.426
4.092SerVal: 4.092 ± 0.392
1.169SerTrp: 1.169 ± 0.283
1.329SerTyr: 1.329 ± 0.249
0.0SerXaa: 0.0 ± 0.0
Thr
5.421ThrAla: 5.421 ± 0.637
0.266ThrCys: 0.266 ± 0.111
3.614ThrAsp: 3.614 ± 0.481
4.89ThrGlu: 4.89 ± 0.433
1.967ThrPhe: 1.967 ± 0.332
4.305ThrGly: 4.305 ± 0.517
0.585ThrHis: 0.585 ± 0.177
2.657ThrIle: 2.657 ± 0.394
2.711ThrLys: 2.711 ± 0.362
6.431ThrLeu: 6.431 ± 0.535
0.85ThrMet: 0.85 ± 0.217
1.701ThrAsn: 1.701 ± 0.293
2.817ThrPro: 2.817 ± 0.42
1.594ThrGln: 1.594 ± 0.281
3.614ThrArg: 3.614 ± 0.518
3.029ThrSer: 3.029 ± 0.4
2.923ThrThr: 2.923 ± 0.495
3.986ThrVal: 3.986 ± 0.539
0.585ThrTrp: 0.585 ± 0.149
1.754ThrTyr: 1.754 ± 0.312
0.0ThrXaa: 0.0 ± 0.0
Val
5.315ValAla: 5.315 ± 0.695
0.585ValCys: 0.585 ± 0.172
3.402ValAsp: 3.402 ± 0.384
3.614ValGlu: 3.614 ± 0.467
2.551ValPhe: 2.551 ± 0.334
3.189ValGly: 3.189 ± 0.352
1.116ValHis: 1.116 ± 0.311
3.508ValIle: 3.508 ± 0.479
2.976ValLys: 2.976 ± 0.431
5.368ValLeu: 5.368 ± 0.474
1.594ValMet: 1.594 ± 0.268
2.445ValAsn: 2.445 ± 0.388
2.976ValPro: 2.976 ± 0.412
2.339ValGln: 2.339 ± 0.395
4.252ValArg: 4.252 ± 0.551
3.986ValSer: 3.986 ± 0.453
3.88ValThr: 3.88 ± 0.517
4.146ValVal: 4.146 ± 0.511
1.276ValTrp: 1.276 ± 0.251
2.498ValTyr: 2.498 ± 0.416
0.0ValXaa: 0.0 ± 0.0
Trp
1.648TrpAla: 1.648 ± 0.354
0.213TrpCys: 0.213 ± 0.101
1.01TrpAsp: 1.01 ± 0.229
0.638TrpGlu: 0.638 ± 0.182
0.691TrpPhe: 0.691 ± 0.165
0.638TrpGly: 0.638 ± 0.181
0.266TrpHis: 0.266 ± 0.095
0.638TrpIle: 0.638 ± 0.167
0.585TrpLys: 0.585 ± 0.159
1.01TrpLeu: 1.01 ± 0.226
0.159TrpMet: 0.159 ± 0.083
0.531TrpAsn: 0.531 ± 0.159
0.478TrpPro: 0.478 ± 0.155
0.372TrpGln: 0.372 ± 0.12
0.904TrpArg: 0.904 ± 0.26
0.691TrpSer: 0.691 ± 0.213
1.116TrpThr: 1.116 ± 0.257
0.638TrpVal: 0.638 ± 0.203
0.159TrpTrp: 0.159 ± 0.093
0.478TrpTyr: 0.478 ± 0.194
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.817TyrAla: 2.817 ± 0.379
0.638TyrCys: 0.638 ± 0.192
1.594TyrAsp: 1.594 ± 0.315
2.179TyrGlu: 2.179 ± 0.338
1.807TyrPhe: 1.807 ± 0.365
1.86TyrGly: 1.86 ± 0.359
0.691TyrHis: 0.691 ± 0.183
0.85TyrIle: 0.85 ± 0.25
0.904TyrLys: 0.904 ± 0.221
2.711TyrLeu: 2.711 ± 0.42
0.691TyrMet: 0.691 ± 0.158
1.116TyrAsn: 1.116 ± 0.237
1.541TyrPro: 1.541 ± 0.298
1.01TyrGln: 1.01 ± 0.247
2.817TyrArg: 2.817 ± 0.367
2.285TyrSer: 2.285 ± 0.373
1.913TyrThr: 1.913 ± 0.344
1.063TyrVal: 1.063 ± 0.256
0.478TyrTrp: 0.478 ± 0.166
1.329TyrTyr: 1.329 ± 0.26
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 95 proteins (18816 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski