Amino acid dipepetide frequency for Pseudomonas phage KPP22

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
12.831AlaAla: 12.831 ± 0.724
1.026AlaCys: 1.026 ± 0.318
5.646AlaAsp: 5.646 ± 0.542
8.161AlaGlu: 8.161 ± 1.176
3.233AlaPhe: 3.233 ± 0.304
7.596AlaGly: 7.596 ± 0.709
1.54AlaHis: 1.54 ± 0.358
6.416AlaIle: 6.416 ± 0.567
6.005AlaLys: 6.005 ± 0.65
6.98AlaLeu: 6.98 ± 0.642
2.361AlaMet: 2.361 ± 0.294
4.67AlaAsn: 4.67 ± 0.669
4.517AlaPro: 4.517 ± 0.606
4.106AlaGln: 4.106 ± 0.426
5.954AlaArg: 5.954 ± 0.571
5.646AlaSer: 5.646 ± 0.477
4.978AlaThr: 4.978 ± 0.583
6.108AlaVal: 6.108 ± 0.572
1.386AlaTrp: 1.386 ± 0.246
2.464AlaTyr: 2.464 ± 0.346
0.0AlaXaa: 0.0 ± 0.0
Cys
1.129CysAla: 1.129 ± 0.269
0.205CysCys: 0.205 ± 0.095
0.77CysAsp: 0.77 ± 0.213
0.616CysGlu: 0.616 ± 0.223
0.462CysPhe: 0.462 ± 0.167
0.462CysGly: 0.462 ± 0.159
0.051CysHis: 0.051 ± 0.058
0.257CysIle: 0.257 ± 0.094
0.462CysLys: 0.462 ± 0.158
0.616CysLeu: 0.616 ± 0.202
0.154CysMet: 0.154 ± 0.084
0.411CysAsn: 0.411 ± 0.143
0.667CysPro: 0.667 ± 0.17
0.103CysGln: 0.103 ± 0.075
0.513CysArg: 0.513 ± 0.187
0.411CysSer: 0.411 ± 0.184
0.924CysThr: 0.924 ± 0.261
0.77CysVal: 0.77 ± 0.22
0.103CysTrp: 0.103 ± 0.081
0.308CysTyr: 0.308 ± 0.146
0.0CysXaa: 0.0 ± 0.0
Asp
4.824AspAla: 4.824 ± 0.58
0.565AspCys: 0.565 ± 0.178
3.849AspAsp: 3.849 ± 0.48
5.132AspGlu: 5.132 ± 0.57
3.182AspPhe: 3.182 ± 0.38
6.518AspGly: 6.518 ± 0.558
1.18AspHis: 1.18 ± 0.232
3.644AspIle: 3.644 ± 0.508
3.079AspLys: 3.079 ± 0.386
4.876AspLeu: 4.876 ± 0.55
1.796AspMet: 1.796 ± 0.269
1.642AspAsn: 1.642 ± 0.33
4.055AspPro: 4.055 ± 0.553
2.464AspGln: 2.464 ± 0.437
2.925AspArg: 2.925 ± 0.365
3.593AspSer: 3.593 ± 0.502
2.874AspThr: 2.874 ± 0.337
3.644AspVal: 3.644 ± 0.401
1.488AspTrp: 1.488 ± 0.276
1.642AspTyr: 1.642 ± 0.284
0.0AspXaa: 0.0 ± 0.0
Glu
7.596GluAla: 7.596 ± 0.778
0.411GluCys: 0.411 ± 0.166
3.849GluAsp: 3.849 ± 0.488
5.954GluGlu: 5.954 ± 0.992
2.669GluPhe: 2.669 ± 0.38
4.003GluGly: 4.003 ± 0.758
2.207GluHis: 2.207 ± 0.437
4.67GluIle: 4.67 ± 0.482
4.568GluLys: 4.568 ± 0.95
6.056GluLeu: 6.056 ± 0.64
1.95GluMet: 1.95 ± 0.368
2.258GluAsn: 2.258 ± 0.434
2.925GluPro: 2.925 ± 0.363
3.131GluGln: 3.131 ± 0.498
4.978GluArg: 4.978 ± 0.656
3.079GluSer: 3.079 ± 0.422
2.977GluThr: 2.977 ± 0.471
3.901GluVal: 3.901 ± 0.531
0.77GluTrp: 0.77 ± 0.194
2.412GluTyr: 2.412 ± 0.364
0.0GluXaa: 0.0 ± 0.0
Phe
3.849PheAla: 3.849 ± 0.504
0.257PheCys: 0.257 ± 0.106
3.593PheAsp: 3.593 ± 0.459
2.669PheGlu: 2.669 ± 0.483
1.437PhePhe: 1.437 ± 0.321
3.644PheGly: 3.644 ± 0.478
0.616PheHis: 0.616 ± 0.178
2.207PheIle: 2.207 ± 0.3
1.95PheLys: 1.95 ± 0.38
2.823PheLeu: 2.823 ± 0.425
0.924PheMet: 0.924 ± 0.205
2.156PheAsn: 2.156 ± 0.401
1.334PhePro: 1.334 ± 0.235
1.437PheGln: 1.437 ± 0.28
2.258PheArg: 2.258 ± 0.329
2.258PheSer: 2.258 ± 0.328
1.899PheThr: 1.899 ± 0.325
2.823PheVal: 2.823 ± 0.436
0.719PheTrp: 0.719 ± 0.205
1.232PheTyr: 1.232 ± 0.232
0.0PheXaa: 0.0 ± 0.0
Gly
6.518GlyAla: 6.518 ± 0.713
0.513GlyCys: 0.513 ± 0.179
4.363GlyAsp: 4.363 ± 0.434
5.338GlyGlu: 5.338 ± 0.502
2.925GlyPhe: 2.925 ± 0.375
4.619GlyGly: 4.619 ± 0.771
1.18GlyHis: 1.18 ± 0.241
4.67GlyIle: 4.67 ± 0.483
3.285GlyLys: 3.285 ± 0.478
5.03GlyLeu: 5.03 ± 0.581
1.796GlyMet: 1.796 ± 0.27
2.925GlyAsn: 2.925 ± 0.732
2.669GlyPro: 2.669 ± 0.368
2.72GlyGln: 2.72 ± 0.373
5.132GlyArg: 5.132 ± 0.454
5.132GlySer: 5.132 ± 0.757
4.363GlyThr: 4.363 ± 0.434
5.543GlyVal: 5.543 ± 0.481
2.104GlyTrp: 2.104 ± 0.3
2.207GlyTyr: 2.207 ± 0.315
0.0GlyXaa: 0.0 ± 0.0
His
1.129HisAla: 1.129 ± 0.257
0.257HisCys: 0.257 ± 0.139
1.18HisAsp: 1.18 ± 0.296
1.232HisGlu: 1.232 ± 0.248
0.719HisPhe: 0.719 ± 0.167
1.54HisGly: 1.54 ± 0.335
0.411HisHis: 0.411 ± 0.163
0.924HisIle: 0.924 ± 0.247
0.719HisLys: 0.719 ± 0.21
1.232HisLeu: 1.232 ± 0.267
0.616HisMet: 0.616 ± 0.158
0.616HisAsn: 0.616 ± 0.179
0.975HisPro: 0.975 ± 0.227
0.719HisGln: 0.719 ± 0.201
1.54HisArg: 1.54 ± 0.318
1.18HisSer: 1.18 ± 0.262
0.873HisThr: 0.873 ± 0.241
0.975HisVal: 0.975 ± 0.212
0.308HisTrp: 0.308 ± 0.121
0.616HisTyr: 0.616 ± 0.176
0.0HisXaa: 0.0 ± 0.0
Ile
5.132IleAla: 5.132 ± 0.475
0.873IleCys: 0.873 ± 0.23
4.157IleAsp: 4.157 ± 0.448
3.593IleGlu: 3.593 ± 0.473
1.899IlePhe: 1.899 ± 0.339
3.901IleGly: 3.901 ± 0.547
1.232IleHis: 1.232 ± 0.27
3.644IleIle: 3.644 ± 0.433
3.336IleLys: 3.336 ± 0.537
4.055IleLeu: 4.055 ± 0.54
2.207IleMet: 2.207 ± 0.347
3.079IleAsn: 3.079 ± 0.347
3.439IlePro: 3.439 ± 0.356
2.618IleGln: 2.618 ± 0.385
3.952IleArg: 3.952 ± 0.464
4.157IleSer: 4.157 ± 0.402
3.49IleThr: 3.49 ± 0.391
3.131IleVal: 3.131 ± 0.448
0.77IleTrp: 0.77 ± 0.245
1.591IleTyr: 1.591 ± 0.262
0.0IleXaa: 0.0 ± 0.0
Lys
7.288LysAla: 7.288 ± 0.989
0.359LysCys: 0.359 ± 0.193
2.977LysAsp: 2.977 ± 0.42
3.798LysGlu: 3.798 ± 0.72
2.31LysPhe: 2.31 ± 0.298
3.49LysGly: 3.49 ± 0.438
0.565LysHis: 0.565 ± 0.202
3.131LysIle: 3.131 ± 0.508
2.925LysLys: 2.925 ± 0.55
4.568LysLeu: 4.568 ± 0.521
1.591LysMet: 1.591 ± 0.328
1.899LysAsn: 1.899 ± 0.261
2.772LysPro: 2.772 ± 0.38
1.591LysGln: 1.591 ± 0.366
3.285LysArg: 3.285 ± 0.503
3.079LysSer: 3.079 ± 0.406
2.925LysThr: 2.925 ± 0.407
4.106LysVal: 4.106 ± 0.455
0.667LysTrp: 0.667 ± 0.192
1.591LysTyr: 1.591 ± 0.354
0.0LysXaa: 0.0 ± 0.0
Leu
7.699LeuAla: 7.699 ± 0.675
0.77LeuCys: 0.77 ± 0.197
5.492LeuAsp: 5.492 ± 0.545
5.132LeuGlu: 5.132 ± 0.585
2.464LeuPhe: 2.464 ± 0.311
4.927LeuGly: 4.927 ± 0.452
0.924LeuHis: 0.924 ± 0.234
4.003LeuIle: 4.003 ± 0.511
4.26LeuLys: 4.26 ± 0.472
5.081LeuLeu: 5.081 ± 0.6
1.386LeuMet: 1.386 ± 0.275
3.285LeuAsn: 3.285 ± 0.321
3.233LeuPro: 3.233 ± 0.456
2.823LeuGln: 2.823 ± 0.374
4.824LeuArg: 4.824 ± 0.535
5.286LeuSer: 5.286 ± 0.46
3.49LeuThr: 3.49 ± 0.351
5.286LeuVal: 5.286 ± 0.559
1.026LeuTrp: 1.026 ± 0.195
2.361LeuTyr: 2.361 ± 0.414
0.0LeuXaa: 0.0 ± 0.0
Met
2.412MetAla: 2.412 ± 0.36
0.154MetCys: 0.154 ± 0.077
1.334MetAsp: 1.334 ± 0.263
1.796MetGlu: 1.796 ± 0.322
1.129MetPhe: 1.129 ± 0.212
1.026MetGly: 1.026 ± 0.197
0.257MetHis: 0.257 ± 0.114
1.642MetIle: 1.642 ± 0.281
1.694MetLys: 1.694 ± 0.27
2.002MetLeu: 2.002 ± 0.298
0.359MetMet: 0.359 ± 0.132
1.232MetAsn: 1.232 ± 0.237
0.924MetPro: 0.924 ± 0.228
0.616MetGln: 0.616 ± 0.214
1.591MetArg: 1.591 ± 0.299
2.31MetSer: 2.31 ± 0.341
1.694MetThr: 1.694 ± 0.296
1.591MetVal: 1.591 ± 0.263
0.154MetTrp: 0.154 ± 0.115
0.667MetTyr: 0.667 ± 0.18
0.0MetXaa: 0.0 ± 0.0
Asn
4.773AsnAla: 4.773 ± 0.703
0.154AsnCys: 0.154 ± 0.098
2.566AsnAsp: 2.566 ± 0.325
2.053AsnGlu: 2.053 ± 0.279
1.848AsnPhe: 1.848 ± 0.376
3.439AsnGly: 3.439 ± 0.41
1.078AsnHis: 1.078 ± 0.219
2.31AsnIle: 2.31 ± 0.331
1.334AsnLys: 1.334 ± 0.311
3.747AsnLeu: 3.747 ± 0.398
0.873AsnMet: 0.873 ± 0.184
1.95AsnAsn: 1.95 ± 0.523
2.874AsnPro: 2.874 ± 0.396
1.642AsnGln: 1.642 ± 0.377
2.207AsnArg: 2.207 ± 0.321
3.079AsnSer: 3.079 ± 0.398
1.95AsnThr: 1.95 ± 0.555
2.874AsnVal: 2.874 ± 0.545
0.616AsnTrp: 0.616 ± 0.144
1.283AsnTyr: 1.283 ± 0.294
0.0AsnXaa: 0.0 ± 0.0
Pro
4.978ProAla: 4.978 ± 0.736
0.154ProCys: 0.154 ± 0.083
3.747ProAsp: 3.747 ± 0.445
3.695ProGlu: 3.695 ± 0.528
2.104ProPhe: 2.104 ± 0.367
3.695ProGly: 3.695 ± 0.469
0.975ProHis: 0.975 ± 0.237
1.95ProIle: 1.95 ± 0.331
2.104ProLys: 2.104 ± 0.37
2.258ProLeu: 2.258 ± 0.356
1.078ProMet: 1.078 ± 0.244
2.053ProAsn: 2.053 ± 0.331
2.207ProPro: 2.207 ± 0.472
1.899ProGln: 1.899 ± 0.26
2.207ProArg: 2.207 ± 0.357
2.925ProSer: 2.925 ± 0.353
2.823ProThr: 2.823 ± 0.33
3.131ProVal: 3.131 ± 0.41
0.667ProTrp: 0.667 ± 0.242
1.129ProTyr: 1.129 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
3.541GlnAla: 3.541 ± 0.379
0.616GlnCys: 0.616 ± 0.202
2.002GlnAsp: 2.002 ± 0.313
2.977GlnGlu: 2.977 ± 0.437
1.796GlnPhe: 1.796 ± 0.34
2.515GlnGly: 2.515 ± 0.406
0.667GlnHis: 0.667 ± 0.202
2.258GlnIle: 2.258 ± 0.314
2.258GlnLys: 2.258 ± 0.396
3.028GlnLeu: 3.028 ± 0.34
0.975GlnMet: 0.975 ± 0.232
2.31GlnAsn: 2.31 ± 0.397
1.437GlnPro: 1.437 ± 0.211
1.334GlnGln: 1.334 ± 0.218
2.361GlnArg: 2.361 ± 0.34
2.156GlnSer: 2.156 ± 0.402
2.053GlnThr: 2.053 ± 0.298
2.31GlnVal: 2.31 ± 0.432
0.308GlnTrp: 0.308 ± 0.123
1.078GlnTyr: 1.078 ± 0.245
0.0GlnXaa: 0.0 ± 0.0
Arg
6.108ArgAla: 6.108 ± 0.645
0.565ArgCys: 0.565 ± 0.163
3.644ArgAsp: 3.644 ± 0.413
4.363ArgGlu: 4.363 ± 0.598
2.618ArgPhe: 2.618 ± 0.391
3.747ArgGly: 3.747 ± 0.392
1.386ArgHis: 1.386 ± 0.336
4.568ArgIle: 4.568 ± 0.469
4.055ArgLys: 4.055 ± 0.516
5.338ArgLeu: 5.338 ± 0.537
1.18ArgMet: 1.18 ± 0.274
2.515ArgAsn: 2.515 ± 0.369
2.412ArgPro: 2.412 ± 0.372
2.464ArgGln: 2.464 ± 0.366
3.849ArgArg: 3.849 ± 0.459
3.028ArgSer: 3.028 ± 0.306
2.566ArgThr: 2.566 ± 0.409
3.541ArgVal: 3.541 ± 0.489
0.975ArgTrp: 0.975 ± 0.195
1.848ArgTyr: 1.848 ± 0.27
0.0ArgXaa: 0.0 ± 0.0
Ser
7.031SerAla: 7.031 ± 0.66
0.565SerCys: 0.565 ± 0.155
3.747SerAsp: 3.747 ± 0.422
3.285SerGlu: 3.285 ± 0.431
2.412SerPhe: 2.412 ± 0.382
4.311SerGly: 4.311 ± 0.623
0.873SerHis: 0.873 ± 0.239
3.49SerIle: 3.49 ± 0.387
3.593SerLys: 3.593 ± 0.452
4.363SerLeu: 4.363 ± 0.496
1.745SerMet: 1.745 ± 0.303
2.002SerAsn: 2.002 ± 0.369
3.079SerPro: 3.079 ± 0.338
2.258SerGln: 2.258 ± 0.411
4.157SerArg: 4.157 ± 0.492
4.414SerSer: 4.414 ± 0.582
3.387SerThr: 3.387 ± 0.472
4.363SerVal: 4.363 ± 0.485
0.821SerTrp: 0.821 ± 0.231
2.31SerTyr: 2.31 ± 0.296
0.0SerXaa: 0.0 ± 0.0
Thr
4.824ThrAla: 4.824 ± 0.525
0.462ThrCys: 0.462 ± 0.138
2.361ThrAsp: 2.361 ± 0.313
2.566ThrGlu: 2.566 ± 0.369
2.464ThrPhe: 2.464 ± 0.312
4.978ThrGly: 4.978 ± 0.685
0.821ThrHis: 0.821 ± 0.193
3.798ThrIle: 3.798 ± 0.5
3.336ThrLys: 3.336 ± 0.398
3.644ThrLeu: 3.644 ± 0.44
0.975ThrMet: 0.975 ± 0.227
2.002ThrAsn: 2.002 ± 0.364
2.874ThrPro: 2.874 ± 0.425
1.54ThrGln: 1.54 ± 0.28
2.31ThrArg: 2.31 ± 0.335
4.055ThrSer: 4.055 ± 0.415
2.515ThrThr: 2.515 ± 0.448
3.49ThrVal: 3.49 ± 0.431
0.821ThrTrp: 0.821 ± 0.243
1.745ThrTyr: 1.745 ± 0.243
0.0ThrXaa: 0.0 ± 0.0
Val
6.467ValAla: 6.467 ± 0.587
0.924ValCys: 0.924 ± 0.211
4.927ValAsp: 4.927 ± 0.5
5.235ValGlu: 5.235 ± 0.636
2.72ValPhe: 2.72 ± 0.361
4.619ValGly: 4.619 ± 0.463
1.078ValHis: 1.078 ± 0.274
3.695ValIle: 3.695 ± 0.564
3.747ValLys: 3.747 ± 0.448
4.876ValLeu: 4.876 ± 0.634
1.54ValMet: 1.54 ± 0.319
2.874ValAsn: 2.874 ± 0.475
1.95ValPro: 1.95 ± 0.254
2.874ValGln: 2.874 ± 0.435
3.644ValArg: 3.644 ± 0.476
3.541ValSer: 3.541 ± 0.394
3.695ValThr: 3.695 ± 0.464
4.619ValVal: 4.619 ± 0.535
1.026ValTrp: 1.026 ± 0.224
1.694ValTyr: 1.694 ± 0.274
0.0ValXaa: 0.0 ± 0.0
Trp
1.437TrpAla: 1.437 ± 0.267
0.205TrpCys: 0.205 ± 0.096
0.873TrpAsp: 0.873 ± 0.187
1.18TrpGlu: 1.18 ± 0.249
0.462TrpPhe: 0.462 ± 0.147
1.18TrpGly: 1.18 ± 0.292
0.205TrpHis: 0.205 ± 0.099
0.975TrpIle: 0.975 ± 0.247
0.975TrpLys: 0.975 ± 0.189
1.026TrpLeu: 1.026 ± 0.202
0.359TrpMet: 0.359 ± 0.143
1.283TrpAsn: 1.283 ± 0.248
0.565TrpPro: 0.565 ± 0.158
0.359TrpGln: 0.359 ± 0.143
0.924TrpArg: 0.924 ± 0.232
0.924TrpSer: 0.924 ± 0.171
0.77TrpThr: 0.77 ± 0.191
1.18TrpVal: 1.18 ± 0.204
0.205TrpTrp: 0.205 ± 0.093
0.359TrpTyr: 0.359 ± 0.123
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.053TyrAla: 2.053 ± 0.292
0.359TyrCys: 0.359 ± 0.162
1.899TyrAsp: 1.899 ± 0.288
1.591TyrGlu: 1.591 ± 0.285
1.232TyrPhe: 1.232 ± 0.233
2.823TyrGly: 2.823 ± 0.359
0.513TyrHis: 0.513 ± 0.149
2.002TyrIle: 2.002 ± 0.299
1.232TyrLys: 1.232 ± 0.241
2.156TyrLeu: 2.156 ± 0.34
0.667TyrMet: 0.667 ± 0.147
1.488TyrAsn: 1.488 ± 0.27
0.873TyrPro: 0.873 ± 0.203
1.334TyrGln: 1.334 ± 0.295
2.156TyrArg: 2.156 ± 0.313
1.95TyrSer: 1.95 ± 0.343
1.334TyrThr: 1.334 ± 0.267
2.361TyrVal: 2.361 ± 0.486
0.513TyrTrp: 0.513 ± 0.138
0.565TyrTyr: 0.565 ± 0.178
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 86 proteins (19485 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski