Amino acid dipepetide frequency for Gordonia phage Lauer

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.451AlaAla: 10.451 ± 1.459
0.903AlaCys: 0.903 ± 0.279
6.645AlaAsp: 6.645 ± 0.601
4.903AlaGlu: 4.903 ± 0.566
4.064AlaPhe: 4.064 ± 0.716
7.999AlaGly: 7.999 ± 0.915
1.806AlaHis: 1.806 ± 0.4
4.774AlaIle: 4.774 ± 0.563
3.677AlaLys: 3.677 ± 0.56
7.806AlaLeu: 7.806 ± 0.731
3.419AlaMet: 3.419 ± 0.7
3.484AlaAsn: 3.484 ± 0.562
4.451AlaPro: 4.451 ± 0.499
3.419AlaGln: 3.419 ± 0.804
6.58AlaArg: 6.58 ± 0.766
5.225AlaSer: 5.225 ± 0.631
6.064AlaThr: 6.064 ± 0.775
6.516AlaVal: 6.516 ± 0.83
2.193AlaTrp: 2.193 ± 0.401
2.58AlaTyr: 2.58 ± 0.468
0.0AlaXaa: 0.0 ± 0.0
Cys
0.774CysAla: 0.774 ± 0.243
0.194CysCys: 0.194 ± 0.186
0.839CysAsp: 0.839 ± 0.22
0.516CysGlu: 0.516 ± 0.186
0.387CysPhe: 0.387 ± 0.161
1.419CysGly: 1.419 ± 0.507
0.129CysHis: 0.129 ± 0.09
0.194CysIle: 0.194 ± 0.103
0.516CysLys: 0.516 ± 0.22
0.581CysLeu: 0.581 ± 0.214
0.129CysMet: 0.129 ± 0.087
0.774CysAsn: 0.774 ± 0.258
0.323CysPro: 0.323 ± 0.134
0.516CysGln: 0.516 ± 0.164
0.774CysArg: 0.774 ± 0.222
0.516CysSer: 0.516 ± 0.173
0.71CysThr: 0.71 ± 0.253
0.903CysVal: 0.903 ± 0.302
0.258CysTrp: 0.258 ± 0.116
0.452CysTyr: 0.452 ± 0.168
0.0CysXaa: 0.0 ± 0.0
Asp
4.58AspAla: 4.58 ± 0.393
0.581AspCys: 0.581 ± 0.224
5.225AspAsp: 5.225 ± 0.747
5.935AspGlu: 5.935 ± 0.769
2.0AspPhe: 2.0 ± 0.326
4.774AspGly: 4.774 ± 0.718
1.613AspHis: 1.613 ± 0.34
4.451AspIle: 4.451 ± 0.572
2.451AspLys: 2.451 ± 0.413
5.484AspLeu: 5.484 ± 0.706
1.613AspMet: 1.613 ± 0.308
2.387AspAsn: 2.387 ± 0.385
4.193AspPro: 4.193 ± 0.606
2.387AspGln: 2.387 ± 0.444
4.064AspArg: 4.064 ± 0.493
3.097AspSer: 3.097 ± 0.439
4.258AspThr: 4.258 ± 0.61
4.193AspVal: 4.193 ± 0.437
1.355AspTrp: 1.355 ± 0.262
2.71AspTyr: 2.71 ± 0.532
0.0AspXaa: 0.0 ± 0.0
Glu
6.322GluAla: 6.322 ± 0.523
0.903GluCys: 0.903 ± 0.253
4.516GluAsp: 4.516 ± 0.69
4.645GluGlu: 4.645 ± 0.756
1.161GluPhe: 1.161 ± 0.269
3.871GluGly: 3.871 ± 0.593
1.097GluHis: 1.097 ± 0.235
2.71GluIle: 2.71 ± 0.434
2.387GluLys: 2.387 ± 0.437
4.451GluLeu: 4.451 ± 0.488
2.064GluMet: 2.064 ± 0.558
2.0GluAsn: 2.0 ± 0.354
2.516GluPro: 2.516 ± 0.385
2.839GluGln: 2.839 ± 0.483
4.193GluArg: 4.193 ± 0.55
2.839GluSer: 2.839 ± 0.504
3.097GluThr: 3.097 ± 0.383
3.935GluVal: 3.935 ± 0.486
1.161GluTrp: 1.161 ± 0.261
1.871GluTyr: 1.871 ± 0.401
0.0GluXaa: 0.0 ± 0.0
Phe
2.58PheAla: 2.58 ± 0.533
0.839PheCys: 0.839 ± 0.269
3.484PheAsp: 3.484 ± 0.575
1.935PheGlu: 1.935 ± 0.345
1.29PhePhe: 1.29 ± 0.29
2.774PheGly: 2.774 ± 0.46
0.774PheHis: 0.774 ± 0.209
1.355PheIle: 1.355 ± 0.371
1.355PheLys: 1.355 ± 0.364
2.193PheLeu: 2.193 ± 0.38
0.581PheMet: 0.581 ± 0.163
1.032PheAsn: 1.032 ± 0.238
1.29PhePro: 1.29 ± 0.292
0.968PheGln: 0.968 ± 0.281
2.387PheArg: 2.387 ± 0.399
1.806PheSer: 1.806 ± 0.346
2.322PheThr: 2.322 ± 0.373
2.193PheVal: 2.193 ± 0.412
0.452PheTrp: 0.452 ± 0.163
0.387PheTyr: 0.387 ± 0.229
0.0PheXaa: 0.0 ± 0.0
Gly
7.677GlyAla: 7.677 ± 0.966
0.581GlyCys: 0.581 ± 0.206
5.419GlyAsp: 5.419 ± 1.159
3.677GlyGlu: 3.677 ± 0.514
2.516GlyPhe: 2.516 ± 0.36
6.838GlyGly: 6.838 ± 0.9
1.677GlyHis: 1.677 ± 0.414
5.096GlyIle: 5.096 ± 0.902
3.613GlyLys: 3.613 ± 0.515
5.613GlyLeu: 5.613 ± 0.755
1.419GlyMet: 1.419 ± 0.317
2.58GlyAsn: 2.58 ± 0.399
4.58GlyPro: 4.58 ± 1.833
2.58GlyGln: 2.58 ± 0.538
5.29GlyArg: 5.29 ± 0.652
5.419GlySer: 5.419 ± 0.874
5.677GlyThr: 5.677 ± 0.569
6.516GlyVal: 6.516 ± 0.688
1.871GlyTrp: 1.871 ± 0.365
2.516GlyTyr: 2.516 ± 0.446
0.0GlyXaa: 0.0 ± 0.0
His
1.161HisAla: 1.161 ± 0.228
0.194HisCys: 0.194 ± 0.11
1.806HisAsp: 1.806 ± 0.373
1.29HisGlu: 1.29 ± 0.3
0.71HisPhe: 0.71 ± 0.224
1.29HisGly: 1.29 ± 0.328
0.903HisHis: 0.903 ± 0.287
1.29HisIle: 1.29 ± 0.271
1.29HisLys: 1.29 ± 0.318
1.097HisLeu: 1.097 ± 0.298
0.774HisMet: 0.774 ± 0.231
0.71HisAsn: 0.71 ± 0.197
1.484HisPro: 1.484 ± 0.331
0.774HisGln: 0.774 ± 0.197
1.097HisArg: 1.097 ± 0.204
1.355HisSer: 1.355 ± 0.28
1.226HisThr: 1.226 ± 0.226
1.484HisVal: 1.484 ± 0.234
0.452HisTrp: 0.452 ± 0.181
0.323HisTyr: 0.323 ± 0.167
0.0HisXaa: 0.0 ± 0.0
Ile
5.613IleAla: 5.613 ± 0.647
0.258IleCys: 0.258 ± 0.121
3.613IleAsp: 3.613 ± 0.371
3.548IleGlu: 3.548 ± 0.424
1.548IlePhe: 1.548 ± 0.418
4.129IleGly: 4.129 ± 0.748
0.903IleHis: 0.903 ± 0.252
2.0IleIle: 2.0 ± 0.447
2.645IleLys: 2.645 ± 0.672
3.032IleLeu: 3.032 ± 0.404
1.419IleMet: 1.419 ± 0.25
1.677IleAsn: 1.677 ± 0.297
2.58IlePro: 2.58 ± 0.361
1.935IleGln: 1.935 ± 0.611
3.355IleArg: 3.355 ± 0.464
2.451IleSer: 2.451 ± 0.382
2.839IleThr: 2.839 ± 0.36
3.935IleVal: 3.935 ± 0.45
0.968IleTrp: 0.968 ± 0.255
0.903IleTyr: 0.903 ± 0.283
0.0IleXaa: 0.0 ± 0.0
Lys
5.032LysAla: 5.032 ± 0.702
0.516LysCys: 0.516 ± 0.25
2.258LysAsp: 2.258 ± 0.337
2.774LysGlu: 2.774 ± 0.513
2.0LysPhe: 2.0 ± 0.581
3.806LysGly: 3.806 ± 0.655
0.774LysHis: 0.774 ± 0.211
2.258LysIle: 2.258 ± 0.392
2.903LysLys: 2.903 ± 0.497
3.742LysLeu: 3.742 ± 0.711
1.548LysMet: 1.548 ± 0.371
1.871LysAsn: 1.871 ± 0.364
2.645LysPro: 2.645 ± 0.436
2.322LysGln: 2.322 ± 0.382
3.161LysArg: 3.161 ± 0.49
2.774LysSer: 2.774 ± 0.498
2.258LysThr: 2.258 ± 0.431
2.322LysVal: 2.322 ± 0.498
0.839LysTrp: 0.839 ± 0.244
1.613LysTyr: 1.613 ± 0.314
0.0LysXaa: 0.0 ± 0.0
Leu
8.064LeuAla: 8.064 ± 1.109
0.774LeuCys: 0.774 ± 0.303
5.225LeuAsp: 5.225 ± 0.635
4.58LeuGlu: 4.58 ± 0.665
1.871LeuPhe: 1.871 ± 0.36
4.967LeuGly: 4.967 ± 0.819
1.355LeuHis: 1.355 ± 0.283
3.742LeuIle: 3.742 ± 0.463
3.742LeuLys: 3.742 ± 0.596
6.193LeuLeu: 6.193 ± 0.664
2.0LeuMet: 2.0 ± 0.388
1.742LeuAsn: 1.742 ± 0.319
4.58LeuPro: 4.58 ± 0.494
2.387LeuGln: 2.387 ± 0.45
6.322LeuArg: 6.322 ± 0.847
4.064LeuSer: 4.064 ± 0.573
4.838LeuThr: 4.838 ± 0.67
4.967LeuVal: 4.967 ± 0.581
1.742LeuTrp: 1.742 ± 0.364
2.645LeuTyr: 2.645 ± 0.357
0.0LeuXaa: 0.0 ± 0.0
Met
3.032MetAla: 3.032 ± 0.432
0.065MetCys: 0.065 ± 0.065
1.806MetAsp: 1.806 ± 0.351
1.226MetGlu: 1.226 ± 0.27
0.968MetPhe: 0.968 ± 0.27
1.677MetGly: 1.677 ± 0.396
0.581MetHis: 0.581 ± 0.23
1.677MetIle: 1.677 ± 0.332
1.419MetLys: 1.419 ± 0.281
1.806MetLeu: 1.806 ± 0.423
0.774MetMet: 0.774 ± 0.222
0.839MetAsn: 0.839 ± 0.257
1.484MetPro: 1.484 ± 0.281
0.903MetGln: 0.903 ± 0.197
1.548MetArg: 1.548 ± 0.327
2.387MetSer: 2.387 ± 0.443
1.806MetThr: 1.806 ± 0.327
1.742MetVal: 1.742 ± 0.326
0.516MetTrp: 0.516 ± 0.195
0.581MetTyr: 0.581 ± 0.153
0.0MetXaa: 0.0 ± 0.0
Asn
3.871AsnAla: 3.871 ± 0.733
0.452AsnCys: 0.452 ± 0.185
1.935AsnAsp: 1.935 ± 0.314
2.0AsnGlu: 2.0 ± 0.374
0.903AsnPhe: 0.903 ± 0.292
3.161AsnGly: 3.161 ± 0.636
0.71AsnHis: 0.71 ± 0.216
1.613AsnIle: 1.613 ± 0.317
1.613AsnLys: 1.613 ± 0.292
2.903AsnLeu: 2.903 ± 0.364
0.903AsnMet: 0.903 ± 0.241
1.355AsnAsn: 1.355 ± 0.446
2.839AsnPro: 2.839 ± 0.48
0.774AsnGln: 0.774 ± 0.281
2.322AsnArg: 2.322 ± 0.374
1.935AsnSer: 1.935 ± 0.648
1.29AsnThr: 1.29 ± 0.228
3.097AsnVal: 3.097 ± 0.523
0.774AsnTrp: 0.774 ± 0.237
0.839AsnTyr: 0.839 ± 0.271
0.0AsnXaa: 0.0 ± 0.0
Pro
6.193ProAla: 6.193 ± 0.676
0.839ProCys: 0.839 ± 0.268
3.161ProAsp: 3.161 ± 0.432
3.032ProGlu: 3.032 ± 0.492
1.806ProPhe: 1.806 ± 0.31
4.903ProGly: 4.903 ± 0.581
1.161ProHis: 1.161 ± 0.233
2.58ProIle: 2.58 ± 0.422
2.968ProLys: 2.968 ± 0.512
3.032ProLeu: 3.032 ± 0.437
1.355ProMet: 1.355 ± 0.277
2.129ProAsn: 2.129 ± 0.376
2.451ProPro: 2.451 ± 0.506
2.71ProGln: 2.71 ± 1.063
2.968ProArg: 2.968 ± 0.487
3.226ProSer: 3.226 ± 0.494
3.419ProThr: 3.419 ± 0.558
3.484ProVal: 3.484 ± 0.548
1.419ProTrp: 1.419 ± 0.317
1.355ProTyr: 1.355 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
3.097GlnAla: 3.097 ± 0.646
0.258GlnCys: 0.258 ± 0.119
2.322GlnAsp: 2.322 ± 0.409
2.258GlnGlu: 2.258 ± 0.449
1.806GlnPhe: 1.806 ± 0.355
4.903GlnGly: 4.903 ± 2.432
0.71GlnHis: 0.71 ± 0.21
1.226GlnIle: 1.226 ± 0.32
1.484GlnLys: 1.484 ± 0.296
3.032GlnLeu: 3.032 ± 0.525
0.903GlnMet: 0.903 ± 0.287
1.032GlnAsn: 1.032 ± 0.282
1.806GlnPro: 1.806 ± 0.328
1.613GlnGln: 1.613 ± 0.308
2.774GlnArg: 2.774 ± 0.247
2.193GlnSer: 2.193 ± 0.445
1.355GlnThr: 1.355 ± 0.254
2.839GlnVal: 2.839 ± 0.392
0.903GlnTrp: 0.903 ± 0.205
1.355GlnTyr: 1.355 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
6.774ArgAla: 6.774 ± 0.63
0.581ArgCys: 0.581 ± 0.273
4.129ArgAsp: 4.129 ± 0.624
3.29ArgGlu: 3.29 ± 0.483
1.806ArgPhe: 1.806 ± 0.384
5.161ArgGly: 5.161 ± 0.619
1.355ArgHis: 1.355 ± 0.273
2.645ArgIle: 2.645 ± 0.354
4.387ArgLys: 4.387 ± 0.467
5.419ArgLeu: 5.419 ± 0.687
1.548ArgMet: 1.548 ± 0.353
2.774ArgAsn: 2.774 ± 0.463
2.839ArgPro: 2.839 ± 0.53
2.903ArgGln: 2.903 ± 0.414
6.193ArgArg: 6.193 ± 0.886
4.0ArgSer: 4.0 ± 0.398
2.968ArgThr: 2.968 ± 0.518
5.354ArgVal: 5.354 ± 0.68
1.419ArgTrp: 1.419 ± 0.377
1.677ArgTyr: 1.677 ± 0.321
0.0ArgXaa: 0.0 ± 0.0
Ser
5.354SerAla: 5.354 ± 0.783
1.097SerCys: 1.097 ± 0.364
3.742SerAsp: 3.742 ± 0.561
2.903SerGlu: 2.903 ± 0.472
1.419SerPhe: 1.419 ± 0.316
5.935SerGly: 5.935 ± 0.72
1.29SerHis: 1.29 ± 0.281
2.839SerIle: 2.839 ± 0.549
2.968SerLys: 2.968 ± 0.393
4.322SerLeu: 4.322 ± 0.497
1.677SerMet: 1.677 ± 0.32
2.0SerAsn: 2.0 ± 0.414
3.419SerPro: 3.419 ± 0.594
2.193SerGln: 2.193 ± 0.411
3.226SerArg: 3.226 ± 0.446
3.742SerSer: 3.742 ± 0.527
3.613SerThr: 3.613 ± 0.48
3.29SerVal: 3.29 ± 0.587
1.677SerTrp: 1.677 ± 0.316
1.806SerTyr: 1.806 ± 0.361
0.0SerXaa: 0.0 ± 0.0
Thr
5.225ThrAla: 5.225 ± 0.626
0.516ThrCys: 0.516 ± 0.195
3.097ThrAsp: 3.097 ± 0.431
2.774ThrGlu: 2.774 ± 0.485
1.806ThrPhe: 1.806 ± 0.292
5.29ThrGly: 5.29 ± 0.755
1.29ThrHis: 1.29 ± 0.296
2.968ThrIle: 2.968 ± 0.536
3.032ThrLys: 3.032 ± 0.425
5.225ThrLeu: 5.225 ± 0.493
1.677ThrMet: 1.677 ± 0.406
1.871ThrAsn: 1.871 ± 0.302
4.193ThrPro: 4.193 ± 0.464
1.742ThrGln: 1.742 ± 0.24
4.193ThrArg: 4.193 ± 0.536
3.935ThrSer: 3.935 ± 0.537
3.161ThrThr: 3.161 ± 0.483
4.645ThrVal: 4.645 ± 0.631
1.097ThrTrp: 1.097 ± 0.226
1.226ThrTyr: 1.226 ± 0.337
0.0ThrXaa: 0.0 ± 0.0
Val
6.774ValAla: 6.774 ± 0.579
0.774ValCys: 0.774 ± 0.255
4.774ValAsp: 4.774 ± 0.489
4.387ValGlu: 4.387 ± 0.531
2.258ValPhe: 2.258 ± 0.31
4.709ValGly: 4.709 ± 0.579
1.29ValHis: 1.29 ± 0.302
4.129ValIle: 4.129 ± 0.445
2.903ValLys: 2.903 ± 0.347
5.484ValLeu: 5.484 ± 0.647
1.548ValMet: 1.548 ± 0.309
2.387ValAsn: 2.387 ± 0.377
4.0ValPro: 4.0 ± 0.421
2.839ValGln: 2.839 ± 0.421
3.613ValArg: 3.613 ± 0.466
4.516ValSer: 4.516 ± 0.535
5.096ValThr: 5.096 ± 0.52
5.742ValVal: 5.742 ± 0.603
1.871ValTrp: 1.871 ± 0.296
1.935ValTyr: 1.935 ± 0.296
0.0ValXaa: 0.0 ± 0.0
Trp
2.0TrpAla: 2.0 ± 0.388
0.258TrpCys: 0.258 ± 0.127
1.419TrpAsp: 1.419 ± 0.289
0.774TrpGlu: 0.774 ± 0.269
0.645TrpPhe: 0.645 ± 0.22
1.032TrpGly: 1.032 ± 0.321
0.71TrpHis: 0.71 ± 0.209
0.452TrpIle: 0.452 ± 0.152
0.903TrpLys: 0.903 ± 0.223
2.258TrpLeu: 2.258 ± 0.442
0.387TrpMet: 0.387 ± 0.138
1.806TrpAsn: 1.806 ± 0.368
0.968TrpPro: 0.968 ± 0.238
1.032TrpGln: 1.032 ± 0.321
1.677TrpArg: 1.677 ± 0.299
1.355TrpSer: 1.355 ± 0.311
1.161TrpThr: 1.161 ± 0.264
1.742TrpVal: 1.742 ± 0.398
0.452TrpTrp: 0.452 ± 0.194
0.645TrpTyr: 0.645 ± 0.205
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.387TyrAla: 2.387 ± 0.378
0.323TyrCys: 0.323 ± 0.126
1.935TyrAsp: 1.935 ± 0.44
2.129TyrGlu: 2.129 ± 0.454
1.032TyrPhe: 1.032 ± 0.237
2.516TyrGly: 2.516 ± 0.468
0.581TyrHis: 0.581 ± 0.238
1.355TyrIle: 1.355 ± 0.261
1.161TyrLys: 1.161 ± 0.408
2.129TyrLeu: 2.129 ± 0.374
1.032TyrMet: 1.032 ± 0.327
0.839TyrAsn: 0.839 ± 0.279
1.548TyrPro: 1.548 ± 0.357
1.032TyrGln: 1.032 ± 0.234
1.548TyrArg: 1.548 ± 0.361
1.677TyrSer: 1.677 ± 0.33
1.806TyrThr: 1.806 ± 0.279
2.129TyrVal: 2.129 ± 0.413
0.258TyrTrp: 0.258 ± 0.122
0.774TyrTyr: 0.774 ± 0.28
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 66 proteins (15502 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski