Amino acid dipepetide frequency for Gordonia phage GEazy

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.912AlaAla: 18.912 ± 1.702
0.929AlaCys: 0.929 ± 0.246
8.427AlaAsp: 8.427 ± 0.819
7.697AlaGlu: 7.697 ± 0.984
2.92AlaPhe: 2.92 ± 0.548
10.617AlaGly: 10.617 ± 1.157
2.522AlaHis: 2.522 ± 0.467
4.645AlaIle: 4.645 ± 0.564
4.247AlaLys: 4.247 ± 0.497
10.352AlaLeu: 10.352 ± 0.918
2.853AlaMet: 2.853 ± 0.422
3.451AlaAsn: 3.451 ± 0.369
7.896AlaPro: 7.896 ± 0.751
4.512AlaGln: 4.512 ± 0.673
8.759AlaArg: 8.759 ± 1.007
6.901AlaSer: 6.901 ± 0.784
8.626AlaThr: 8.626 ± 0.831
8.693AlaVal: 8.693 ± 0.984
1.991AlaTrp: 1.991 ± 0.382
1.792AlaTyr: 1.792 ± 0.304
0.0AlaXaa: 0.0 ± 0.0
Cys
0.664CysAla: 0.664 ± 0.268
0.199CysCys: 0.199 ± 0.121
0.73CysAsp: 0.73 ± 0.23
0.796CysGlu: 0.796 ± 0.249
0.066CysPhe: 0.066 ± 0.059
1.062CysGly: 1.062 ± 0.282
0.398CysHis: 0.398 ± 0.198
0.066CysIle: 0.066 ± 0.068
0.133CysLys: 0.133 ± 0.09
0.531CysLeu: 0.531 ± 0.245
0.0CysMet: 0.0 ± 0.0
0.464CysAsn: 0.464 ± 0.166
0.73CysPro: 0.73 ± 0.289
0.332CysGln: 0.332 ± 0.199
1.128CysArg: 1.128 ± 0.299
0.265CysSer: 0.265 ± 0.149
0.464CysThr: 0.464 ± 0.169
0.73CysVal: 0.73 ± 0.222
0.332CysTrp: 0.332 ± 0.144
0.133CysTyr: 0.133 ± 0.081
0.0CysXaa: 0.0 ± 0.0
Asp
7.896AspAla: 7.896 ± 0.618
0.73AspCys: 0.73 ± 0.284
4.778AspAsp: 4.778 ± 0.754
5.574AspGlu: 5.574 ± 0.679
2.057AspPhe: 2.057 ± 0.354
5.64AspGly: 5.64 ± 0.516
1.593AspHis: 1.593 ± 0.337
2.057AspIle: 2.057 ± 0.456
1.194AspLys: 1.194 ± 0.354
6.105AspLeu: 6.105 ± 0.681
0.995AspMet: 0.995 ± 0.311
1.327AspAsn: 1.327 ± 0.355
4.446AspPro: 4.446 ± 0.793
2.455AspGln: 2.455 ± 0.373
4.18AspArg: 4.18 ± 0.676
3.583AspSer: 3.583 ± 0.471
3.517AspThr: 3.517 ± 0.598
3.981AspVal: 3.981 ± 0.575
1.526AspTrp: 1.526 ± 0.272
1.725AspTyr: 1.725 ± 0.412
0.0AspXaa: 0.0 ± 0.0
Glu
5.773GluAla: 5.773 ± 0.707
0.398GluCys: 0.398 ± 0.169
3.318GluAsp: 3.318 ± 0.507
2.787GluGlu: 2.787 ± 0.571
2.057GluPhe: 2.057 ± 0.487
3.915GluGly: 3.915 ± 0.479
1.327GluHis: 1.327 ± 0.373
3.583GluIle: 3.583 ± 0.563
1.327GluLys: 1.327 ± 0.225
5.242GluLeu: 5.242 ± 0.614
1.46GluMet: 1.46 ± 0.317
1.261GluAsn: 1.261 ± 0.314
3.517GluPro: 3.517 ± 0.663
1.991GluGln: 1.991 ± 0.349
4.91GluArg: 4.91 ± 0.777
3.052GluSer: 3.052 ± 0.414
3.251GluThr: 3.251 ± 0.48
3.716GluVal: 3.716 ± 0.58
1.327GluTrp: 1.327 ± 0.413
1.46GluTyr: 1.46 ± 0.264
0.0GluXaa: 0.0 ± 0.0
Phe
2.522PheAla: 2.522 ± 0.591
0.133PheCys: 0.133 ± 0.096
1.991PheAsp: 1.991 ± 0.346
2.19PheGlu: 2.19 ± 0.388
0.597PhePhe: 0.597 ± 0.161
2.654PheGly: 2.654 ± 0.469
0.796PheHis: 0.796 ± 0.211
1.194PheIle: 1.194 ± 0.349
0.398PheLys: 0.398 ± 0.176
2.123PheLeu: 2.123 ± 0.467
0.199PheMet: 0.199 ± 0.106
0.597PheAsn: 0.597 ± 0.193
1.593PhePro: 1.593 ± 0.293
0.73PheGln: 0.73 ± 0.177
1.725PheArg: 1.725 ± 0.362
1.924PheSer: 1.924 ± 0.378
2.455PheThr: 2.455 ± 0.543
1.991PheVal: 1.991 ± 0.342
0.796PheTrp: 0.796 ± 0.243
0.464PheTyr: 0.464 ± 0.165
0.0PheXaa: 0.0 ± 0.0
Gly
8.759GlyAla: 8.759 ± 1.066
0.597GlyCys: 0.597 ± 0.218
5.773GlyAsp: 5.773 ± 0.712
4.048GlyGlu: 4.048 ± 0.461
2.92GlyPhe: 2.92 ± 0.517
8.693GlyGly: 8.693 ± 1.056
1.858GlyHis: 1.858 ± 0.31
4.114GlyIle: 4.114 ± 0.815
2.522GlyLys: 2.522 ± 0.431
7.1GlyLeu: 7.1 ± 0.711
2.19GlyMet: 2.19 ± 0.316
2.389GlyAsn: 2.389 ± 0.423
4.313GlyPro: 4.313 ± 0.589
3.052GlyGln: 3.052 ± 0.411
6.768GlyArg: 6.768 ± 0.725
5.109GlySer: 5.109 ± 0.493
4.91GlyThr: 4.91 ± 0.548
6.304GlyVal: 6.304 ± 0.593
1.526GlyTrp: 1.526 ± 0.271
2.389GlyTyr: 2.389 ± 0.357
0.0GlyXaa: 0.0 ± 0.0
His
1.991HisAla: 1.991 ± 0.472
0.464HisCys: 0.464 ± 0.191
1.194HisAsp: 1.194 ± 0.281
0.995HisGlu: 0.995 ± 0.233
0.531HisPhe: 0.531 ± 0.232
1.858HisGly: 1.858 ± 0.477
0.863HisHis: 0.863 ± 0.273
0.995HisIle: 0.995 ± 0.278
0.398HisLys: 0.398 ± 0.163
1.792HisLeu: 1.792 ± 0.457
0.265HisMet: 0.265 ± 0.137
0.531HisAsn: 0.531 ± 0.166
1.792HisPro: 1.792 ± 0.348
0.531HisGln: 0.531 ± 0.187
1.526HisArg: 1.526 ± 0.355
1.46HisSer: 1.46 ± 0.253
1.327HisThr: 1.327 ± 0.385
1.393HisVal: 1.393 ± 0.272
0.398HisTrp: 0.398 ± 0.168
0.597HisTyr: 0.597 ± 0.213
0.0HisXaa: 0.0 ± 0.0
Ile
7.167IleAla: 7.167 ± 0.711
0.597IleCys: 0.597 ± 0.199
2.787IleAsp: 2.787 ± 0.491
3.185IleGlu: 3.185 ± 0.612
0.664IlePhe: 0.664 ± 0.195
3.384IleGly: 3.384 ± 0.624
0.863IleHis: 0.863 ± 0.248
1.725IleIle: 1.725 ± 0.336
1.526IleLys: 1.526 ± 0.466
3.119IleLeu: 3.119 ± 0.46
0.265IleMet: 0.265 ± 0.131
1.062IleAsn: 1.062 ± 0.247
1.924IlePro: 1.924 ± 0.308
1.261IleGln: 1.261 ± 0.338
3.384IleArg: 3.384 ± 0.434
3.052IleSer: 3.052 ± 0.478
3.185IleThr: 3.185 ± 0.426
3.318IleVal: 3.318 ± 0.454
0.796IleTrp: 0.796 ± 0.236
0.863IleTyr: 0.863 ± 0.296
0.0IleXaa: 0.0 ± 0.0
Lys
3.384LysAla: 3.384 ± 0.513
0.398LysCys: 0.398 ± 0.159
1.991LysAsp: 1.991 ± 0.456
0.664LysGlu: 0.664 ± 0.195
0.863LysPhe: 0.863 ± 0.232
2.588LysGly: 2.588 ± 0.452
0.332LysHis: 0.332 ± 0.134
0.995LysIle: 0.995 ± 0.307
1.725LysLys: 1.725 ± 0.487
2.92LysLeu: 2.92 ± 0.511
0.398LysMet: 0.398 ± 0.182
0.73LysAsn: 0.73 ± 0.173
1.261LysPro: 1.261 ± 0.271
0.929LysGln: 0.929 ± 0.305
1.924LysArg: 1.924 ± 0.411
1.991LysSer: 1.991 ± 0.451
2.389LysThr: 2.389 ± 0.463
2.123LysVal: 2.123 ± 0.421
0.73LysTrp: 0.73 ± 0.211
0.398LysTyr: 0.398 ± 0.15
0.0LysXaa: 0.0 ± 0.0
Leu
12.276LeuAla: 12.276 ± 0.904
0.597LeuCys: 0.597 ± 0.234
4.579LeuAsp: 4.579 ± 0.666
4.645LeuGlu: 4.645 ± 0.683
2.256LeuPhe: 2.256 ± 0.388
6.238LeuGly: 6.238 ± 0.627
1.792LeuHis: 1.792 ± 0.405
2.986LeuIle: 2.986 ± 0.423
1.792LeuLys: 1.792 ± 0.435
7.034LeuLeu: 7.034 ± 0.849
1.659LeuMet: 1.659 ± 0.292
2.19LeuAsn: 2.19 ± 0.356
4.91LeuPro: 4.91 ± 0.742
2.322LeuGln: 2.322 ± 0.376
6.702LeuArg: 6.702 ± 0.841
4.645LeuSer: 4.645 ± 0.571
5.242LeuThr: 5.242 ± 0.606
7.167LeuVal: 7.167 ± 0.751
1.46LeuTrp: 1.46 ± 0.303
1.393LeuTyr: 1.393 ± 0.273
0.0LeuXaa: 0.0 ± 0.0
Met
3.583MetAla: 3.583 ± 0.651
0.133MetCys: 0.133 ± 0.094
1.393MetAsp: 1.393 ± 0.307
0.464MetGlu: 0.464 ± 0.173
0.464MetPhe: 0.464 ± 0.164
1.194MetGly: 1.194 ± 0.242
0.531MetHis: 0.531 ± 0.194
1.261MetIle: 1.261 ± 0.247
0.531MetLys: 0.531 ± 0.176
1.924MetLeu: 1.924 ± 0.364
0.664MetMet: 0.664 ± 0.217
0.863MetAsn: 0.863 ± 0.321
1.327MetPro: 1.327 ± 0.285
0.597MetGln: 0.597 ± 0.216
1.725MetArg: 1.725 ± 0.347
1.128MetSer: 1.128 ± 0.268
2.322MetThr: 2.322 ± 0.542
1.194MetVal: 1.194 ± 0.29
0.597MetTrp: 0.597 ± 0.235
0.066MetTyr: 0.066 ± 0.058
0.0MetXaa: 0.0 ± 0.0
Asn
3.052AsnAla: 3.052 ± 0.565
0.133AsnCys: 0.133 ± 0.103
1.526AsnAsp: 1.526 ± 0.283
0.995AsnGlu: 0.995 ± 0.252
0.929AsnPhe: 0.929 ± 0.269
3.251AsnGly: 3.251 ± 0.603
0.664AsnHis: 0.664 ± 0.207
0.73AsnIle: 0.73 ± 0.292
0.73AsnLys: 0.73 ± 0.319
2.19AsnLeu: 2.19 ± 0.382
0.464AsnMet: 0.464 ± 0.157
0.664AsnAsn: 0.664 ± 0.184
2.322AsnPro: 2.322 ± 0.41
1.128AsnGln: 1.128 ± 0.292
2.19AsnArg: 2.19 ± 0.501
1.062AsnSer: 1.062 ± 0.263
1.261AsnThr: 1.261 ± 0.281
2.123AsnVal: 2.123 ± 0.375
0.664AsnTrp: 0.664 ± 0.162
0.531AsnTyr: 0.531 ± 0.225
0.0AsnXaa: 0.0 ± 0.0
Pro
8.693ProAla: 8.693 ± 0.878
0.531ProCys: 0.531 ± 0.182
4.645ProAsp: 4.645 ± 0.545
3.251ProGlu: 3.251 ± 0.497
1.46ProPhe: 1.46 ± 0.379
5.64ProGly: 5.64 ± 0.656
0.796ProHis: 0.796 ± 0.265
2.389ProIle: 2.389 ± 0.391
1.858ProLys: 1.858 ± 0.381
4.048ProLeu: 4.048 ± 0.483
1.46ProMet: 1.46 ± 0.298
1.792ProAsn: 1.792 ± 0.392
4.38ProPro: 4.38 ± 0.921
1.991ProGln: 1.991 ± 0.356
3.915ProArg: 3.915 ± 0.618
3.451ProSer: 3.451 ± 0.576
4.18ProThr: 4.18 ± 0.632
4.18ProVal: 4.18 ± 0.665
1.46ProTrp: 1.46 ± 0.338
0.597ProTyr: 0.597 ± 0.182
0.0ProXaa: 0.0 ± 0.0
Gln
3.451GlnAla: 3.451 ± 0.641
0.531GlnCys: 0.531 ± 0.213
1.858GlnAsp: 1.858 ± 0.393
1.261GlnGlu: 1.261 ± 0.352
0.73GlnPhe: 0.73 ± 0.176
2.522GlnGly: 2.522 ± 0.52
0.929GlnHis: 0.929 ± 0.312
1.991GlnIle: 1.991 ± 0.433
1.062GlnLys: 1.062 ± 0.339
3.517GlnLeu: 3.517 ± 0.491
1.128GlnMet: 1.128 ± 0.263
0.73GlnAsn: 0.73 ± 0.211
1.593GlnPro: 1.593 ± 0.4
1.725GlnGln: 1.725 ± 0.394
2.853GlnArg: 2.853 ± 0.537
1.46GlnSer: 1.46 ± 0.275
1.858GlnThr: 1.858 ± 0.371
2.322GlnVal: 2.322 ± 0.403
1.062GlnTrp: 1.062 ± 0.289
0.73GlnTyr: 0.73 ± 0.231
0.0GlnXaa: 0.0 ± 0.0
Arg
8.825ArgAla: 8.825 ± 0.807
0.929ArgCys: 0.929 ± 0.243
5.176ArgAsp: 5.176 ± 0.484
3.318ArgGlu: 3.318 ± 0.473
1.725ArgPhe: 1.725 ± 0.334
5.176ArgGly: 5.176 ± 0.621
1.128ArgHis: 1.128 ± 0.236
3.583ArgIle: 3.583 ± 0.5
3.451ArgLys: 3.451 ± 0.523
6.304ArgLeu: 6.304 ± 0.611
2.92ArgMet: 2.92 ± 0.522
2.787ArgAsn: 2.787 ± 0.433
4.844ArgPro: 4.844 ± 0.739
2.986ArgGln: 2.986 ± 0.415
8.162ArgArg: 8.162 ± 1.034
4.38ArgSer: 4.38 ± 0.564
4.579ArgThr: 4.579 ± 0.554
4.313ArgVal: 4.313 ± 0.449
0.929ArgTrp: 0.929 ± 0.286
0.995ArgTyr: 0.995 ± 0.316
0.0ArgXaa: 0.0 ± 0.0
Ser
7.432SerAla: 7.432 ± 0.76
0.332SerCys: 0.332 ± 0.149
3.451SerAsp: 3.451 ± 0.418
3.451SerGlu: 3.451 ± 0.673
1.194SerPhe: 1.194 ± 0.288
6.636SerGly: 6.636 ± 0.8
1.261SerHis: 1.261 ± 0.306
2.787SerIle: 2.787 ± 0.402
1.194SerLys: 1.194 ± 0.26
3.981SerLeu: 3.981 ± 0.509
1.46SerMet: 1.46 ± 0.392
1.593SerAsn: 1.593 ± 0.297
3.119SerPro: 3.119 ± 0.541
1.327SerGln: 1.327 ± 0.265
3.65SerArg: 3.65 ± 0.513
4.313SerSer: 4.313 ± 0.695
3.517SerThr: 3.517 ± 0.524
4.977SerVal: 4.977 ± 0.575
0.995SerTrp: 0.995 ± 0.258
1.062SerTyr: 1.062 ± 0.242
0.0SerXaa: 0.0 ± 0.0
Thr
8.361ThrAla: 8.361 ± 0.822
0.265ThrCys: 0.265 ± 0.137
4.91ThrAsp: 4.91 ± 0.623
3.915ThrGlu: 3.915 ± 0.514
2.057ThrPhe: 2.057 ± 0.326
5.839ThrGly: 5.839 ± 0.61
1.062ThrHis: 1.062 ± 0.282
3.716ThrIle: 3.716 ± 0.559
0.995ThrLys: 0.995 ± 0.241
4.512ThrLeu: 4.512 ± 0.532
0.73ThrMet: 0.73 ± 0.252
1.393ThrAsn: 1.393 ± 0.294
4.446ThrPro: 4.446 ± 0.554
2.19ThrGln: 2.19 ± 0.388
4.38ThrArg: 4.38 ± 0.471
3.915ThrSer: 3.915 ± 0.446
5.109ThrThr: 5.109 ± 0.748
5.906ThrVal: 5.906 ± 0.721
1.393ThrTrp: 1.393 ± 0.345
1.327ThrTyr: 1.327 ± 0.394
0.0ThrXaa: 0.0 ± 0.0
Val
10.219ValAla: 10.219 ± 1.199
0.464ValCys: 0.464 ± 0.169
4.313ValAsp: 4.313 ± 0.584
3.849ValGlu: 3.849 ± 0.613
2.256ValPhe: 2.256 ± 0.36
5.574ValGly: 5.574 ± 0.578
0.863ValHis: 0.863 ± 0.317
3.716ValIle: 3.716 ± 0.483
2.389ValLys: 2.389 ± 0.454
5.375ValLeu: 5.375 ± 0.567
1.526ValMet: 1.526 ± 0.37
1.46ValAsn: 1.46 ± 0.282
4.446ValPro: 4.446 ± 0.6
2.19ValGln: 2.19 ± 0.354
5.906ValArg: 5.906 ± 0.652
4.313ValSer: 4.313 ± 0.683
5.375ValThr: 5.375 ± 0.636
4.91ValVal: 4.91 ± 0.596
1.593ValTrp: 1.593 ± 0.367
1.526ValTyr: 1.526 ± 0.341
0.0ValXaa: 0.0 ± 0.0
Trp
1.858TrpAla: 1.858 ± 0.415
0.464TrpCys: 0.464 ± 0.167
1.327TrpAsp: 1.327 ± 0.239
1.261TrpGlu: 1.261 ± 0.394
0.73TrpPhe: 0.73 ± 0.191
1.393TrpGly: 1.393 ± 0.283
0.73TrpHis: 0.73 ± 0.239
0.796TrpIle: 0.796 ± 0.213
0.664TrpLys: 0.664 ± 0.193
2.123TrpLeu: 2.123 ± 0.43
0.995TrpMet: 0.995 ± 0.23
1.062TrpAsn: 1.062 ± 0.331
0.995TrpPro: 0.995 ± 0.257
0.863TrpGln: 0.863 ± 0.211
1.261TrpArg: 1.261 ± 0.265
0.73TrpSer: 0.73 ± 0.213
1.393TrpThr: 1.393 ± 0.321
1.194TrpVal: 1.194 ± 0.249
0.597TrpTrp: 0.597 ± 0.23
0.199TrpTyr: 0.199 ± 0.114
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.924TyrAla: 1.924 ± 0.366
0.398TyrCys: 0.398 ± 0.16
1.194TyrAsp: 1.194 ± 0.34
1.327TyrGlu: 1.327 ± 0.361
0.597TyrPhe: 0.597 ± 0.189
1.46TyrGly: 1.46 ± 0.334
0.597TyrHis: 0.597 ± 0.234
0.796TyrIle: 0.796 ± 0.275
0.597TyrLys: 0.597 ± 0.177
1.593TyrLeu: 1.593 ± 0.361
0.265TyrMet: 0.265 ± 0.119
0.265TyrAsn: 0.265 ± 0.106
0.929TyrPro: 0.929 ± 0.284
0.199TyrGln: 0.199 ± 0.111
1.46TyrArg: 1.46 ± 0.301
0.995TyrSer: 0.995 ± 0.275
1.46TyrThr: 1.46 ± 0.289
1.725TyrVal: 1.725 ± 0.35
0.464TyrTrp: 0.464 ± 0.186
0.531TyrTyr: 0.531 ± 0.191
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (15071 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski