Amino acid dipepetide frequency for Streptomyces phage ClubPenguin

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.044AlaAla: 11.044 ± 2.074
0.338AlaCys: 0.338 ± 0.121
5.297AlaAsp: 5.297 ± 0.621
6.311AlaGlu: 6.311 ± 0.685
3.155AlaPhe: 3.155 ± 0.437
6.255AlaGly: 6.255 ± 1.057
1.409AlaHis: 1.409 ± 0.348
6.255AlaIle: 6.255 ± 0.992
6.367AlaLys: 6.367 ± 0.908
6.536AlaLeu: 6.536 ± 1.111
3.325AlaMet: 3.325 ± 0.369
3.606AlaAsn: 3.606 ± 0.529
2.705AlaPro: 2.705 ± 0.391
3.55AlaGln: 3.55 ± 0.508
4.508AlaArg: 4.508 ± 0.556
5.297AlaSer: 5.297 ± 0.794
5.804AlaThr: 5.804 ± 0.713
6.762AlaVal: 6.762 ± 0.538
1.409AlaTrp: 1.409 ± 0.297
3.437AlaTyr: 3.437 ± 0.389
0.0AlaXaa: 0.0 ± 0.0
Cys
0.451CysAla: 0.451 ± 0.167
0.225CysCys: 0.225 ± 0.123
0.338CysAsp: 0.338 ± 0.162
0.451CysGlu: 0.451 ± 0.166
0.282CysPhe: 0.282 ± 0.129
0.507CysGly: 0.507 ± 0.173
0.225CysHis: 0.225 ± 0.123
0.394CysIle: 0.394 ± 0.167
0.225CysLys: 0.225 ± 0.126
0.902CysLeu: 0.902 ± 0.286
0.056CysMet: 0.056 ± 0.071
0.169CysAsn: 0.169 ± 0.122
0.225CysPro: 0.225 ± 0.114
0.113CysGln: 0.113 ± 0.074
0.394CysArg: 0.394 ± 0.206
0.282CysSer: 0.282 ± 0.126
0.225CysThr: 0.225 ± 0.11
0.338CysVal: 0.338 ± 0.13
0.282CysTrp: 0.282 ± 0.123
0.169CysTyr: 0.169 ± 0.122
0.0CysXaa: 0.0 ± 0.0
Asp
6.086AspAla: 6.086 ± 0.507
0.507AspCys: 0.507 ± 0.197
3.775AspAsp: 3.775 ± 0.596
4.677AspGlu: 4.677 ± 0.709
2.592AspPhe: 2.592 ± 0.612
5.24AspGly: 5.24 ± 0.646
0.733AspHis: 0.733 ± 0.223
3.55AspIle: 3.55 ± 0.53
3.268AspLys: 3.268 ± 0.418
4.282AspLeu: 4.282 ± 0.596
2.254AspMet: 2.254 ± 0.283
2.592AspAsn: 2.592 ± 0.37
3.437AspPro: 3.437 ± 0.664
2.367AspGln: 2.367 ± 0.325
2.592AspArg: 2.592 ± 0.444
3.606AspSer: 3.606 ± 0.457
3.212AspThr: 3.212 ± 0.417
4.113AspVal: 4.113 ± 0.543
0.958AspTrp: 0.958 ± 0.291
1.747AspTyr: 1.747 ± 0.354
0.0AspXaa: 0.0 ± 0.0
Glu
5.466GluAla: 5.466 ± 0.595
0.394GluCys: 0.394 ± 0.166
3.437GluAsp: 3.437 ± 0.601
5.409GluGlu: 5.409 ± 0.817
3.437GluPhe: 3.437 ± 0.578
5.184GluGly: 5.184 ± 0.739
1.014GluHis: 1.014 ± 0.259
3.494GluIle: 3.494 ± 0.417
5.24GluLys: 5.24 ± 0.718
5.466GluLeu: 5.466 ± 0.859
1.747GluMet: 1.747 ± 0.33
1.747GluAsn: 1.747 ± 0.45
1.972GluPro: 1.972 ± 0.413
2.479GluGln: 2.479 ± 0.389
3.888GluArg: 3.888 ± 0.54
2.986GluSer: 2.986 ± 0.373
3.381GluThr: 3.381 ± 0.555
4.226GluVal: 4.226 ± 0.641
1.409GluTrp: 1.409 ± 0.316
2.029GluTyr: 2.029 ± 0.376
0.0GluXaa: 0.0 ± 0.0
Phe
3.325PheAla: 3.325 ± 0.636
0.225PheCys: 0.225 ± 0.124
3.381PheAsp: 3.381 ± 0.505
3.212PheGlu: 3.212 ± 0.541
1.127PhePhe: 1.127 ± 0.22
3.55PheGly: 3.55 ± 0.466
0.507PheHis: 0.507 ± 0.177
2.198PheIle: 2.198 ± 0.352
1.972PheLys: 1.972 ± 0.409
3.212PheLeu: 3.212 ± 0.402
0.902PheMet: 0.902 ± 0.218
0.845PheAsn: 0.845 ± 0.196
1.014PhePro: 1.014 ± 0.207
1.578PheGln: 1.578 ± 0.339
1.916PheArg: 1.916 ± 0.331
2.423PheSer: 2.423 ± 0.389
2.423PheThr: 2.423 ± 0.405
2.479PheVal: 2.479 ± 0.377
0.451PheTrp: 0.451 ± 0.229
1.465PheTyr: 1.465 ± 0.416
0.0PheXaa: 0.0 ± 0.0
Gly
5.466GlyAla: 5.466 ± 0.6
0.338GlyCys: 0.338 ± 0.138
4.733GlyAsp: 4.733 ± 0.462
4.057GlyGlu: 4.057 ± 0.621
3.212GlyPhe: 3.212 ± 0.424
5.578GlyGly: 5.578 ± 0.755
1.014GlyHis: 1.014 ± 0.288
4.677GlyIle: 4.677 ± 0.778
4.79GlyLys: 4.79 ± 0.547
5.804GlyLeu: 5.804 ± 1.19
2.31GlyMet: 2.31 ± 0.416
2.479GlyAsn: 2.479 ± 0.456
2.592GlyPro: 2.592 ± 0.35
1.803GlyGln: 1.803 ± 0.304
3.888GlyArg: 3.888 ± 0.457
5.071GlySer: 5.071 ± 0.818
6.48GlyThr: 6.48 ± 0.682
7.043GlyVal: 7.043 ± 1.112
1.916GlyTrp: 1.916 ± 0.392
2.536GlyTyr: 2.536 ± 0.578
0.0GlyXaa: 0.0 ± 0.0
His
1.127HisAla: 1.127 ± 0.325
0.225HisCys: 0.225 ± 0.115
1.071HisAsp: 1.071 ± 0.236
1.071HisGlu: 1.071 ± 0.241
0.62HisPhe: 0.62 ± 0.228
1.24HisGly: 1.24 ± 0.291
0.62HisHis: 0.62 ± 0.215
1.409HisIle: 1.409 ± 0.349
0.789HisLys: 0.789 ± 0.191
1.521HisLeu: 1.521 ± 0.363
0.394HisMet: 0.394 ± 0.142
0.563HisAsn: 0.563 ± 0.176
0.902HisPro: 0.902 ± 0.297
1.071HisGln: 1.071 ± 0.293
0.62HisArg: 0.62 ± 0.223
1.352HisSer: 1.352 ± 0.378
0.676HisThr: 0.676 ± 0.236
1.521HisVal: 1.521 ± 0.403
0.507HisTrp: 0.507 ± 0.178
1.014HisTyr: 1.014 ± 0.189
0.0HisXaa: 0.0 ± 0.0
Ile
4.564IleAla: 4.564 ± 0.884
0.338IleCys: 0.338 ± 0.145
4.282IleAsp: 4.282 ± 0.506
4.62IleGlu: 4.62 ± 0.787
2.31IlePhe: 2.31 ± 0.319
4.001IleGly: 4.001 ± 0.637
1.24IleHis: 1.24 ± 0.309
2.31IleIle: 2.31 ± 0.38
3.043IleLys: 3.043 ± 0.577
4.339IleLeu: 4.339 ± 0.532
0.902IleMet: 0.902 ± 0.23
2.085IleAsn: 2.085 ± 0.316
2.029IlePro: 2.029 ± 0.285
2.029IleGln: 2.029 ± 0.349
2.874IleArg: 2.874 ± 0.427
3.719IleSer: 3.719 ± 0.701
3.888IleThr: 3.888 ± 0.54
3.944IleVal: 3.944 ± 0.383
0.563IleTrp: 0.563 ± 0.183
1.69IleTyr: 1.69 ± 0.269
0.0IleXaa: 0.0 ± 0.0
Lys
6.874LysAla: 6.874 ± 0.667
0.338LysCys: 0.338 ± 0.147
3.325LysAsp: 3.325 ± 0.589
3.494LysGlu: 3.494 ± 0.532
2.592LysPhe: 2.592 ± 0.448
3.944LysGly: 3.944 ± 0.605
0.958LysHis: 0.958 ± 0.246
2.986LysIle: 2.986 ± 0.337
5.747LysLys: 5.747 ± 0.694
6.086LysLeu: 6.086 ± 0.658
2.085LysMet: 2.085 ± 0.421
2.705LysAsn: 2.705 ± 0.435
2.198LysPro: 2.198 ± 0.362
2.592LysGln: 2.592 ± 0.304
3.888LysArg: 3.888 ± 0.49
4.001LysSer: 4.001 ± 0.636
4.395LysThr: 4.395 ± 0.479
4.395LysVal: 4.395 ± 0.515
0.563LysTrp: 0.563 ± 0.219
1.972LysTyr: 1.972 ± 0.483
0.0LysXaa: 0.0 ± 0.0
Leu
8.17LeuAla: 8.17 ± 1.39
0.394LeuCys: 0.394 ± 0.158
5.466LeuAsp: 5.466 ± 0.556
5.409LeuGlu: 5.409 ± 0.544
2.367LeuPhe: 2.367 ± 0.412
5.466LeuGly: 5.466 ± 0.698
1.352LeuHis: 1.352 ± 0.331
4.564LeuIle: 4.564 ± 0.758
5.015LeuLys: 5.015 ± 0.604
5.522LeuLeu: 5.522 ± 1.164
2.029LeuMet: 2.029 ± 0.383
3.268LeuAsn: 3.268 ± 0.507
2.254LeuPro: 2.254 ± 0.324
2.31LeuGln: 2.31 ± 0.427
4.113LeuArg: 4.113 ± 0.639
6.198LeuSer: 6.198 ± 0.962
6.536LeuThr: 6.536 ± 0.766
6.818LeuVal: 6.818 ± 0.69
0.62LeuTrp: 0.62 ± 0.206
2.141LeuTyr: 2.141 ± 0.398
0.0LeuXaa: 0.0 ± 0.0
Met
1.972MetAla: 1.972 ± 0.379
0.225MetCys: 0.225 ± 0.121
1.578MetAsp: 1.578 ± 0.33
1.409MetGlu: 1.409 ± 0.331
0.563MetPhe: 0.563 ± 0.193
1.578MetGly: 1.578 ± 0.301
0.676MetHis: 0.676 ± 0.188
1.24MetIle: 1.24 ± 0.299
2.141MetLys: 2.141 ± 0.362
2.029MetLeu: 2.029 ± 0.365
0.282MetMet: 0.282 ± 0.141
1.183MetAsn: 1.183 ± 0.249
1.24MetPro: 1.24 ± 0.212
0.789MetGln: 0.789 ± 0.268
1.69MetArg: 1.69 ± 0.293
2.817MetSer: 2.817 ± 0.47
1.747MetThr: 1.747 ± 0.369
1.803MetVal: 1.803 ± 0.331
0.282MetTrp: 0.282 ± 0.137
0.733MetTyr: 0.733 ± 0.197
0.0MetXaa: 0.0 ± 0.0
Asn
3.437AsnAla: 3.437 ± 0.457
0.113AsnCys: 0.113 ± 0.09
2.817AsnAsp: 2.817 ± 0.405
2.141AsnGlu: 2.141 ± 0.401
1.352AsnPhe: 1.352 ± 0.233
4.057AsnGly: 4.057 ± 0.417
1.127AsnHis: 1.127 ± 0.286
1.409AsnIle: 1.409 ± 0.285
1.916AsnLys: 1.916 ± 0.256
2.93AsnLeu: 2.93 ± 0.471
0.676AsnMet: 0.676 ± 0.208
1.69AsnAsn: 1.69 ± 0.294
1.972AsnPro: 1.972 ± 0.354
1.521AsnGln: 1.521 ± 0.316
2.592AsnArg: 2.592 ± 0.475
3.043AsnSer: 3.043 ± 0.466
2.198AsnThr: 2.198 ± 0.338
3.043AsnVal: 3.043 ± 0.497
0.507AsnTrp: 0.507 ± 0.198
1.352AsnTyr: 1.352 ± 0.287
0.0AsnXaa: 0.0 ± 0.0
Pro
3.381ProAla: 3.381 ± 0.606
0.282ProCys: 0.282 ± 0.146
2.029ProAsp: 2.029 ± 0.418
2.817ProGlu: 2.817 ± 0.576
1.296ProPhe: 1.296 ± 0.231
3.268ProGly: 3.268 ± 0.466
0.62ProHis: 0.62 ± 0.201
1.803ProIle: 1.803 ± 0.317
2.423ProLys: 2.423 ± 0.385
2.817ProLeu: 2.817 ± 0.35
0.958ProMet: 0.958 ± 0.182
1.747ProAsn: 1.747 ± 0.336
1.69ProPro: 1.69 ± 0.411
1.352ProGln: 1.352 ± 0.293
1.747ProArg: 1.747 ± 0.377
2.536ProSer: 2.536 ± 0.348
3.944ProThr: 3.944 ± 0.549
2.536ProVal: 2.536 ± 0.36
0.338ProTrp: 0.338 ± 0.168
1.127ProTyr: 1.127 ± 0.235
0.0ProXaa: 0.0 ± 0.0
Gln
3.775GlnAla: 3.775 ± 0.614
0.056GlnCys: 0.056 ± 0.068
1.634GlnAsp: 1.634 ± 0.345
1.803GlnGlu: 1.803 ± 0.289
1.521GlnPhe: 1.521 ± 0.349
2.085GlnGly: 2.085 ± 0.303
0.733GlnHis: 0.733 ± 0.276
2.085GlnIle: 2.085 ± 0.297
1.972GlnLys: 1.972 ± 0.327
3.606GlnLeu: 3.606 ± 0.625
1.578GlnMet: 1.578 ± 0.348
1.69GlnAsn: 1.69 ± 0.366
1.296GlnPro: 1.296 ± 0.364
2.029GlnGln: 2.029 ± 0.334
1.916GlnArg: 1.916 ± 0.391
1.803GlnSer: 1.803 ± 0.361
2.029GlnThr: 2.029 ± 0.45
1.859GlnVal: 1.859 ± 0.368
0.113GlnTrp: 0.113 ± 0.079
1.409GlnTyr: 1.409 ± 0.202
0.0GlnXaa: 0.0 ± 0.0
Arg
4.226ArgAla: 4.226 ± 0.448
0.282ArgCys: 0.282 ± 0.131
2.93ArgAsp: 2.93 ± 0.373
3.212ArgGlu: 3.212 ± 0.48
1.352ArgPhe: 1.352 ± 0.262
3.212ArgGly: 3.212 ± 0.493
1.014ArgHis: 1.014 ± 0.255
2.648ArgIle: 2.648 ± 0.416
3.888ArgLys: 3.888 ± 0.555
4.226ArgLeu: 4.226 ± 0.608
1.578ArgMet: 1.578 ± 0.265
2.874ArgAsn: 2.874 ± 0.461
2.198ArgPro: 2.198 ± 0.48
2.141ArgGln: 2.141 ± 0.451
2.592ArgArg: 2.592 ± 0.405
2.479ArgSer: 2.479 ± 0.361
3.775ArgThr: 3.775 ± 0.61
3.437ArgVal: 3.437 ± 0.478
0.62ArgTrp: 0.62 ± 0.164
1.69ArgTyr: 1.69 ± 0.416
0.0ArgXaa: 0.0 ± 0.0
Ser
6.311SerAla: 6.311 ± 0.886
0.451SerCys: 0.451 ± 0.172
2.761SerAsp: 2.761 ± 0.524
2.874SerGlu: 2.874 ± 0.386
2.592SerPhe: 2.592 ± 0.502
6.311SerGly: 6.311 ± 0.848
1.071SerHis: 1.071 ± 0.252
3.494SerIle: 3.494 ± 0.519
3.888SerLys: 3.888 ± 0.412
5.071SerLeu: 5.071 ± 0.676
1.465SerMet: 1.465 ± 0.226
2.141SerAsn: 2.141 ± 0.377
3.043SerPro: 3.043 ± 0.367
1.859SerGln: 1.859 ± 0.336
2.592SerArg: 2.592 ± 0.369
4.226SerSer: 4.226 ± 0.627
4.508SerThr: 4.508 ± 0.7
4.395SerVal: 4.395 ± 0.506
1.296SerTrp: 1.296 ± 0.292
1.859SerTyr: 1.859 ± 0.305
0.0SerXaa: 0.0 ± 0.0
Thr
6.48ThrAla: 6.48 ± 1.028
0.338ThrCys: 0.338 ± 0.201
4.733ThrAsp: 4.733 ± 0.782
2.93ThrGlu: 2.93 ± 0.33
2.986ThrPhe: 2.986 ± 0.361
5.804ThrGly: 5.804 ± 0.638
1.465ThrHis: 1.465 ± 0.299
3.888ThrIle: 3.888 ± 0.473
4.282ThrLys: 4.282 ± 0.466
5.747ThrLeu: 5.747 ± 0.634
1.24ThrMet: 1.24 ± 0.281
2.367ThrAsn: 2.367 ± 0.321
3.719ThrPro: 3.719 ± 0.581
1.972ThrGln: 1.972 ± 0.444
2.536ThrArg: 2.536 ± 0.441
4.001ThrSer: 4.001 ± 0.646
6.029ThrThr: 6.029 ± 0.905
5.353ThrVal: 5.353 ± 0.709
0.902ThrTrp: 0.902 ± 0.292
2.705ThrTyr: 2.705 ± 0.543
0.0ThrXaa: 0.0 ± 0.0
Val
6.762ValAla: 6.762 ± 0.498
0.845ValCys: 0.845 ± 0.25
3.832ValAsp: 3.832 ± 0.557
4.959ValGlu: 4.959 ± 0.76
2.536ValPhe: 2.536 ± 0.429
4.846ValGly: 4.846 ± 0.534
1.465ValHis: 1.465 ± 0.27
4.451ValIle: 4.451 ± 0.522
5.128ValLys: 5.128 ± 0.517
6.536ValLeu: 6.536 ± 0.666
1.296ValMet: 1.296 ± 0.248
3.494ValAsn: 3.494 ± 0.32
2.592ValPro: 2.592 ± 0.433
2.029ValGln: 2.029 ± 0.337
3.663ValArg: 3.663 ± 0.44
3.944ValSer: 3.944 ± 0.466
4.846ValThr: 4.846 ± 0.599
5.691ValVal: 5.691 ± 0.71
0.958ValTrp: 0.958 ± 0.258
2.93ValTyr: 2.93 ± 0.66
0.0ValXaa: 0.0 ± 0.0
Trp
1.183TrpAla: 1.183 ± 0.297
0.113TrpCys: 0.113 ± 0.078
1.465TrpAsp: 1.465 ± 0.318
1.014TrpGlu: 1.014 ± 0.258
0.845TrpPhe: 0.845 ± 0.255
1.014TrpGly: 1.014 ± 0.247
0.451TrpHis: 0.451 ± 0.158
0.338TrpIle: 0.338 ± 0.144
0.958TrpLys: 0.958 ± 0.298
1.127TrpLeu: 1.127 ± 0.258
0.394TrpMet: 0.394 ± 0.23
0.733TrpAsn: 0.733 ± 0.271
0.282TrpPro: 0.282 ± 0.147
0.282TrpGln: 0.282 ± 0.128
0.563TrpArg: 0.563 ± 0.173
0.902TrpSer: 0.902 ± 0.27
1.24TrpThr: 1.24 ± 0.424
0.845TrpVal: 0.845 ± 0.23
0.113TrpTrp: 0.113 ± 0.085
0.338TrpTyr: 0.338 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.268TyrAla: 3.268 ± 0.575
0.225TyrCys: 0.225 ± 0.133
2.705TyrAsp: 2.705 ± 0.403
2.592TyrGlu: 2.592 ± 0.467
1.465TyrPhe: 1.465 ± 0.328
2.648TyrGly: 2.648 ± 0.44
0.62TyrHis: 0.62 ± 0.177
1.578TyrIle: 1.578 ± 0.345
2.085TyrLys: 2.085 ± 0.434
2.254TyrLeu: 2.254 ± 0.304
0.451TyrMet: 0.451 ± 0.213
1.916TyrAsn: 1.916 ± 0.368
1.352TyrPro: 1.352 ± 0.278
1.127TyrGln: 1.127 ± 0.218
1.803TyrArg: 1.803 ± 0.35
1.521TyrSer: 1.521 ± 0.312
2.141TyrThr: 2.141 ± 0.414
2.198TyrVal: 2.198 ± 0.412
0.394TyrTrp: 0.394 ± 0.188
1.183TyrTyr: 1.183 ± 0.287
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 91 proteins (17748 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski