Amino acid dipeptide frequency for Vibrio phage R01

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.
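As an illustration of the idea, here is a minimal Python sketch of how dipeptide frequencies could be computed from a set of protein sequences. It rests on assumptions that are not stated on this page: overlapping adjacent pairs are counted within each protein, never across protein boundaries, and the counts are normalized by the total number of pairs. The database may normalize differently. The function name and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein strings.

    Assumption: overlapping adjacent residue pairs are counted inside each
    protein only, and normalized by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        seq = seq.upper()
        # Slide a window of width 2 along the sequence.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Made-up toy "proteome", for illustration only.
toy_proteome = ["MAAL", "MALK"]
freqs = dipeptide_frequencies(toy_proteome)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Counting pairs per protein keeps the last residue of one protein from being paired with the first residue of the next, which would otherwise create dipeptides that do not exist in any sequence.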

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
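The uniform expectation above is simple arithmetic: 1/400 = 0.0025, that is 0.25% or 2.5‰ (and 1/441, about 2.27‰, if Xaa is included). The following Python sketch compares an observed value with that expectation. How the page actually chooses its colors (any tolerance or shading scale) is not specified here, so the plain greater-than/less-than rule and the function names are assumptions.

```python
def expected_uniform_per_mille(alphabet_size=20):
    """Expected frequency (per mille) of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def classify(observed_per_mille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation.

    Assumption: a plain comparison without any tolerance band.
    """
    expected = expected_uniform_per_mille(alphabet_size)
    if observed_per_mille > expected:
        return "over"
    if observed_per_mille < expected:
        return "under"
    return "as expected"


# Two values taken from the table on this page.
print(classify(7.654))  # AlaAla
print(classify(0.176))  # TrpTrp
```

Note that the uniform expectation ignores the fact that single amino acids are themselves unequally frequent, so a rare residue such as Trp will tend to form underrepresented dipeptides under this rule.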

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
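For readers who prefer fractions or percentages, the unit conversion can be sketched in Python as follows. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 7.654  # AlaAla frequency in per mille, from the table
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)
print(f"{fraction:.6f} {percent:.4f}%")
```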

For more information, see the sequence space article on Wikipedia.

Residue order (first and second residue): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 7.654 ± 0.897
AlaCys: 0.836 ± 0.197
AlaAsp: 5.411 ± 0.445
AlaGlu: 5.235 ± 0.579
AlaPhe: 3.387 ± 0.408
AlaGly: 5.455 ± 0.505
AlaHis: 1.76 ± 0.27
AlaIle: 4.663 ± 0.499
AlaLys: 4.311 ± 0.429
AlaLeu: 7.346 ± 0.587
AlaMet: 2.244 ± 0.392
AlaAsn: 2.859 ± 0.366
AlaPro: 2.112 ± 0.317
AlaGln: 2.639 ± 0.371
AlaArg: 4.179 ± 0.403
AlaSer: 4.003 ± 0.423
AlaThr: 5.323 ± 0.532
AlaVal: 5.851 ± 0.676
AlaTrp: 0.88 ± 0.184
AlaTyr: 2.288 ± 0.317
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.276 ± 0.246
CysCys: 0.44 ± 0.136
CysAsp: 1.1 ± 0.23
CysGlu: 1.056 ± 0.18
CysPhe: 0.484 ± 0.159
CysGly: 0.748 ± 0.186
CysHis: 0.308 ± 0.1
CysIle: 0.616 ± 0.15
CysLys: 0.572 ± 0.164
CysLeu: 1.364 ± 0.257
CysMet: 0.396 ± 0.13
CysAsn: 0.352 ± 0.117
CysPro: 0.484 ± 0.139
CysGln: 0.264 ± 0.113
CysArg: 0.792 ± 0.203
CysSer: 0.484 ± 0.145
CysThr: 0.308 ± 0.109
CysVal: 0.748 ± 0.206
CysTrp: 0.396 ± 0.137
CysTyr: 0.396 ± 0.125
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.971 ± 0.446
AspCys: 0.528 ± 0.141
AspAsp: 4.795 ± 0.64
AspGlu: 5.059 ± 0.524
AspPhe: 3.431 ± 0.325
AspGly: 4.883 ± 0.495
AspHis: 1.364 ± 0.221
AspIle: 3.387 ± 0.392
AspLys: 3.915 ± 0.523
AspLeu: 6.379 ± 0.491
AspMet: 1.892 ± 0.307
AspAsn: 2.551 ± 0.311
AspPro: 2.947 ± 0.43
AspGln: 2.332 ± 0.373
AspArg: 3.475 ± 0.414
AspSer: 3.255 ± 0.421
AspThr: 3.299 ± 0.424
AspVal: 3.739 ± 0.443
AspTrp: 1.452 ± 0.239
AspTyr: 2.376 ± 0.316
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.763 ± 0.55
GluCys: 0.924 ± 0.212
GluAsp: 5.235 ± 0.526
GluGlu: 5.763 ± 0.785
GluPhe: 2.727 ± 0.308
GluGly: 4.971 ± 0.467
GluHis: 1.496 ± 0.293
GluIle: 3.915 ± 0.427
GluLys: 3.607 ± 0.427
GluLeu: 7.434 ± 0.592
GluMet: 2.288 ± 0.284
GluAsn: 3.035 ± 0.348
GluPro: 2.112 ± 0.315
GluGln: 2.815 ± 0.356
GluArg: 3.475 ± 0.39
GluSer: 4.091 ± 0.435
GluThr: 3.475 ± 0.323
GluVal: 4.795 ± 0.526
GluTrp: 0.836 ± 0.199
GluTyr: 2.771 ± 0.431
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.079 ± 0.419
PheCys: 0.616 ± 0.191
PheAsp: 2.639 ± 0.299
PheGlu: 3.123 ± 0.336
PhePhe: 1.76 ± 0.3
PheGly: 2.947 ± 0.373
PheHis: 0.924 ± 0.189
PheIle: 1.936 ± 0.304
PheLys: 2.156 ± 0.367
PheLeu: 2.903 ± 0.365
PheMet: 0.88 ± 0.205
PheAsn: 1.804 ± 0.275
PhePro: 1.848 ± 0.274
PheGln: 1.144 ± 0.217
PheArg: 2.376 ± 0.364
PheSer: 2.991 ± 0.39
PheThr: 3.035 ± 0.355
PheVal: 2.419 ± 0.305
PheTrp: 0.88 ± 0.244
PheTyr: 1.188 ± 0.321
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.103 ± 0.723
GlyCys: 0.924 ± 0.188
GlyAsp: 5.059 ± 0.573
GlyGlu: 5.279 ± 0.488
GlyPhe: 3.211 ± 0.398
GlyGly: 5.587 ± 0.575
GlyHis: 1.452 ± 0.272
GlyIle: 3.739 ± 0.317
GlyLys: 5.895 ± 0.55
GlyLeu: 4.971 ± 0.597
GlyMet: 2.683 ± 0.432
GlyAsn: 2.463 ± 0.281
GlyPro: 1.232 ± 0.209
GlyGln: 2.463 ± 0.32
GlyArg: 3.079 ± 0.415
GlySer: 4.003 ± 0.489
GlyThr: 4.223 ± 0.496
GlyVal: 4.751 ± 0.42
GlyTrp: 1.056 ± 0.229
GlyTyr: 2.859 ± 0.34
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.408 ± 0.242
HisCys: 0.484 ± 0.168
HisAsp: 1.364 ± 0.299
HisGlu: 1.1 ± 0.234
HisPhe: 1.276 ± 0.291
HisGly: 1.32 ± 0.231
HisHis: 0.66 ± 0.207
HisIle: 1.144 ± 0.242
HisLys: 0.88 ± 0.191
HisLeu: 2.156 ± 0.286
HisMet: 0.44 ± 0.156
HisAsn: 0.616 ± 0.139
HisPro: 1.144 ± 0.235
HisGln: 0.836 ± 0.196
HisArg: 1.408 ± 0.252
HisSer: 1.232 ± 0.213
HisThr: 1.056 ± 0.222
HisVal: 1.232 ± 0.198
HisTrp: 0.616 ± 0.146
HisTyr: 0.792 ± 0.207
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.871 ± 0.486
IleCys: 0.572 ± 0.142
IleAsp: 3.827 ± 0.474
IleGlu: 4.707 ± 0.445
IlePhe: 1.584 ± 0.28
IleGly: 3.167 ± 0.436
IleHis: 1.32 ± 0.238
IleIle: 2.727 ± 0.296
IleLys: 3.079 ± 0.415
IleLeu: 4.575 ± 0.429
IleMet: 1.188 ± 0.271
IleAsn: 2.244 ± 0.313
IlePro: 2.288 ± 0.288
IleGln: 2.2 ± 0.284
IleArg: 2.991 ± 0.351
IleSer: 3.123 ± 0.383
IleThr: 4.003 ± 0.445
IleVal: 3.079 ± 0.414
IleTrp: 0.66 ± 0.17
IleTyr: 1.848 ± 0.286
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.103 ± 0.567
LysCys: 0.748 ± 0.186
LysAsp: 3.343 ± 0.326
LysGlu: 4.223 ± 0.429
LysPhe: 2.068 ± 0.282
LysGly: 4.619 ± 0.392
LysHis: 1.804 ± 0.239
LysIle: 3.255 ± 0.406
LysLys: 5.191 ± 0.597
LysLeu: 5.147 ± 0.466
LysMet: 1.936 ± 0.281
LysAsn: 2.991 ± 0.295
LysPro: 3.783 ± 0.452
LysGln: 3.079 ± 0.358
LysArg: 3.695 ± 0.464
LysSer: 2.991 ± 0.342
LysThr: 3.519 ± 0.367
LysVal: 4.091 ± 0.381
LysTrp: 1.144 ± 0.269
LysTyr: 2.332 ± 0.301
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 7.083 ± 0.549
LeuCys: 0.968 ± 0.214
LeuAsp: 5.851 ± 0.487
LeuGlu: 5.455 ± 0.545
LeuPhe: 3.035 ± 0.325
LeuGly: 4.663 ± 0.503
LeuHis: 1.672 ± 0.256
LeuIle: 4.927 ± 0.382
LeuLys: 5.411 ± 0.601
LeuLeu: 7.083 ± 0.686
LeuMet: 1.672 ± 0.215
LeuAsn: 4.047 ± 0.506
LeuPro: 3.211 ± 0.416
LeuGln: 2.859 ± 0.365
LeuArg: 4.795 ± 0.526
LeuSer: 5.323 ± 0.497
LeuThr: 6.511 ± 0.55
LeuVal: 6.599 ± 0.604
LeuTrp: 0.968 ± 0.181
LeuTyr: 2.595 ± 0.342
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.255 ± 0.421
MetCys: 0.528 ± 0.152
MetAsp: 1.54 ± 0.246
MetGlu: 2.112 ± 0.298
MetPhe: 1.232 ± 0.221
MetGly: 1.848 ± 0.289
MetHis: 0.484 ± 0.15
MetIle: 0.88 ± 0.157
MetLys: 1.276 ± 0.21
MetLeu: 1.892 ± 0.268
MetMet: 0.528 ± 0.215
MetAsn: 0.924 ± 0.214
MetPro: 1.364 ± 0.247
MetGln: 1.056 ± 0.171
MetArg: 1.672 ± 0.289
MetSer: 2.376 ± 0.353
MetThr: 0.968 ± 0.206
MetVal: 1.76 ± 0.259
MetTrp: 0.132 ± 0.073
MetTyr: 0.836 ± 0.186
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.167 ± 0.352
AsnCys: 0.528 ± 0.121
AsnAsp: 2.815 ± 0.344
AsnGlu: 1.848 ± 0.278
AsnPhe: 2.024 ± 0.242
AsnGly: 3.167 ± 0.352
AsnHis: 1.012 ± 0.239
AsnIle: 1.804 ± 0.302
AsnLys: 2.771 ± 0.289
AsnLeu: 3.431 ± 0.448
AsnMet: 1.584 ± 0.255
AsnAsn: 2.551 ± 0.47
AsnPro: 2.332 ± 0.277
AsnGln: 1.584 ± 0.235
AsnArg: 2.332 ± 0.381
AsnSer: 2.244 ± 0.274
AsnThr: 2.551 ± 0.318
AsnVal: 2.859 ± 0.306
AsnTrp: 0.66 ± 0.167
AsnTyr: 1.584 ± 0.25
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.991 ± 0.389
ProCys: 0.528 ± 0.14
ProAsp: 2.639 ± 0.344
ProGlu: 2.903 ± 0.343
ProPhe: 1.848 ± 0.268
ProGly: 2.288 ± 0.289
ProHis: 0.792 ± 0.194
ProIle: 2.024 ± 0.309
ProLys: 2.815 ± 0.357
ProLeu: 2.771 ± 0.415
ProMet: 0.792 ± 0.19
ProAsn: 1.76 ± 0.32
ProPro: 1.54 ± 0.263
ProGln: 1.188 ± 0.214
ProArg: 2.112 ± 0.315
ProSer: 2.244 ± 0.377
ProThr: 2.903 ± 0.37
ProVal: 3.607 ± 0.442
ProTrp: 0.572 ± 0.153
ProTyr: 1.188 ± 0.204
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.035 ± 0.342
GlnCys: 0.352 ± 0.141
GlnAsp: 2.068 ± 0.256
GlnGlu: 2.507 ± 0.323
GlnPhe: 1.892 ± 0.289
GlnGly: 2.068 ± 0.325
GlnHis: 0.88 ± 0.196
GlnIle: 1.892 ± 0.296
GlnLys: 2.156 ± 0.318
GlnLeu: 3.651 ± 0.38
GlnMet: 0.88 ± 0.173
GlnAsn: 1.54 ± 0.25
GlnPro: 1.012 ± 0.253
GlnGln: 1.584 ± 0.312
GlnArg: 2.507 ± 0.431
GlnSer: 2.112 ± 0.285
GlnThr: 1.716 ± 0.268
GlnVal: 2.507 ± 0.332
GlnTrp: 0.66 ± 0.175
GlnTyr: 1.012 ± 0.205
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.959 ± 0.407
ArgCys: 0.88 ± 0.244
ArgAsp: 3.431 ± 0.454
ArgGlu: 4.619 ± 0.456
ArgPhe: 1.98 ± 0.294
ArgGly: 4.531 ± 0.665
ArgHis: 0.88 ± 0.194
ArgIle: 3.079 ± 0.405
ArgLys: 3.651 ± 0.396
ArgLeu: 4.047 ± 0.376
ArgMet: 1.584 ± 0.252
ArgAsn: 2.112 ± 0.369
ArgPro: 2.156 ± 0.245
ArgGln: 1.672 ± 0.273
ArgArg: 3.739 ± 0.533
ArgSer: 3.475 ± 0.418
ArgThr: 3.079 ± 0.383
ArgVal: 3.959 ± 0.47
ArgTrp: 0.792 ± 0.198
ArgTyr: 2.068 ± 0.361
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.343 ± 0.32
SerCys: 0.484 ± 0.133
SerAsp: 3.167 ± 0.361
SerGlu: 3.607 ± 0.345
SerPhe: 2.288 ± 0.389
SerGly: 4.795 ± 0.398
SerHis: 1.32 ± 0.255
SerIle: 3.475 ± 0.44
SerLys: 4.135 ± 0.443
SerLeu: 4.443 ± 0.512
SerMet: 1.32 ± 0.219
SerAsn: 2.288 ± 0.267
SerPro: 2.815 ± 0.369
SerGln: 2.288 ± 0.368
SerArg: 3.695 ± 0.619
SerSer: 3.475 ± 0.466
SerThr: 4.135 ± 0.46
SerVal: 3.387 ± 0.372
SerTrp: 0.616 ± 0.129
SerTyr: 2.156 ± 0.302
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.531 ± 0.558
ThrCys: 0.616 ± 0.176
ThrAsp: 3.387 ± 0.384
ThrGlu: 3.211 ± 0.386
ThrPhe: 2.507 ± 0.362
ThrGly: 4.531 ± 0.439
ThrHis: 0.88 ± 0.178
ThrIle: 3.167 ± 0.397
ThrLys: 5.587 ± 0.509
ThrLeu: 5.983 ± 0.48
ThrMet: 1.144 ± 0.202
ThrAsn: 3.035 ± 0.38
ThrPro: 3.167 ± 0.518
ThrGln: 1.892 ± 0.249
ThrArg: 2.683 ± 0.364
ThrSer: 3.079 ± 0.495
ThrThr: 2.991 ± 0.396
ThrVal: 4.531 ± 0.515
ThrTrp: 1.056 ± 0.198
ThrTyr: 2.156 ± 0.232
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.147 ± 0.433
ValCys: 0.924 ± 0.234
ValAsp: 4.135 ± 0.394
ValGlu: 5.235 ± 0.524
ValPhe: 2.288 ± 0.316
ValGly: 4.575 ± 0.452
ValHis: 1.012 ± 0.193
ValIle: 3.651 ± 0.395
ValLys: 4.707 ± 0.353
ValLeu: 5.015 ± 0.493
ValMet: 2.112 ± 0.281
ValAsn: 3.651 ± 0.445
ValPro: 2.551 ± 0.319
ValGln: 2.332 ± 0.304
ValArg: 3.563 ± 0.346
ValSer: 4.135 ± 0.557
ValThr: 4.355 ± 0.572
ValVal: 5.015 ± 0.528
ValTrp: 1.188 ± 0.23
ValTyr: 2.639 ± 0.395
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.144 ± 0.224
TrpCys: 0.22 ± 0.094
TrpAsp: 1.364 ± 0.218
TrpGlu: 1.54 ± 0.27
TrpPhe: 0.572 ± 0.18
TrpGly: 0.748 ± 0.198
TrpHis: 0.264 ± 0.119
TrpIle: 0.968 ± 0.212
TrpLys: 1.012 ± 0.216
TrpLeu: 1.54 ± 0.31
TrpMet: 0.176 ± 0.091
TrpAsn: 0.528 ± 0.154
TrpPro: 0.0 ± 0.0
TrpGln: 0.616 ± 0.148
TrpArg: 0.924 ± 0.193
TrpSer: 0.792 ± 0.199
TrpThr: 0.968 ± 0.173
TrpVal: 1.276 ± 0.266
TrpTrp: 0.176 ± 0.084
TrpTyr: 0.44 ± 0.147
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.551 ± 0.339
TyrCys: 0.484 ± 0.128
TyrAsp: 2.859 ± 0.298
TyrGlu: 2.947 ± 0.38
TyrPhe: 1.056 ± 0.184
TyrGly: 3.123 ± 0.302
TyrHis: 0.88 ± 0.201
TyrIle: 1.804 ± 0.263
TyrLys: 1.936 ± 0.303
TyrLeu: 2.595 ± 0.338
TyrMet: 0.88 ± 0.175
TyrAsn: 1.496 ± 0.301
TyrPro: 1.408 ± 0.259
TyrGln: 1.188 ± 0.25
TyrArg: 2.244 ± 0.344
TyrSer: 1.804 ± 0.243
TyrThr: 1.716 ± 0.29
TyrVal: 2.024 ± 0.287
TyrTrp: 0.528 ± 0.185
TyrTyr: 1.628 ± 0.263
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (22733 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
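To make the idea of a protein-level bootstrap concrete, here is a minimal Python sketch. The exact procedure used by the database is not described on this page, so several things are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the error is taken as the standard deviation across resamples. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of protein strings.

    Assumption: overlapping pairs within each protein, normalized by the
    total number of pairs.
    """
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Made-up toy "proteome", for illustration only.
toy_proteome = ["MAAL", "MALK", "AAAA", "MKLV"]
error = bootstrap_error(toy_proteome, "AA")
print(round(error, 3))
```

Resampling whole proteins rather than individual dipeptides keeps the pairs from one protein together, so the estimate reflects variation between proteins.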

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski