Amino acid dipepetide frequency for Flavobacterium phage FPSV-F12

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.968AlaAla: 2.968 ± 0.36
0.366AlaCys: 0.366 ± 0.145
2.455AlaAsp: 2.455 ± 0.299
3.737AlaGlu: 3.737 ± 0.433
1.759AlaPhe: 1.759 ± 0.231
3.774AlaGly: 3.774 ± 0.512
0.733AlaHis: 0.733 ± 0.163
3.627AlaIle: 3.627 ± 0.37
4.873AlaLys: 4.873 ± 0.647
3.92AlaLeu: 3.92 ± 0.443
1.502AlaMet: 1.502 ± 0.26
3.114AlaAsn: 3.114 ± 0.527
1.319AlaPro: 1.319 ± 0.241
1.869AlaGln: 1.869 ± 0.346
1.539AlaArg: 1.539 ± 0.296
2.785AlaSer: 2.785 ± 0.297
2.711AlaThr: 2.711 ± 0.389
2.272AlaVal: 2.272 ± 0.322
0.403AlaTrp: 0.403 ± 0.14
2.015AlaTyr: 2.015 ± 0.313
0.0AlaXaa: 0.0 ± 0.0
Cys
0.256CysAla: 0.256 ± 0.09
0.147CysCys: 0.147 ± 0.076
0.476CysAsp: 0.476 ± 0.174
0.623CysGlu: 0.623 ± 0.163
0.293CysPhe: 0.293 ± 0.114
0.696CysGly: 0.696 ± 0.214
0.183CysHis: 0.183 ± 0.085
0.733CysIle: 0.733 ± 0.224
1.172CysLys: 1.172 ± 0.322
0.843CysLeu: 0.843 ± 0.291
0.11CysMet: 0.11 ± 0.073
0.806CysAsn: 0.806 ± 0.236
0.476CysPro: 0.476 ± 0.138
0.183CysGln: 0.183 ± 0.085
0.11CysArg: 0.11 ± 0.061
0.403CysSer: 0.403 ± 0.145
0.586CysThr: 0.586 ± 0.139
0.623CysVal: 0.623 ± 0.172
0.073CysTrp: 0.073 ± 0.054
0.586CysTyr: 0.586 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
2.638AspAla: 2.638 ± 0.448
0.953AspCys: 0.953 ± 0.276
3.554AspAsp: 3.554 ± 0.336
5.203AspGlu: 5.203 ± 0.448
3.774AspPhe: 3.774 ± 0.289
3.224AspGly: 3.224 ± 0.309
0.586AspHis: 0.586 ± 0.171
4.983AspIle: 4.983 ± 0.439
5.606AspLys: 5.606 ± 0.348
5.02AspLeu: 5.02 ± 0.412
1.722AspMet: 1.722 ± 0.273
4.25AspAsn: 4.25 ± 0.344
0.806AspPro: 0.806 ± 0.173
0.66AspGln: 0.66 ± 0.129
1.832AspArg: 1.832 ± 0.279
4.067AspSer: 4.067 ± 0.397
3.224AspThr: 3.224 ± 0.476
3.188AspVal: 3.188 ± 0.384
0.769AspTrp: 0.769 ± 0.158
3.078AspTyr: 3.078 ± 0.287
0.0AspXaa: 0.0 ± 0.0
Glu
4.36GluAla: 4.36 ± 0.416
0.403GluCys: 0.403 ± 0.164
3.811GluAsp: 3.811 ± 0.338
7.071GluGlu: 7.071 ± 0.577
3.371GluPhe: 3.371 ± 0.373
3.737GluGly: 3.737 ± 0.418
1.172GluHis: 1.172 ± 0.181
5.862GluIle: 5.862 ± 0.516
7.621GluLys: 7.621 ± 0.808
7.255GluLeu: 7.255 ± 0.524
1.832GluMet: 1.832 ± 0.274
6.229GluAsn: 6.229 ± 0.733
1.466GluPro: 1.466 ± 0.215
3.884GluGln: 3.884 ± 0.39
2.162GluArg: 2.162 ± 0.352
3.737GluSer: 3.737 ± 0.404
4.433GluThr: 4.433 ± 0.497
4.726GluVal: 4.726 ± 0.371
1.026GluTrp: 1.026 ± 0.215
3.298GluTyr: 3.298 ± 0.342
0.0GluXaa: 0.0 ± 0.0
Phe
2.015PheAla: 2.015 ± 0.282
0.33PheCys: 0.33 ± 0.121
3.371PheAsp: 3.371 ± 0.362
3.737PheGlu: 3.737 ± 0.423
1.612PhePhe: 1.612 ± 0.241
2.565PheGly: 2.565 ± 0.354
0.476PheHis: 0.476 ± 0.108
4.323PheIle: 4.323 ± 0.313
4.03PheLys: 4.03 ± 0.447
2.785PheLeu: 2.785 ± 0.324
0.806PheMet: 0.806 ± 0.148
4.617PheAsn: 4.617 ± 0.477
1.502PhePro: 1.502 ± 0.315
1.759PheGln: 1.759 ± 0.27
1.759PheArg: 1.759 ± 0.3
2.382PheSer: 2.382 ± 0.277
2.675PheThr: 2.675 ± 0.392
2.711PheVal: 2.711 ± 0.301
0.366PheTrp: 0.366 ± 0.106
2.308PheTyr: 2.308 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
2.125GlyAla: 2.125 ± 0.316
0.696GlyCys: 0.696 ± 0.208
2.748GlyAsp: 2.748 ± 0.27
3.481GlyGlu: 3.481 ± 0.397
3.151GlyPhe: 3.151 ± 0.328
2.565GlyGly: 2.565 ± 0.419
0.55GlyHis: 0.55 ± 0.147
5.13GlyIle: 5.13 ± 0.452
5.899GlyLys: 5.899 ± 0.652
4.177GlyLeu: 4.177 ± 0.365
1.282GlyMet: 1.282 ± 0.218
3.334GlyAsn: 3.334 ± 0.514
0.806GlyPro: 0.806 ± 0.169
2.015GlyGln: 2.015 ± 0.283
1.649GlyArg: 1.649 ± 0.282
3.554GlySer: 3.554 ± 0.581
3.737GlyThr: 3.737 ± 0.367
3.334GlyVal: 3.334 ± 0.396
0.66GlyTrp: 0.66 ± 0.182
2.711GlyTyr: 2.711 ± 0.319
0.0GlyXaa: 0.0 ± 0.0
His
0.66HisAla: 0.66 ± 0.16
0.11HisCys: 0.11 ± 0.074
0.586HisAsp: 0.586 ± 0.133
0.513HisGlu: 0.513 ± 0.136
0.769HisPhe: 0.769 ± 0.176
0.66HisGly: 0.66 ± 0.202
0.256HisHis: 0.256 ± 0.123
1.539HisIle: 1.539 ± 0.276
1.942HisLys: 1.942 ± 0.217
1.539HisLeu: 1.539 ± 0.232
0.33HisMet: 0.33 ± 0.115
1.099HisAsn: 1.099 ± 0.19
0.623HisPro: 0.623 ± 0.138
0.806HisGln: 0.806 ± 0.158
0.696HisArg: 0.696 ± 0.169
0.733HisSer: 0.733 ± 0.164
1.466HisThr: 1.466 ± 0.278
0.513HisVal: 0.513 ± 0.141
0.256HisTrp: 0.256 ± 0.096
0.769HisTyr: 0.769 ± 0.196
0.0HisXaa: 0.0 ± 0.0
Ile
3.884IleAla: 3.884 ± 0.406
0.586IleCys: 0.586 ± 0.19
5.752IleAsp: 5.752 ± 0.404
7.255IleGlu: 7.255 ± 0.499
2.968IlePhe: 2.968 ± 0.482
3.957IleGly: 3.957 ± 0.394
1.649IleHis: 1.649 ± 0.338
5.313IleIle: 5.313 ± 0.58
8.537IleLys: 8.537 ± 0.572
5.899IleLeu: 5.899 ± 0.638
1.685IleMet: 1.685 ± 0.287
6.632IleAsn: 6.632 ± 0.572
2.601IlePro: 2.601 ± 0.285
3.298IleGln: 3.298 ± 0.323
1.832IleArg: 1.832 ± 0.336
6.046IleSer: 6.046 ± 0.641
5.349IleThr: 5.349 ± 0.507
3.444IleVal: 3.444 ± 0.321
0.696IleTrp: 0.696 ± 0.163
3.737IleTyr: 3.737 ± 0.371
0.0IleXaa: 0.0 ± 0.0
Lys
4.323LysAla: 4.323 ± 0.484
0.733LysCys: 0.733 ± 0.22
6.742LysAsp: 6.742 ± 0.752
8.684LysGlu: 8.684 ± 0.954
3.957LysPhe: 3.957 ± 0.441
5.459LysGly: 5.459 ± 0.577
1.759LysHis: 1.759 ± 0.268
7.914LysIle: 7.914 ± 0.568
7.621LysLys: 7.621 ± 0.539
8.024LysLeu: 8.024 ± 0.662
2.382LysMet: 2.382 ± 0.295
6.961LysAsn: 6.961 ± 0.502
2.418LysPro: 2.418 ± 0.284
3.92LysGln: 3.92 ± 0.392
3.078LysArg: 3.078 ± 0.422
5.752LysSer: 5.752 ± 0.522
5.239LysThr: 5.239 ± 0.518
4.946LysVal: 4.946 ± 0.503
0.989LysTrp: 0.989 ± 0.183
3.884LysTyr: 3.884 ± 0.349
0.0LysXaa: 0.0 ± 0.0
Leu
4.983LeuAla: 4.983 ± 0.515
0.843LeuCys: 0.843 ± 0.26
4.617LeuAsp: 4.617 ± 0.408
6.119LeuGlu: 6.119 ± 0.444
3.371LeuPhe: 3.371 ± 0.363
4.14LeuGly: 4.14 ± 0.56
1.502LeuHis: 1.502 ± 0.234
6.155LeuIle: 6.155 ± 0.608
7.804LeuLys: 7.804 ± 0.576
7.621LeuLeu: 7.621 ± 0.547
1.685LeuMet: 1.685 ± 0.314
6.558LeuAsn: 6.558 ± 0.44
3.591LeuPro: 3.591 ± 0.397
3.078LeuGln: 3.078 ± 0.288
2.198LeuArg: 2.198 ± 0.309
5.642LeuSer: 5.642 ± 0.434
5.533LeuThr: 5.533 ± 0.359
3.957LeuVal: 3.957 ± 0.35
0.586LeuTrp: 0.586 ± 0.128
3.224LeuTyr: 3.224 ± 0.293
0.0LeuXaa: 0.0 ± 0.0
Met
1.282MetAla: 1.282 ± 0.245
0.037MetCys: 0.037 ± 0.037
0.879MetAsp: 0.879 ± 0.243
1.979MetGlu: 1.979 ± 0.279
1.136MetPhe: 1.136 ± 0.183
1.099MetGly: 1.099 ± 0.197
0.33MetHis: 0.33 ± 0.101
1.979MetIle: 1.979 ± 0.264
2.601MetLys: 2.601 ± 0.38
1.722MetLeu: 1.722 ± 0.277
0.513MetMet: 0.513 ± 0.137
1.612MetAsn: 1.612 ± 0.221
0.66MetPro: 0.66 ± 0.136
0.989MetGln: 0.989 ± 0.264
0.733MetArg: 0.733 ± 0.162
1.832MetSer: 1.832 ± 0.286
1.649MetThr: 1.649 ± 0.227
1.063MetVal: 1.063 ± 0.187
0.293MetTrp: 0.293 ± 0.124
0.916MetTyr: 0.916 ± 0.249
0.0MetXaa: 0.0 ± 0.0
Asn
3.224AsnAla: 3.224 ± 0.442
0.879AsnCys: 0.879 ± 0.254
4.323AsnAsp: 4.323 ± 0.463
4.25AsnGlu: 4.25 ± 0.519
3.407AsnPhe: 3.407 ± 0.328
4.287AsnGly: 4.287 ± 0.384
1.319AsnHis: 1.319 ± 0.218
6.632AsnIle: 6.632 ± 0.652
8.097AsnLys: 8.097 ± 0.577
5.752AsnLeu: 5.752 ± 0.43
1.905AsnMet: 1.905 ± 0.267
6.668AsnAsn: 6.668 ± 0.719
3.371AsnPro: 3.371 ± 0.335
2.895AsnGln: 2.895 ± 0.308
2.785AsnArg: 2.785 ± 0.34
5.02AsnSer: 5.02 ± 0.584
4.69AsnThr: 4.69 ± 0.534
3.92AsnVal: 3.92 ± 0.435
0.806AsnTrp: 0.806 ± 0.155
3.591AsnTyr: 3.591 ± 0.366
0.0AsnXaa: 0.0 ± 0.0
Pro
1.685ProAla: 1.685 ± 0.275
0.366ProCys: 0.366 ± 0.141
1.722ProAsp: 1.722 ± 0.299
2.601ProGlu: 2.601 ± 0.324
1.942ProPhe: 1.942 ± 0.362
1.905ProGly: 1.905 ± 0.352
0.513ProHis: 0.513 ± 0.134
2.858ProIle: 2.858 ± 0.288
2.308ProLys: 2.308 ± 0.276
2.601ProLeu: 2.601 ± 0.255
0.696ProMet: 0.696 ± 0.127
1.722ProAsn: 1.722 ± 0.277
0.66ProPro: 0.66 ± 0.162
1.026ProGln: 1.026 ± 0.182
0.66ProArg: 0.66 ± 0.181
1.905ProSer: 1.905 ± 0.237
1.832ProThr: 1.832 ± 0.351
2.272ProVal: 2.272 ± 0.295
0.11ProTrp: 0.11 ± 0.057
1.246ProTyr: 1.246 ± 0.2
0.0ProXaa: 0.0 ± 0.0
Gln
1.832GlnAla: 1.832 ± 0.229
0.11GlnCys: 0.11 ± 0.093
2.528GlnAsp: 2.528 ± 0.244
3.664GlnGlu: 3.664 ± 0.415
1.539GlnPhe: 1.539 ± 0.198
1.905GlnGly: 1.905 ± 0.26
0.696GlnHis: 0.696 ± 0.132
3.041GlnIle: 3.041 ± 0.347
3.224GlnLys: 3.224 ± 0.381
3.151GlnLeu: 3.151 ± 0.415
0.806GlnMet: 0.806 ± 0.159
3.334GlnAsn: 3.334 ± 0.392
1.246GlnPro: 1.246 ± 0.217
1.685GlnGln: 1.685 ± 0.198
1.209GlnArg: 1.209 ± 0.212
1.685GlnSer: 1.685 ± 0.258
2.382GlnThr: 2.382 ± 0.324
2.125GlnVal: 2.125 ± 0.31
0.256GlnTrp: 0.256 ± 0.112
1.905GlnTyr: 1.905 ± 0.214
0.0GlnXaa: 0.0 ± 0.0
Arg
1.649ArgAla: 1.649 ± 0.355
0.366ArgCys: 0.366 ± 0.133
1.905ArgAsp: 1.905 ± 0.263
2.601ArgGlu: 2.601 ± 0.341
1.392ArgPhe: 1.392 ± 0.2
1.282ArgGly: 1.282 ± 0.197
0.476ArgHis: 0.476 ± 0.104
2.895ArgIle: 2.895 ± 0.36
3.188ArgLys: 3.188 ± 0.379
2.748ArgLeu: 2.748 ± 0.278
1.063ArgMet: 1.063 ± 0.191
2.382ArgAsn: 2.382 ± 0.253
0.953ArgPro: 0.953 ± 0.221
0.989ArgGln: 0.989 ± 0.164
1.685ArgArg: 1.685 ± 0.353
0.953ArgSer: 0.953 ± 0.148
2.015ArgThr: 2.015 ± 0.224
1.612ArgVal: 1.612 ± 0.317
0.183ArgTrp: 0.183 ± 0.075
0.989ArgTyr: 0.989 ± 0.212
0.0ArgXaa: 0.0 ± 0.0
Ser
2.638SerAla: 2.638 ± 0.294
0.733SerCys: 0.733 ± 0.236
3.261SerAsp: 3.261 ± 0.336
3.957SerGlu: 3.957 ± 0.353
3.114SerPhe: 3.114 ± 0.326
3.334SerGly: 3.334 ± 0.499
0.989SerHis: 0.989 ± 0.196
5.459SerIle: 5.459 ± 0.49
5.716SerLys: 5.716 ± 0.472
5.972SerLeu: 5.972 ± 0.566
1.429SerMet: 1.429 ± 0.186
4.433SerAsn: 4.433 ± 0.591
2.125SerPro: 2.125 ± 0.327
2.308SerGln: 2.308 ± 0.368
1.649SerArg: 1.649 ± 0.267
3.884SerSer: 3.884 ± 0.432
3.444SerThr: 3.444 ± 0.36
3.371SerVal: 3.371 ± 0.398
0.476SerTrp: 0.476 ± 0.141
3.114SerTyr: 3.114 ± 0.463
0.0SerXaa: 0.0 ± 0.0
Thr
2.235ThrAla: 2.235 ± 0.38
0.55ThrCys: 0.55 ± 0.197
3.517ThrAsp: 3.517 ± 0.443
4.067ThrGlu: 4.067 ± 0.425
3.334ThrPhe: 3.334 ± 0.438
3.92ThrGly: 3.92 ± 0.435
0.989ThrHis: 0.989 ± 0.217
5.02ThrIle: 5.02 ± 0.411
4.873ThrLys: 4.873 ± 0.414
5.423ThrLeu: 5.423 ± 0.579
1.429ThrMet: 1.429 ± 0.224
4.91ThrAsn: 4.91 ± 0.543
2.308ThrPro: 2.308 ± 0.263
1.979ThrGln: 1.979 ± 0.235
1.905ThrArg: 1.905 ± 0.243
3.811ThrSer: 3.811 ± 0.367
3.334ThrThr: 3.334 ± 0.464
3.664ThrVal: 3.664 ± 0.468
0.586ThrTrp: 0.586 ± 0.149
2.272ThrTyr: 2.272 ± 0.252
0.0ThrXaa: 0.0 ± 0.0
Val
3.151ValAla: 3.151 ± 0.441
0.44ValCys: 0.44 ± 0.148
3.261ValAsp: 3.261 ± 0.456
4.323ValGlu: 4.323 ± 0.508
2.491ValPhe: 2.491 ± 0.34
2.675ValGly: 2.675 ± 0.385
0.806ValHis: 0.806 ± 0.2
3.957ValIle: 3.957 ± 0.402
4.763ValLys: 4.763 ± 0.385
3.994ValLeu: 3.994 ± 0.409
1.063ValMet: 1.063 ± 0.191
4.287ValAsn: 4.287 ± 0.449
1.979ValPro: 1.979 ± 0.294
2.491ValGln: 2.491 ± 0.271
1.575ValArg: 1.575 ± 0.237
3.298ValSer: 3.298 ± 0.367
3.224ValThr: 3.224 ± 0.332
3.261ValVal: 3.261 ± 0.428
0.366ValTrp: 0.366 ± 0.139
2.601ValTyr: 2.601 ± 0.389
0.0ValXaa: 0.0 ± 0.0
Trp
0.183TrpAla: 0.183 ± 0.085
0.147TrpCys: 0.147 ± 0.066
0.55TrpAsp: 0.55 ± 0.129
0.916TrpGlu: 0.916 ± 0.187
0.366TrpPhe: 0.366 ± 0.17
0.513TrpGly: 0.513 ± 0.14
0.33TrpHis: 0.33 ± 0.164
0.623TrpIle: 0.623 ± 0.177
0.806TrpLys: 0.806 ± 0.166
1.282TrpLeu: 1.282 ± 0.247
0.256TrpMet: 0.256 ± 0.113
0.66TrpAsn: 0.66 ± 0.158
0.0TrpPro: 0.0 ± 0.0
0.513TrpGln: 0.513 ± 0.159
0.44TrpArg: 0.44 ± 0.117
0.44TrpSer: 0.44 ± 0.125
0.476TrpThr: 0.476 ± 0.124
0.403TrpVal: 0.403 ± 0.114
0.037TrpTrp: 0.037 ± 0.04
0.66TrpTyr: 0.66 ± 0.217
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.649TyrAla: 1.649 ± 0.233
0.586TyrCys: 0.586 ± 0.175
3.078TyrAsp: 3.078 ± 0.322
2.675TyrGlu: 2.675 ± 0.304
2.382TyrPhe: 2.382 ± 0.293
1.905TyrGly: 1.905 ± 0.349
0.55TyrHis: 0.55 ± 0.129
3.078TyrIle: 3.078 ± 0.452
4.104TyrLys: 4.104 ± 0.41
3.481TyrLeu: 3.481 ± 0.352
0.769TyrMet: 0.769 ± 0.127
4.214TyrAsn: 4.214 ± 0.497
1.722TyrPro: 1.722 ± 0.347
1.979TyrGln: 1.979 ± 0.25
1.832TyrArg: 1.832 ± 0.212
3.481TyrSer: 3.481 ± 0.36
2.125TyrThr: 2.125 ± 0.269
2.638TyrVal: 2.638 ± 0.397
0.66TyrTrp: 0.66 ± 0.173
2.125TyrTyr: 2.125 ± 0.404
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 87 proteins (27294 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski