Amino acid dipepetide frequency for Flavobacterium phage vB_FspS_morran9-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.908AlaAla: 0.908 ± 0.391
0.661AlaCys: 0.661 ± 0.199
2.064AlaAsp: 2.064 ± 0.37
3.22AlaGlu: 3.22 ± 0.741
2.147AlaPhe: 2.147 ± 0.429
2.147AlaGly: 2.147 ± 0.38
0.33AlaHis: 0.33 ± 0.227
3.633AlaIle: 3.633 ± 0.511
5.779AlaLys: 5.779 ± 0.669
5.284AlaLeu: 5.284 ± 0.796
1.651AlaMet: 1.651 ± 0.429
4.458AlaAsn: 4.458 ± 0.651
1.073AlaPro: 1.073 ± 0.291
2.477AlaGln: 2.477 ± 0.458
1.073AlaArg: 1.073 ± 0.363
2.89AlaSer: 2.89 ± 0.637
4.541AlaThr: 4.541 ± 0.636
2.972AlaVal: 2.972 ± 0.611
0.661AlaTrp: 0.661 ± 0.213
1.899AlaTyr: 1.899 ± 0.361
0.0AlaXaa: 0.0 ± 0.0
Cys
0.495CysAla: 0.495 ± 0.212
0.248CysCys: 0.248 ± 0.144
0.661CysAsp: 0.661 ± 0.204
0.743CysGlu: 0.743 ± 0.24
0.743CysPhe: 0.743 ± 0.242
0.908CysGly: 0.908 ± 0.283
0.165CysHis: 0.165 ± 0.121
0.495CysIle: 0.495 ± 0.211
0.743CysLys: 0.743 ± 0.337
1.321CysLeu: 1.321 ± 0.382
0.083CysMet: 0.083 ± 0.074
0.413CysAsn: 0.413 ± 0.197
0.826CysPro: 0.826 ± 0.264
0.248CysGln: 0.248 ± 0.149
0.248CysArg: 0.248 ± 0.153
0.991CysSer: 0.991 ± 0.285
0.661CysThr: 0.661 ± 0.224
0.826CysVal: 0.826 ± 0.219
0.0CysTrp: 0.0 ± 0.0
0.413CysTyr: 0.413 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
3.468AspAla: 3.468 ± 0.42
0.826AspCys: 0.826 ± 0.254
1.982AspAsp: 1.982 ± 0.424
4.046AspGlu: 4.046 ± 0.602
3.715AspPhe: 3.715 ± 0.5
3.055AspGly: 3.055 ± 0.488
0.413AspHis: 0.413 ± 0.185
4.046AspIle: 4.046 ± 0.502
6.11AspLys: 6.11 ± 0.846
5.284AspLeu: 5.284 ± 0.511
1.238AspMet: 1.238 ± 0.378
4.624AspAsn: 4.624 ± 0.696
0.495AspPro: 0.495 ± 0.155
0.578AspGln: 0.578 ± 0.215
1.238AspArg: 1.238 ± 0.276
3.633AspSer: 3.633 ± 0.479
3.88AspThr: 3.88 ± 0.689
2.807AspVal: 2.807 ± 0.543
0.578AspTrp: 0.578 ± 0.18
3.303AspTyr: 3.303 ± 0.565
0.0AspXaa: 0.0 ± 0.0
Glu
3.137GluAla: 3.137 ± 0.843
0.826GluCys: 0.826 ± 0.301
3.22GluAsp: 3.22 ± 0.633
3.963GluGlu: 3.963 ± 0.622
4.376GluPhe: 4.376 ± 0.627
2.229GluGly: 2.229 ± 0.388
0.908GluHis: 0.908 ± 0.276
7.513GluIle: 7.513 ± 0.772
6.77GluLys: 6.77 ± 1.017
7.761GluLeu: 7.761 ± 0.944
1.486GluMet: 1.486 ± 0.357
6.11GluAsn: 6.11 ± 0.743
1.816GluPro: 1.816 ± 0.368
3.633GluGln: 3.633 ± 0.463
2.559GluArg: 2.559 ± 0.6
4.376GluSer: 4.376 ± 0.562
4.541GluThr: 4.541 ± 0.703
4.293GluVal: 4.293 ± 0.619
0.33GluTrp: 0.33 ± 0.158
3.385GluTyr: 3.385 ± 0.584
0.0GluXaa: 0.0 ± 0.0
Phe
1.982PheAla: 1.982 ± 0.42
0.495PheCys: 0.495 ± 0.216
3.715PheAsp: 3.715 ± 0.52
4.624PheGlu: 4.624 ± 0.603
2.312PhePhe: 2.312 ± 0.407
2.725PheGly: 2.725 ± 0.518
0.413PheHis: 0.413 ± 0.197
3.055PheIle: 3.055 ± 0.589
4.211PheLys: 4.211 ± 0.667
3.798PheLeu: 3.798 ± 0.715
1.569PheMet: 1.569 ± 0.314
4.211PheAsn: 4.211 ± 0.892
0.991PhePro: 0.991 ± 0.262
1.734PheGln: 1.734 ± 0.434
0.991PheArg: 0.991 ± 0.305
3.633PheSer: 3.633 ± 0.576
4.128PheThr: 4.128 ± 0.695
3.137PheVal: 3.137 ± 0.538
0.413PheTrp: 0.413 ± 0.154
1.569PheTyr: 1.569 ± 0.381
0.0PheXaa: 0.0 ± 0.0
Gly
2.972GlyAla: 2.972 ± 0.681
0.413GlyCys: 0.413 ± 0.198
2.642GlyAsp: 2.642 ± 0.541
2.064GlyGlu: 2.064 ± 0.407
2.807GlyPhe: 2.807 ± 0.607
1.899GlyGly: 1.899 ± 0.672
0.495GlyHis: 0.495 ± 0.203
3.798GlyIle: 3.798 ± 0.467
3.55GlyLys: 3.55 ± 0.632
3.715GlyLeu: 3.715 ± 0.49
1.238GlyMet: 1.238 ± 0.282
4.871GlyAsn: 4.871 ± 0.656
0.0GlyPro: 0.0 ± 0.0
1.734GlyGln: 1.734 ± 0.336
1.569GlyArg: 1.569 ± 0.372
2.807GlySer: 2.807 ± 0.515
4.706GlyThr: 4.706 ± 0.779
2.725GlyVal: 2.725 ± 0.445
0.413GlyTrp: 0.413 ± 0.186
2.477GlyTyr: 2.477 ± 0.448
0.0GlyXaa: 0.0 ± 0.0
His
0.248HisAla: 0.248 ± 0.14
0.33HisCys: 0.33 ± 0.189
0.991HisAsp: 0.991 ± 0.312
0.661HisGlu: 0.661 ± 0.234
0.661HisPhe: 0.661 ± 0.191
0.826HisGly: 0.826 ± 0.289
0.661HisHis: 0.661 ± 0.36
1.321HisIle: 1.321 ± 0.34
1.238HisLys: 1.238 ± 0.345
0.743HisLeu: 0.743 ± 0.268
0.083HisMet: 0.083 ± 0.085
1.156HisAsn: 1.156 ± 0.333
0.33HisPro: 0.33 ± 0.202
0.495HisGln: 0.495 ± 0.19
0.495HisArg: 0.495 ± 0.215
0.908HisSer: 0.908 ± 0.306
0.826HisThr: 0.826 ± 0.211
0.661HisVal: 0.661 ± 0.246
0.0HisTrp: 0.0 ± 0.0
0.495HisTyr: 0.495 ± 0.159
0.0HisXaa: 0.0 ± 0.0
Ile
4.624IleAla: 4.624 ± 0.672
0.991IleCys: 0.991 ± 0.352
4.706IleAsp: 4.706 ± 0.599
7.926IleGlu: 7.926 ± 0.655
3.385IlePhe: 3.385 ± 0.565
3.303IleGly: 3.303 ± 0.633
1.156IleHis: 1.156 ± 0.455
5.697IleIle: 5.697 ± 0.793
8.504IleLys: 8.504 ± 0.925
7.018IleLeu: 7.018 ± 0.907
1.073IleMet: 1.073 ± 0.28
6.605IleAsn: 6.605 ± 0.806
2.394IlePro: 2.394 ± 0.395
2.972IleGln: 2.972 ± 0.617
1.734IleArg: 1.734 ± 0.45
5.284IleSer: 5.284 ± 0.854
4.789IleThr: 4.789 ± 0.741
4.706IleVal: 4.706 ± 0.665
0.743IleTrp: 0.743 ± 0.257
3.137IleTyr: 3.137 ± 0.455
0.0IleXaa: 0.0 ± 0.0
Lys
4.458LysAla: 4.458 ± 0.499
1.156LysCys: 1.156 ± 0.262
5.119LysAsp: 5.119 ± 0.689
8.752LysGlu: 8.752 ± 1.288
3.468LysPhe: 3.468 ± 0.44
4.211LysGly: 4.211 ± 0.665
1.734LysHis: 1.734 ± 0.42
7.348LysIle: 7.348 ± 0.79
7.183LysLys: 7.183 ± 0.986
8.421LysLeu: 8.421 ± 0.984
3.137LysMet: 3.137 ± 0.447
6.275LysAsn: 6.275 ± 0.8
2.807LysPro: 2.807 ± 0.526
5.284LysGln: 5.284 ± 0.994
3.798LysArg: 3.798 ± 0.7
4.954LysSer: 4.954 ± 0.624
6.11LysThr: 6.11 ± 0.666
5.614LysVal: 5.614 ± 0.736
0.991LysTrp: 0.991 ± 0.314
4.458LysTyr: 4.458 ± 0.712
0.0LysXaa: 0.0 ± 0.0
Leu
4.376LeuAla: 4.376 ± 0.778
0.661LeuCys: 0.661 ± 0.327
5.779LeuAsp: 5.779 ± 0.662
6.11LeuGlu: 6.11 ± 0.921
3.55LeuPhe: 3.55 ± 0.455
3.55LeuGly: 3.55 ± 0.421
1.073LeuHis: 1.073 ± 0.292
7.596LeuIle: 7.596 ± 0.997
9.164LeuLys: 9.164 ± 0.792
6.853LeuLeu: 6.853 ± 0.869
2.229LeuMet: 2.229 ± 0.562
7.596LeuAsn: 7.596 ± 0.816
3.55LeuPro: 3.55 ± 0.537
4.293LeuGln: 4.293 ± 0.595
3.137LeuArg: 3.137 ± 0.529
5.697LeuSer: 5.697 ± 0.69
5.614LeuThr: 5.614 ± 0.792
4.541LeuVal: 4.541 ± 0.749
0.826LeuTrp: 0.826 ± 0.274
3.715LeuTyr: 3.715 ± 0.486
0.0LeuXaa: 0.0 ± 0.0
Met
1.982MetAla: 1.982 ± 0.363
0.165MetCys: 0.165 ± 0.126
0.908MetAsp: 0.908 ± 0.278
1.404MetGlu: 1.404 ± 0.338
1.321MetPhe: 1.321 ± 0.303
1.073MetGly: 1.073 ± 0.283
0.248MetHis: 0.248 ± 0.131
1.321MetIle: 1.321 ± 0.388
2.807MetLys: 2.807 ± 0.56
1.816MetLeu: 1.816 ± 0.376
0.248MetMet: 0.248 ± 0.121
1.569MetAsn: 1.569 ± 0.319
0.743MetPro: 0.743 ± 0.252
0.991MetGln: 0.991 ± 0.272
1.073MetArg: 1.073 ± 0.258
1.816MetSer: 1.816 ± 0.35
1.238MetThr: 1.238 ± 0.292
0.826MetVal: 0.826 ± 0.193
0.413MetTrp: 0.413 ± 0.198
0.661MetTyr: 0.661 ± 0.23
0.0MetXaa: 0.0 ± 0.0
Asn
4.624AsnAla: 4.624 ± 0.627
0.743AsnCys: 0.743 ± 0.241
4.458AsnAsp: 4.458 ± 0.698
6.275AsnGlu: 6.275 ± 0.741
3.963AsnPhe: 3.963 ± 0.551
4.541AsnGly: 4.541 ± 0.539
0.908AsnHis: 0.908 ± 0.258
5.367AsnIle: 5.367 ± 0.556
8.091AsnLys: 8.091 ± 0.947
7.183AsnLeu: 7.183 ± 0.905
1.073AsnMet: 1.073 ± 0.301
4.789AsnAsn: 4.789 ± 1.004
2.642AsnPro: 2.642 ± 0.419
2.89AsnGln: 2.89 ± 0.515
2.229AsnArg: 2.229 ± 0.539
5.697AsnSer: 5.697 ± 0.681
4.128AsnThr: 4.128 ± 0.587
4.954AsnVal: 4.954 ± 0.508
0.908AsnTrp: 0.908 ± 0.269
4.541AsnTyr: 4.541 ± 0.602
0.0AsnXaa: 0.0 ± 0.0
Pro
1.238ProAla: 1.238 ± 0.325
0.413ProCys: 0.413 ± 0.183
1.073ProAsp: 1.073 ± 0.317
1.982ProGlu: 1.982 ± 0.499
1.569ProPhe: 1.569 ± 0.465
0.0ProGly: 0.0 ± 0.0
0.083ProHis: 0.083 ± 0.087
1.651ProIle: 1.651 ± 0.327
1.651ProLys: 1.651 ± 0.417
3.303ProLeu: 3.303 ± 0.536
0.826ProMet: 0.826 ± 0.261
2.477ProAsn: 2.477 ± 0.397
0.661ProPro: 0.661 ± 0.259
1.569ProGln: 1.569 ± 0.386
0.165ProArg: 0.165 ± 0.127
1.982ProSer: 1.982 ± 0.368
1.982ProThr: 1.982 ± 0.396
1.073ProVal: 1.073 ± 0.298
0.0ProTrp: 0.0 ± 0.0
1.651ProTyr: 1.651 ± 0.454
0.0ProXaa: 0.0 ± 0.0
Gln
2.064GlnAla: 2.064 ± 0.66
0.495GlnCys: 0.495 ± 0.209
1.486GlnAsp: 1.486 ± 0.348
2.064GlnGlu: 2.064 ± 0.429
1.486GlnPhe: 1.486 ± 0.346
2.394GlnGly: 2.394 ± 0.435
0.826GlnHis: 0.826 ± 0.316
3.88GlnIle: 3.88 ± 0.577
4.293GlnLys: 4.293 ± 0.807
3.798GlnLeu: 3.798 ± 0.643
1.321GlnMet: 1.321 ± 0.403
2.89GlnAsn: 2.89 ± 0.574
1.156GlnPro: 1.156 ± 0.375
1.734GlnGln: 1.734 ± 0.715
2.064GlnArg: 2.064 ± 0.4
2.559GlnSer: 2.559 ± 0.457
2.394GlnThr: 2.394 ± 0.398
1.734GlnVal: 1.734 ± 0.362
0.495GlnTrp: 0.495 ± 0.206
1.404GlnTyr: 1.404 ± 0.342
0.0GlnXaa: 0.0 ± 0.0
Arg
1.156ArgAla: 1.156 ± 0.443
0.661ArgCys: 0.661 ± 0.215
0.908ArgAsp: 0.908 ± 0.281
1.899ArgGlu: 1.899 ± 0.45
1.073ArgPhe: 1.073 ± 0.323
0.908ArgGly: 0.908 ± 0.299
0.413ArgHis: 0.413 ± 0.195
3.055ArgIle: 3.055 ± 0.64
2.807ArgLys: 2.807 ± 0.403
3.468ArgLeu: 3.468 ± 0.447
0.661ArgMet: 0.661 ± 0.211
2.559ArgAsn: 2.559 ± 0.394
0.248ArgPro: 0.248 ± 0.14
0.991ArgGln: 0.991 ± 0.323
1.073ArgArg: 1.073 ± 0.282
2.477ArgSer: 2.477 ± 0.462
2.394ArgThr: 2.394 ± 0.446
2.064ArgVal: 2.064 ± 0.305
0.0ArgTrp: 0.0 ± 0.0
1.486ArgTyr: 1.486 ± 0.391
0.0ArgXaa: 0.0 ± 0.0
Ser
2.642SerAla: 2.642 ± 0.431
0.33SerCys: 0.33 ± 0.17
3.715SerAsp: 3.715 ± 0.731
5.532SerGlu: 5.532 ± 0.735
4.211SerPhe: 4.211 ± 0.705
4.458SerGly: 4.458 ± 0.705
0.743SerHis: 0.743 ± 0.219
5.779SerIle: 5.779 ± 0.733
6.027SerLys: 6.027 ± 0.699
5.201SerLeu: 5.201 ± 0.619
1.486SerMet: 1.486 ± 0.382
5.367SerAsn: 5.367 ± 0.836
1.321SerPro: 1.321 ± 0.334
2.394SerGln: 2.394 ± 0.468
2.064SerArg: 2.064 ± 0.413
3.633SerSer: 3.633 ± 0.701
2.89SerThr: 2.89 ± 0.485
4.128SerVal: 4.128 ± 0.494
0.495SerTrp: 0.495 ± 0.227
1.651SerTyr: 1.651 ± 0.397
0.0SerXaa: 0.0 ± 0.0
Thr
4.046ThrAla: 4.046 ± 0.713
0.661ThrCys: 0.661 ± 0.213
4.789ThrAsp: 4.789 ± 0.578
4.789ThrGlu: 4.789 ± 0.791
4.293ThrPhe: 4.293 ± 0.671
3.055ThrGly: 3.055 ± 0.742
0.826ThrHis: 0.826 ± 0.275
6.192ThrIle: 6.192 ± 0.943
5.367ThrLys: 5.367 ± 0.654
5.945ThrLeu: 5.945 ± 0.719
0.991ThrMet: 0.991 ± 0.262
4.954ThrAsn: 4.954 ± 0.735
1.899ThrPro: 1.899 ± 0.433
2.642ThrGln: 2.642 ± 0.481
1.404ThrArg: 1.404 ± 0.27
3.468ThrSer: 3.468 ± 0.601
4.789ThrThr: 4.789 ± 1.024
1.651ThrVal: 1.651 ± 0.43
0.661ThrTrp: 0.661 ± 0.211
2.559ThrTyr: 2.559 ± 0.485
0.0ThrXaa: 0.0 ± 0.0
Val
2.972ValAla: 2.972 ± 0.571
0.495ValCys: 0.495 ± 0.218
3.468ValAsp: 3.468 ± 0.517
3.303ValGlu: 3.303 ± 0.59
2.147ValPhe: 2.147 ± 0.419
3.385ValGly: 3.385 ± 0.425
0.578ValHis: 0.578 ± 0.205
4.293ValIle: 4.293 ± 0.476
5.449ValLys: 5.449 ± 0.615
4.458ValLeu: 4.458 ± 0.528
1.486ValMet: 1.486 ± 0.29
5.036ValAsn: 5.036 ± 0.771
1.156ValPro: 1.156 ± 0.31
2.229ValGln: 2.229 ± 0.403
1.486ValArg: 1.486 ± 0.392
3.715ValSer: 3.715 ± 0.577
2.229ValThr: 2.229 ± 0.416
2.725ValVal: 2.725 ± 0.6
0.826ValTrp: 0.826 ± 0.246
2.147ValTyr: 2.147 ± 0.344
0.0ValXaa: 0.0 ± 0.0
Trp
0.495TrpAla: 0.495 ± 0.182
0.33TrpCys: 0.33 ± 0.151
0.578TrpAsp: 0.578 ± 0.272
0.908TrpGlu: 0.908 ± 0.298
0.413TrpPhe: 0.413 ± 0.179
0.33TrpGly: 0.33 ± 0.131
0.413TrpHis: 0.413 ± 0.229
0.826TrpIle: 0.826 ± 0.273
0.908TrpLys: 0.908 ± 0.287
1.156TrpLeu: 1.156 ± 0.248
0.083TrpMet: 0.083 ± 0.079
0.578TrpAsn: 0.578 ± 0.207
0.0TrpPro: 0.0 ± 0.0
0.248TrpGln: 0.248 ± 0.135
0.248TrpArg: 0.248 ± 0.125
0.743TrpSer: 0.743 ± 0.325
0.33TrpThr: 0.33 ± 0.152
0.248TrpVal: 0.248 ± 0.125
0.0TrpTrp: 0.0 ± 0.0
0.495TrpTyr: 0.495 ± 0.206
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.982TyrAla: 1.982 ± 0.32
0.33TyrCys: 0.33 ± 0.159
3.055TyrAsp: 3.055 ± 0.459
3.055TyrGlu: 3.055 ± 0.46
2.064TyrPhe: 2.064 ± 0.453
1.899TyrGly: 1.899 ± 0.341
0.743TyrHis: 0.743 ± 0.236
3.88TyrIle: 3.88 ± 0.599
4.624TyrLys: 4.624 ± 0.644
3.385TyrLeu: 3.385 ± 0.513
0.578TyrMet: 0.578 ± 0.214
3.468TyrAsn: 3.468 ± 0.527
1.238TyrPro: 1.238 ± 0.309
1.404TyrGln: 1.404 ± 0.288
1.651TyrArg: 1.651 ± 0.435
2.807TyrSer: 2.807 ± 0.499
2.807TyrThr: 2.807 ± 0.561
1.899TyrVal: 1.899 ± 0.427
0.578TyrTrp: 0.578 ± 0.189
2.477TyrTyr: 2.477 ± 0.504
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 73 proteins (12113 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski