Amino acid dipepetide frequency for Phage NBSal001

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.676AlaAla: 7.676 ± 0.958
0.692AlaCys: 0.692 ± 0.267
4.426AlaAsp: 4.426 ± 0.613
5.601AlaGlu: 5.601 ± 0.635
3.181AlaPhe: 3.181 ± 0.471
5.117AlaGly: 5.117 ± 0.63
0.899AlaHis: 0.899 ± 0.218
5.532AlaIle: 5.532 ± 0.615
7.607AlaLys: 7.607 ± 1.196
7.123AlaLeu: 7.123 ± 0.715
2.697AlaMet: 2.697 ± 0.403
3.596AlaAsn: 3.596 ± 0.603
1.936AlaPro: 1.936 ± 0.309
4.08AlaGln: 4.08 ± 0.591
4.218AlaArg: 4.218 ± 0.631
4.771AlaSer: 4.771 ± 0.569
3.734AlaThr: 3.734 ± 0.551
5.048AlaVal: 5.048 ± 0.675
0.761AlaTrp: 0.761 ± 0.208
2.835AlaTyr: 2.835 ± 0.41
0.0AlaXaa: 0.0 ± 0.0
Cys
1.176CysAla: 1.176 ± 0.359
0.346CysCys: 0.346 ± 0.233
1.176CysAsp: 1.176 ± 0.267
0.899CysGlu: 0.899 ± 0.3
0.484CysPhe: 0.484 ± 0.2
1.037CysGly: 1.037 ± 0.344
0.415CysHis: 0.415 ± 0.158
0.899CysIle: 0.899 ± 0.314
1.176CysLys: 1.176 ± 0.302
0.692CysLeu: 0.692 ± 0.187
0.761CysMet: 0.761 ± 0.205
0.692CysAsn: 0.692 ± 0.216
0.415CysPro: 0.415 ± 0.155
0.069CysGln: 0.069 ± 0.07
0.553CysArg: 0.553 ± 0.223
0.622CysSer: 0.622 ± 0.174
0.761CysThr: 0.761 ± 0.273
0.83CysVal: 0.83 ± 0.219
0.484CysTrp: 0.484 ± 0.171
0.415CysTyr: 0.415 ± 0.14
0.0CysXaa: 0.0 ± 0.0
Asp
4.841AspAla: 4.841 ± 0.606
0.692AspCys: 0.692 ± 0.232
3.527AspAsp: 3.527 ± 0.549
4.633AspGlu: 4.633 ± 0.535
2.144AspPhe: 2.144 ± 0.33
5.117AspGly: 5.117 ± 0.731
1.037AspHis: 1.037 ± 0.327
3.527AspIle: 3.527 ± 0.457
4.287AspLys: 4.287 ± 0.445
5.117AspLeu: 5.117 ± 0.441
1.59AspMet: 1.59 ± 0.331
2.974AspAsn: 2.974 ± 0.406
2.559AspPro: 2.559 ± 0.371
1.521AspGln: 1.521 ± 0.32
2.213AspArg: 2.213 ± 0.391
2.974AspSer: 2.974 ± 0.468
3.665AspThr: 3.665 ± 0.451
3.734AspVal: 3.734 ± 0.471
0.968AspTrp: 0.968 ± 0.254
2.974AspTyr: 2.974 ± 0.469
0.0AspXaa: 0.0 ± 0.0
Glu
5.117GluAla: 5.117 ± 0.53
1.037GluCys: 1.037 ± 0.24
2.766GluAsp: 2.766 ± 0.405
4.426GluGlu: 4.426 ± 0.636
3.872GluPhe: 3.872 ± 0.522
3.25GluGly: 3.25 ± 0.429
1.314GluHis: 1.314 ± 0.258
4.979GluIle: 4.979 ± 0.606
5.74GluLys: 5.74 ± 0.651
5.325GluLeu: 5.325 ± 0.56
2.974GluMet: 2.974 ± 0.352
3.803GluAsn: 3.803 ± 0.478
1.729GluPro: 1.729 ± 0.394
3.458GluGln: 3.458 ± 0.571
3.872GluArg: 3.872 ± 0.609
4.426GluSer: 4.426 ± 0.471
3.527GluThr: 3.527 ± 0.419
4.702GluVal: 4.702 ± 0.675
1.037GluTrp: 1.037 ± 0.253
3.527GluTyr: 3.527 ± 0.428
0.0GluXaa: 0.0 ± 0.0
Phe
2.974PheAla: 2.974 ± 0.518
0.484PheCys: 0.484 ± 0.202
3.458PheAsp: 3.458 ± 0.48
2.351PheGlu: 2.351 ± 0.398
1.383PhePhe: 1.383 ± 0.37
2.974PheGly: 2.974 ± 0.512
0.83PheHis: 0.83 ± 0.239
2.075PheIle: 2.075 ± 0.408
2.559PheLys: 2.559 ± 0.445
2.213PheLeu: 2.213 ± 0.43
1.037PheMet: 1.037 ± 0.246
2.489PheAsn: 2.489 ± 0.432
1.867PhePro: 1.867 ± 0.347
1.521PheGln: 1.521 ± 0.296
1.176PheArg: 1.176 ± 0.302
2.489PheSer: 2.489 ± 0.399
2.628PheThr: 2.628 ± 0.473
3.043PheVal: 3.043 ± 0.365
0.622PheTrp: 0.622 ± 0.191
1.106PheTyr: 1.106 ± 0.376
0.0PheXaa: 0.0 ± 0.0
Gly
4.149GlyAla: 4.149 ± 0.705
1.383GlyCys: 1.383 ± 0.306
3.596GlyAsp: 3.596 ± 0.656
5.325GlyGlu: 5.325 ± 0.488
2.766GlyPhe: 2.766 ± 0.51
6.569GlyGly: 6.569 ± 1.036
1.037GlyHis: 1.037 ± 0.29
4.841GlyIle: 4.841 ± 0.553
6.224GlyLys: 6.224 ± 0.504
4.426GlyLeu: 4.426 ± 0.482
2.835GlyMet: 2.835 ± 0.379
4.011GlyAsn: 4.011 ± 0.372
0.138GlyPro: 0.138 ± 0.113
1.936GlyGln: 1.936 ± 0.462
2.559GlyArg: 2.559 ± 0.444
6.085GlySer: 6.085 ± 0.655
2.974GlyThr: 2.974 ± 0.437
5.463GlyVal: 5.463 ± 0.624
1.521GlyTrp: 1.521 ± 0.284
3.458GlyTyr: 3.458 ± 0.411
0.0GlyXaa: 0.0 ± 0.0
His
1.314HisAla: 1.314 ± 0.295
0.277HisCys: 0.277 ± 0.145
1.106HisAsp: 1.106 ± 0.314
1.314HisGlu: 1.314 ± 0.37
0.83HisPhe: 0.83 ± 0.237
1.66HisGly: 1.66 ± 0.339
0.692HisHis: 0.692 ± 0.265
0.83HisIle: 0.83 ± 0.256
1.66HisLys: 1.66 ± 0.384
1.106HisLeu: 1.106 ± 0.333
0.415HisMet: 0.415 ± 0.155
0.83HisAsn: 0.83 ± 0.303
0.622HisPro: 0.622 ± 0.249
0.692HisGln: 0.692 ± 0.215
0.899HisArg: 0.899 ± 0.241
1.037HisSer: 1.037 ± 0.293
1.245HisThr: 1.245 ± 0.28
1.245HisVal: 1.245 ± 0.27
0.069HisTrp: 0.069 ± 0.065
0.968HisTyr: 0.968 ± 0.332
0.0HisXaa: 0.0 ± 0.0
Ile
6.224IleAla: 6.224 ± 0.566
1.037IleCys: 1.037 ± 0.292
5.117IleAsp: 5.117 ± 0.57
4.841IleGlu: 4.841 ± 0.482
2.213IlePhe: 2.213 ± 0.362
3.388IleGly: 3.388 ± 0.442
1.59IleHis: 1.59 ± 0.378
4.08IleIle: 4.08 ± 0.508
4.979IleLys: 4.979 ± 0.603
2.835IleLeu: 2.835 ± 0.419
2.628IleMet: 2.628 ± 0.428
3.596IleAsn: 3.596 ± 0.578
2.766IlePro: 2.766 ± 0.494
2.42IleGln: 2.42 ± 0.389
2.974IleArg: 2.974 ± 0.44
4.149IleSer: 4.149 ± 0.511
5.186IleThr: 5.186 ± 0.659
4.149IleVal: 4.149 ± 0.39
1.245IleTrp: 1.245 ± 0.274
2.351IleTyr: 2.351 ± 0.428
0.0IleXaa: 0.0 ± 0.0
Lys
7.33LysAla: 7.33 ± 0.794
0.622LysCys: 0.622 ± 0.209
4.771LysAsp: 4.771 ± 0.529
6.984LysGlu: 6.984 ± 0.748
2.213LysPhe: 2.213 ± 0.447
4.287LysGly: 4.287 ± 0.575
1.452LysHis: 1.452 ± 0.368
4.633LysIle: 4.633 ± 0.49
4.564LysLys: 4.564 ± 0.758
5.463LysLeu: 5.463 ± 0.661
3.043LysMet: 3.043 ± 0.528
3.596LysAsn: 3.596 ± 0.453
3.527LysPro: 3.527 ± 0.415
2.144LysGln: 2.144 ± 0.53
3.872LysArg: 3.872 ± 0.585
3.872LysSer: 3.872 ± 0.476
4.633LysThr: 4.633 ± 0.476
4.495LysVal: 4.495 ± 0.539
1.176LysTrp: 1.176 ± 0.284
2.628LysTyr: 2.628 ± 0.441
0.0LysXaa: 0.0 ± 0.0
Leu
6.085LeuAla: 6.085 ± 0.915
0.899LeuCys: 0.899 ± 0.266
3.872LeuAsp: 3.872 ± 0.514
4.218LeuGlu: 4.218 ± 0.621
2.559LeuPhe: 2.559 ± 0.396
3.734LeuGly: 3.734 ± 0.568
0.968LeuHis: 0.968 ± 0.206
4.702LeuIle: 4.702 ± 0.467
5.048LeuLys: 5.048 ± 0.572
3.872LeuLeu: 3.872 ± 0.483
2.213LeuMet: 2.213 ± 0.374
4.702LeuAsn: 4.702 ± 0.491
2.766LeuPro: 2.766 ± 0.481
1.798LeuGln: 1.798 ± 0.454
3.596LeuArg: 3.596 ± 0.485
4.633LeuSer: 4.633 ± 0.548
4.633LeuThr: 4.633 ± 0.638
3.872LeuVal: 3.872 ± 0.479
0.692LeuTrp: 0.692 ± 0.216
2.213LeuTyr: 2.213 ± 0.374
0.0LeuXaa: 0.0 ± 0.0
Met
2.974MetAla: 2.974 ± 0.45
0.553MetCys: 0.553 ± 0.207
1.383MetAsp: 1.383 ± 0.315
1.037MetGlu: 1.037 ± 0.227
1.245MetPhe: 1.245 ± 0.303
1.59MetGly: 1.59 ± 0.339
0.968MetHis: 0.968 ± 0.274
2.559MetIle: 2.559 ± 0.392
2.628MetLys: 2.628 ± 0.381
2.144MetLeu: 2.144 ± 0.385
1.106MetMet: 1.106 ± 0.307
1.452MetAsn: 1.452 ± 0.252
0.968MetPro: 0.968 ± 0.251
1.867MetGln: 1.867 ± 0.336
1.798MetArg: 1.798 ± 0.331
2.075MetSer: 2.075 ± 0.405
2.144MetThr: 2.144 ± 0.318
2.282MetVal: 2.282 ± 0.325
0.346MetTrp: 0.346 ± 0.127
0.761MetTyr: 0.761 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
4.218AsnAla: 4.218 ± 0.627
0.761AsnCys: 0.761 ± 0.244
2.974AsnAsp: 2.974 ± 0.387
3.319AsnGlu: 3.319 ± 0.414
1.798AsnPhe: 1.798 ± 0.33
5.117AsnGly: 5.117 ± 0.615
1.176AsnHis: 1.176 ± 0.342
3.319AsnIle: 3.319 ± 0.406
3.665AsnLys: 3.665 ± 0.558
3.112AsnLeu: 3.112 ± 0.509
1.383AsnMet: 1.383 ± 0.347
2.974AsnAsn: 2.974 ± 0.595
1.314AsnPro: 1.314 ± 0.275
1.66AsnGln: 1.66 ± 0.408
2.282AsnArg: 2.282 ± 0.444
2.835AsnSer: 2.835 ± 0.366
1.936AsnThr: 1.936 ± 0.357
3.803AsnVal: 3.803 ± 0.486
0.692AsnTrp: 0.692 ± 0.212
1.59AsnTyr: 1.59 ± 0.318
0.0AsnXaa: 0.0 ± 0.0
Pro
2.213ProAla: 2.213 ± 0.339
0.553ProCys: 0.553 ± 0.274
2.559ProAsp: 2.559 ± 0.438
3.665ProGlu: 3.665 ± 0.563
1.245ProPhe: 1.245 ± 0.249
2.835ProGly: 2.835 ± 0.447
0.83ProHis: 0.83 ± 0.249
1.936ProIle: 1.936 ± 0.453
1.176ProLys: 1.176 ± 0.255
1.59ProLeu: 1.59 ± 0.339
0.83ProMet: 0.83 ± 0.228
1.936ProAsn: 1.936 ± 0.338
1.383ProPro: 1.383 ± 0.347
1.383ProGln: 1.383 ± 0.304
1.245ProArg: 1.245 ± 0.36
1.59ProSer: 1.59 ± 0.263
1.383ProThr: 1.383 ± 0.329
3.043ProVal: 3.043 ± 0.437
0.484ProTrp: 0.484 ± 0.149
1.037ProTyr: 1.037 ± 0.217
0.0ProXaa: 0.0 ± 0.0
Gln
3.319GlnAla: 3.319 ± 0.435
0.484GlnCys: 0.484 ± 0.194
2.005GlnAsp: 2.005 ± 0.359
2.697GlnGlu: 2.697 ± 0.403
1.245GlnPhe: 1.245 ± 0.31
2.075GlnGly: 2.075 ± 0.469
0.415GlnHis: 0.415 ± 0.173
2.904GlnIle: 2.904 ± 0.431
2.42GlnLys: 2.42 ± 0.324
3.596GlnLeu: 3.596 ± 0.543
0.83GlnMet: 0.83 ± 0.227
1.521GlnAsn: 1.521 ± 0.336
1.106GlnPro: 1.106 ± 0.32
2.974GlnGln: 2.974 ± 0.864
2.559GlnArg: 2.559 ± 0.43
2.282GlnSer: 2.282 ± 0.42
1.798GlnThr: 1.798 ± 0.367
2.144GlnVal: 2.144 ± 0.385
0.346GlnTrp: 0.346 ± 0.133
1.66GlnTyr: 1.66 ± 0.265
0.0GlnXaa: 0.0 ± 0.0
Arg
3.942ArgAla: 3.942 ± 0.476
0.968ArgCys: 0.968 ± 0.313
2.904ArgAsp: 2.904 ± 0.429
3.388ArgGlu: 3.388 ± 0.455
2.144ArgPhe: 2.144 ± 0.311
2.904ArgGly: 2.904 ± 0.362
0.692ArgHis: 0.692 ± 0.25
4.426ArgIle: 4.426 ± 0.695
4.149ArgLys: 4.149 ± 0.62
3.319ArgLeu: 3.319 ± 0.505
1.521ArgMet: 1.521 ± 0.341
1.245ArgAsn: 1.245 ± 0.31
1.452ArgPro: 1.452 ± 0.376
1.798ArgGln: 1.798 ± 0.36
2.974ArgArg: 2.974 ± 0.606
2.282ArgSer: 2.282 ± 0.382
1.314ArgThr: 1.314 ± 0.273
4.357ArgVal: 4.357 ± 0.605
0.553ArgTrp: 0.553 ± 0.189
2.075ArgTyr: 2.075 ± 0.321
0.0ArgXaa: 0.0 ± 0.0
Ser
4.91SerAla: 4.91 ± 0.679
0.692SerCys: 0.692 ± 0.192
4.218SerAsp: 4.218 ± 0.469
4.287SerGlu: 4.287 ± 0.577
2.213SerPhe: 2.213 ± 0.382
5.878SerGly: 5.878 ± 0.61
1.037SerHis: 1.037 ± 0.313
4.633SerIle: 4.633 ± 0.625
4.287SerLys: 4.287 ± 0.596
3.872SerLeu: 3.872 ± 0.514
1.798SerMet: 1.798 ± 0.311
1.936SerAsn: 1.936 ± 0.342
1.59SerPro: 1.59 ± 0.296
2.974SerGln: 2.974 ± 0.547
2.697SerArg: 2.697 ± 0.437
3.181SerSer: 3.181 ± 0.5
2.559SerThr: 2.559 ± 0.365
4.357SerVal: 4.357 ± 0.476
0.83SerTrp: 0.83 ± 0.259
2.42SerTyr: 2.42 ± 0.419
0.0SerXaa: 0.0 ± 0.0
Thr
4.91ThrAla: 4.91 ± 0.726
0.553ThrCys: 0.553 ± 0.258
2.766ThrAsp: 2.766 ± 0.357
2.974ThrGlu: 2.974 ± 0.583
3.181ThrPhe: 3.181 ± 0.526
5.67ThrGly: 5.67 ± 0.54
1.037ThrHis: 1.037 ± 0.258
3.734ThrIle: 3.734 ± 0.425
3.388ThrLys: 3.388 ± 0.688
3.803ThrLeu: 3.803 ± 0.467
1.314ThrMet: 1.314 ± 0.357
2.489ThrAsn: 2.489 ± 0.515
2.974ThrPro: 2.974 ± 0.42
2.075ThrGln: 2.075 ± 0.315
2.075ThrArg: 2.075 ± 0.369
3.319ThrSer: 3.319 ± 0.592
2.005ThrThr: 2.005 ± 0.339
3.25ThrVal: 3.25 ± 0.528
1.176ThrTrp: 1.176 ± 0.283
1.729ThrTyr: 1.729 ± 0.34
0.0ThrXaa: 0.0 ± 0.0
Val
4.357ValAla: 4.357 ± 0.526
0.899ValCys: 0.899 ± 0.264
3.803ValAsp: 3.803 ± 0.471
5.74ValGlu: 5.74 ± 0.708
2.213ValPhe: 2.213 ± 0.402
4.011ValGly: 4.011 ± 0.445
0.899ValHis: 0.899 ± 0.258
4.357ValIle: 4.357 ± 0.492
5.947ValLys: 5.947 ± 0.548
4.218ValLeu: 4.218 ± 0.39
1.936ValMet: 1.936 ± 0.327
3.734ValAsn: 3.734 ± 0.438
1.867ValPro: 1.867 ± 0.368
1.798ValGln: 1.798 ± 0.304
3.665ValArg: 3.665 ± 0.535
4.702ValSer: 4.702 ± 0.61
5.117ValThr: 5.117 ± 0.597
3.319ValVal: 3.319 ± 0.483
0.968ValTrp: 0.968 ± 0.229
2.213ValTyr: 2.213 ± 0.415
0.0ValXaa: 0.0 ± 0.0
Trp
1.037TrpAla: 1.037 ± 0.244
0.277TrpCys: 0.277 ± 0.143
0.692TrpAsp: 0.692 ± 0.188
0.968TrpGlu: 0.968 ± 0.332
0.968TrpPhe: 0.968 ± 0.304
1.106TrpGly: 1.106 ± 0.248
0.484TrpHis: 0.484 ± 0.154
0.968TrpIle: 0.968 ± 0.247
1.037TrpLys: 1.037 ± 0.292
1.314TrpLeu: 1.314 ± 0.27
0.207TrpMet: 0.207 ± 0.125
0.692TrpAsn: 0.692 ± 0.179
0.415TrpPro: 0.415 ± 0.171
0.553TrpGln: 0.553 ± 0.171
1.106TrpArg: 1.106 ± 0.3
0.761TrpSer: 0.761 ± 0.269
1.037TrpThr: 1.037 ± 0.275
0.553TrpVal: 0.553 ± 0.166
0.138TrpTrp: 0.138 ± 0.092
0.346TrpTyr: 0.346 ± 0.168
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.766TyrAla: 2.766 ± 0.46
0.692TyrCys: 0.692 ± 0.206
2.974TyrAsp: 2.974 ± 0.444
2.213TyrGlu: 2.213 ± 0.448
1.383TyrPhe: 1.383 ± 0.408
2.974TyrGly: 2.974 ± 0.387
1.037TyrHis: 1.037 ± 0.251
2.835TyrIle: 2.835 ± 0.431
2.835TyrLys: 2.835 ± 0.547
1.936TyrLeu: 1.936 ± 0.32
0.692TyrMet: 0.692 ± 0.204
1.66TyrAsn: 1.66 ± 0.403
1.521TyrPro: 1.521 ± 0.311
1.66TyrGln: 1.66 ± 0.289
2.075TyrArg: 2.075 ± 0.412
2.282TyrSer: 2.282 ± 0.413
2.144TyrThr: 2.144 ± 0.354
2.075TyrVal: 2.075 ± 0.325
0.553TyrTrp: 0.553 ± 0.204
1.037TyrTyr: 1.037 ± 0.306
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 77 proteins (14462 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski