Amino acid dipepetide frequency for Bacillus phage PfNC7401

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.665AlaAla: 4.665 ± 0.717
0.583AlaCys: 0.583 ± 0.203
2.915AlaAsp: 2.915 ± 0.456
5.612AlaGlu: 5.612 ± 0.635
2.988AlaPhe: 2.988 ± 0.394
3.207AlaGly: 3.207 ± 0.485
0.364AlaHis: 0.364 ± 0.158
4.956AlaIle: 4.956 ± 0.506
5.758AlaLys: 5.758 ± 0.631
5.831AlaLeu: 5.831 ± 0.737
2.478AlaMet: 2.478 ± 0.549
3.644AlaAsn: 3.644 ± 0.496
1.676AlaPro: 1.676 ± 0.312
2.77AlaGln: 2.77 ± 0.541
2.405AlaArg: 2.405 ± 0.608
3.207AlaSer: 3.207 ± 0.404
3.426AlaThr: 3.426 ± 0.579
4.227AlaVal: 4.227 ± 0.557
0.51AlaTrp: 0.51 ± 0.164
2.041AlaTyr: 2.041 ± 0.495
0.0AlaXaa: 0.0 ± 0.0
Cys
0.292CysAla: 0.292 ± 0.13
0.073CysCys: 0.073 ± 0.062
0.583CysAsp: 0.583 ± 0.216
0.875CysGlu: 0.875 ± 0.282
0.583CysPhe: 0.583 ± 0.179
0.219CysGly: 0.219 ± 0.134
0.146CysHis: 0.146 ± 0.101
0.219CysIle: 0.219 ± 0.106
0.292CysLys: 0.292 ± 0.157
0.219CysLeu: 0.219 ± 0.112
0.0CysMet: 0.0 ± 0.0
0.51CysAsn: 0.51 ± 0.234
0.51CysPro: 0.51 ± 0.231
0.146CysGln: 0.146 ± 0.123
0.219CysArg: 0.219 ± 0.13
0.437CysSer: 0.437 ± 0.181
0.583CysThr: 0.583 ± 0.248
0.583CysVal: 0.583 ± 0.225
0.0CysTrp: 0.0 ± 0.0
0.364CysTyr: 0.364 ± 0.214
0.0CysXaa: 0.0 ± 0.0
Asp
3.353AspAla: 3.353 ± 0.546
0.219AspCys: 0.219 ± 0.115
3.426AspAsp: 3.426 ± 0.658
5.102AspGlu: 5.102 ± 0.804
2.551AspPhe: 2.551 ± 0.389
3.571AspGly: 3.571 ± 0.593
1.166AspHis: 1.166 ± 0.336
4.665AspIle: 4.665 ± 0.564
4.155AspLys: 4.155 ± 0.554
5.102AspLeu: 5.102 ± 0.761
2.259AspMet: 2.259 ± 0.461
2.624AspAsn: 2.624 ± 0.339
1.312AspPro: 1.312 ± 0.327
1.968AspGln: 1.968 ± 0.371
2.114AspArg: 2.114 ± 0.362
2.843AspSer: 2.843 ± 0.455
2.478AspThr: 2.478 ± 0.528
4.3AspVal: 4.3 ± 0.707
0.948AspTrp: 0.948 ± 0.257
2.988AspTyr: 2.988 ± 0.481
0.0AspXaa: 0.0 ± 0.0
Glu
6.851GluAla: 6.851 ± 0.704
1.093GluCys: 1.093 ± 0.34
3.571GluAsp: 3.571 ± 0.548
7.726GluGlu: 7.726 ± 1.266
2.988GluPhe: 2.988 ± 0.445
4.665GluGly: 4.665 ± 0.554
1.239GluHis: 1.239 ± 0.397
6.414GluIle: 6.414 ± 0.858
7.143GluLys: 7.143 ± 0.919
7.07GluLeu: 7.07 ± 0.914
2.551GluMet: 2.551 ± 0.47
4.373GluAsn: 4.373 ± 0.702
2.259GluPro: 2.259 ± 0.508
3.207GluGln: 3.207 ± 0.536
3.353GluArg: 3.353 ± 0.625
4.3GluSer: 4.3 ± 0.485
3.28GluThr: 3.28 ± 0.475
5.102GluVal: 5.102 ± 0.65
1.676GluTrp: 1.676 ± 0.282
2.988GluTyr: 2.988 ± 0.536
0.0GluXaa: 0.0 ± 0.0
Phe
1.895PheAla: 1.895 ± 0.253
0.292PheCys: 0.292 ± 0.162
2.77PheAsp: 2.77 ± 0.54
2.624PheGlu: 2.624 ± 0.577
1.093PhePhe: 1.093 ± 0.238
2.187PheGly: 2.187 ± 0.391
0.437PheHis: 0.437 ± 0.171
3.28PheIle: 3.28 ± 0.383
2.478PheLys: 2.478 ± 0.364
2.478PheLeu: 2.478 ± 0.376
0.948PheMet: 0.948 ± 0.278
2.332PheAsn: 2.332 ± 0.439
1.458PhePro: 1.458 ± 0.337
1.385PheGln: 1.385 ± 0.312
2.332PheArg: 2.332 ± 0.481
2.77PheSer: 2.77 ± 0.425
2.915PheThr: 2.915 ± 0.465
2.259PheVal: 2.259 ± 0.434
0.364PheTrp: 0.364 ± 0.162
1.895PheTyr: 1.895 ± 0.43
0.0PheXaa: 0.0 ± 0.0
Gly
2.624GlyAla: 2.624 ± 0.386
0.437GlyCys: 0.437 ± 0.195
3.061GlyAsp: 3.061 ± 0.548
4.227GlyGlu: 4.227 ± 0.607
3.28GlyPhe: 3.28 ± 0.504
4.009GlyGly: 4.009 ± 0.577
0.729GlyHis: 0.729 ± 0.25
6.05GlyIle: 6.05 ± 0.897
5.466GlyLys: 5.466 ± 0.661
5.029GlyLeu: 5.029 ± 0.964
3.061GlyMet: 3.061 ± 0.555
2.915GlyAsn: 2.915 ± 0.571
0.875GlyPro: 0.875 ± 0.306
1.895GlyGln: 1.895 ± 0.462
2.405GlyArg: 2.405 ± 0.37
2.697GlySer: 2.697 ± 0.441
3.571GlyThr: 3.571 ± 0.629
3.79GlyVal: 3.79 ± 0.441
0.729GlyTrp: 0.729 ± 0.22
3.499GlyTyr: 3.499 ± 0.488
0.0GlyXaa: 0.0 ± 0.0
His
0.583HisAla: 0.583 ± 0.231
0.073HisCys: 0.073 ± 0.077
0.875HisAsp: 0.875 ± 0.275
1.603HisGlu: 1.603 ± 0.417
0.583HisPhe: 0.583 ± 0.208
0.948HisGly: 0.948 ± 0.205
0.364HisHis: 0.364 ± 0.161
1.239HisIle: 1.239 ± 0.269
0.948HisLys: 0.948 ± 0.257
1.385HisLeu: 1.385 ± 0.29
0.219HisMet: 0.219 ± 0.118
0.875HisAsn: 0.875 ± 0.239
0.583HisPro: 0.583 ± 0.142
0.364HisGln: 0.364 ± 0.147
0.948HisArg: 0.948 ± 0.292
0.948HisSer: 0.948 ± 0.229
0.656HisThr: 0.656 ± 0.258
0.656HisVal: 0.656 ± 0.225
0.146HisTrp: 0.146 ± 0.085
0.51HisTyr: 0.51 ± 0.195
0.0HisXaa: 0.0 ± 0.0
Ile
6.195IleAla: 6.195 ± 0.569
0.219IleCys: 0.219 ± 0.188
5.321IleAsp: 5.321 ± 0.663
6.997IleGlu: 6.997 ± 0.78
1.312IlePhe: 1.312 ± 0.369
3.936IleGly: 3.936 ± 0.786
1.312IleHis: 1.312 ± 0.341
5.321IleIle: 5.321 ± 0.941
6.56IleLys: 6.56 ± 0.819
5.029IleLeu: 5.029 ± 0.764
1.458IleMet: 1.458 ± 0.299
5.175IleAsn: 5.175 ± 0.696
2.332IlePro: 2.332 ± 0.43
3.061IleGln: 3.061 ± 0.612
3.863IleArg: 3.863 ± 0.407
3.936IleSer: 3.936 ± 0.656
4.883IleThr: 4.883 ± 0.669
4.446IleVal: 4.446 ± 0.734
0.802IleTrp: 0.802 ± 0.217
2.187IleTyr: 2.187 ± 0.518
0.0IleXaa: 0.0 ± 0.0
Lys
4.883LysAla: 4.883 ± 0.635
0.437LysCys: 0.437 ± 0.158
5.175LysAsp: 5.175 ± 0.755
7.945LysGlu: 7.945 ± 1.128
2.332LysPhe: 2.332 ± 0.385
4.592LysGly: 4.592 ± 0.728
1.458LysHis: 1.458 ± 0.345
4.446LysIle: 4.446 ± 0.517
8.892LysLys: 8.892 ± 1.135
7.799LysLeu: 7.799 ± 0.826
2.478LysMet: 2.478 ± 0.388
3.644LysAsn: 3.644 ± 0.513
2.478LysPro: 2.478 ± 0.438
4.3LysGln: 4.3 ± 0.634
4.81LysArg: 4.81 ± 0.639
5.248LysSer: 5.248 ± 0.546
5.758LysThr: 5.758 ± 0.57
5.321LysVal: 5.321 ± 0.704
1.458LysTrp: 1.458 ± 0.294
4.009LysTyr: 4.009 ± 0.587
0.0LysXaa: 0.0 ± 0.0
Leu
5.102LeuAla: 5.102 ± 0.727
0.437LeuCys: 0.437 ± 0.163
4.956LeuAsp: 4.956 ± 0.51
6.268LeuGlu: 6.268 ± 0.686
2.915LeuPhe: 2.915 ± 0.531
5.175LeuGly: 5.175 ± 0.679
1.093LeuHis: 1.093 ± 0.21
5.394LeuIle: 5.394 ± 0.601
7.143LeuLys: 7.143 ± 0.869
5.758LeuLeu: 5.758 ± 0.668
2.551LeuMet: 2.551 ± 0.463
4.592LeuAsn: 4.592 ± 0.616
3.061LeuPro: 3.061 ± 0.987
3.28LeuGln: 3.28 ± 0.535
4.373LeuArg: 4.373 ± 0.539
5.029LeuSer: 5.029 ± 0.649
4.665LeuThr: 4.665 ± 0.639
4.009LeuVal: 4.009 ± 0.518
0.51LeuTrp: 0.51 ± 0.175
3.79LeuTyr: 3.79 ± 0.741
0.0LeuXaa: 0.0 ± 0.0
Met
2.259MetAla: 2.259 ± 0.438
0.292MetCys: 0.292 ± 0.152
1.458MetAsp: 1.458 ± 0.289
3.207MetGlu: 3.207 ± 0.496
0.656MetPhe: 0.656 ± 0.193
2.114MetGly: 2.114 ± 0.946
0.219MetHis: 0.219 ± 0.107
1.895MetIle: 1.895 ± 0.362
3.28MetLys: 3.28 ± 0.375
2.405MetLeu: 2.405 ± 0.526
1.312MetMet: 1.312 ± 0.512
2.405MetAsn: 2.405 ± 0.38
0.583MetPro: 0.583 ± 0.245
1.531MetGln: 1.531 ± 0.342
1.603MetArg: 1.603 ± 0.33
1.822MetSer: 1.822 ± 0.448
1.676MetThr: 1.676 ± 0.349
1.312MetVal: 1.312 ± 0.426
0.292MetTrp: 0.292 ± 0.149
1.312MetTyr: 1.312 ± 0.283
0.0MetXaa: 0.0 ± 0.0
Asn
2.259AsnAla: 2.259 ± 0.509
0.51AsnCys: 0.51 ± 0.222
3.426AsnAsp: 3.426 ± 0.558
3.644AsnGlu: 3.644 ± 0.641
2.259AsnPhe: 2.259 ± 0.413
4.956AsnGly: 4.956 ± 0.508
0.729AsnHis: 0.729 ± 0.258
4.155AsnIle: 4.155 ± 0.539
5.102AsnLys: 5.102 ± 0.625
4.956AsnLeu: 4.956 ± 0.581
2.041AsnMet: 2.041 ± 0.528
3.28AsnAsn: 3.28 ± 0.472
2.405AsnPro: 2.405 ± 0.371
2.332AsnGln: 2.332 ± 0.321
1.749AsnArg: 1.749 ± 0.432
3.207AsnSer: 3.207 ± 0.611
2.915AsnThr: 2.915 ± 0.382
3.863AsnVal: 3.863 ± 0.48
0.437AsnTrp: 0.437 ± 0.178
1.458AsnTyr: 1.458 ± 0.299
0.0AsnXaa: 0.0 ± 0.0
Pro
1.458ProAla: 1.458 ± 0.293
0.292ProCys: 0.292 ± 0.147
1.968ProAsp: 1.968 ± 0.421
2.332ProGlu: 2.332 ± 0.392
1.312ProPhe: 1.312 ± 0.318
1.749ProGly: 1.749 ± 0.443
0.292ProHis: 0.292 ± 0.135
1.531ProIle: 1.531 ± 0.351
2.114ProLys: 2.114 ± 0.422
1.895ProLeu: 1.895 ± 0.35
0.729ProMet: 0.729 ± 0.178
1.312ProAsn: 1.312 ± 0.27
1.093ProPro: 1.093 ± 0.287
1.676ProGln: 1.676 ± 0.367
1.093ProArg: 1.093 ± 0.254
2.697ProSer: 2.697 ± 0.471
2.77ProThr: 2.77 ± 0.493
2.77ProVal: 2.77 ± 0.516
0.219ProTrp: 0.219 ± 0.121
1.531ProTyr: 1.531 ± 0.36
0.0ProXaa: 0.0 ± 0.0
Gln
1.968GlnAla: 1.968 ± 0.43
0.219GlnCys: 0.219 ± 0.127
1.895GlnAsp: 1.895 ± 0.307
2.624GlnGlu: 2.624 ± 0.566
0.948GlnPhe: 0.948 ± 0.227
2.478GlnGly: 2.478 ± 0.385
0.364GlnHis: 0.364 ± 0.189
3.79GlnIle: 3.79 ± 0.463
3.426GlnLys: 3.426 ± 0.456
3.061GlnLeu: 3.061 ± 0.484
1.458GlnMet: 1.458 ± 0.276
2.041GlnAsn: 2.041 ± 0.521
1.458GlnPro: 1.458 ± 0.274
2.551GlnGln: 2.551 ± 0.636
2.624GlnArg: 2.624 ± 0.448
1.822GlnSer: 1.822 ± 0.294
2.332GlnThr: 2.332 ± 0.304
2.478GlnVal: 2.478 ± 0.502
0.729GlnTrp: 0.729 ± 0.215
1.822GlnTyr: 1.822 ± 0.326
0.0GlnXaa: 0.0 ± 0.0
Arg
2.551ArgAla: 2.551 ± 0.43
0.364ArgCys: 0.364 ± 0.184
2.843ArgAsp: 2.843 ± 0.348
3.644ArgGlu: 3.644 ± 0.537
2.114ArgPhe: 2.114 ± 0.409
2.332ArgGly: 2.332 ± 0.499
0.948ArgHis: 0.948 ± 0.257
4.082ArgIle: 4.082 ± 0.545
4.956ArgLys: 4.956 ± 0.991
3.644ArgLeu: 3.644 ± 0.561
1.531ArgMet: 1.531 ± 0.278
2.405ArgAsn: 2.405 ± 0.398
1.166ArgPro: 1.166 ± 0.317
1.531ArgGln: 1.531 ± 0.505
2.405ArgArg: 2.405 ± 0.381
2.041ArgSer: 2.041 ± 0.345
2.697ArgThr: 2.697 ± 0.46
2.77ArgVal: 2.77 ± 0.485
0.437ArgTrp: 0.437 ± 0.16
1.458ArgTyr: 1.458 ± 0.288
0.0ArgXaa: 0.0 ± 0.0
Ser
3.207SerAla: 3.207 ± 0.489
0.146SerCys: 0.146 ± 0.107
2.478SerAsp: 2.478 ± 0.402
4.009SerGlu: 4.009 ± 0.696
2.915SerPhe: 2.915 ± 0.473
3.353SerGly: 3.353 ± 0.508
0.948SerHis: 0.948 ± 0.26
3.79SerIle: 3.79 ± 0.553
5.758SerLys: 5.758 ± 0.693
4.665SerLeu: 4.665 ± 0.532
1.749SerMet: 1.749 ± 0.397
2.843SerAsn: 2.843 ± 0.64
1.531SerPro: 1.531 ± 0.366
2.77SerGln: 2.77 ± 0.363
2.478SerArg: 2.478 ± 0.422
3.499SerSer: 3.499 ± 0.851
3.134SerThr: 3.134 ± 0.395
4.446SerVal: 4.446 ± 0.61
0.656SerTrp: 0.656 ± 0.161
2.114SerTyr: 2.114 ± 0.409
0.0SerXaa: 0.0 ± 0.0
Thr
5.321ThrAla: 5.321 ± 0.691
0.364ThrCys: 0.364 ± 0.181
2.624ThrAsp: 2.624 ± 0.397
2.843ThrGlu: 2.843 ± 0.463
2.915ThrPhe: 2.915 ± 0.385
4.009ThrGly: 4.009 ± 0.523
1.093ThrHis: 1.093 ± 0.267
4.373ThrIle: 4.373 ± 0.904
4.227ThrLys: 4.227 ± 0.559
5.685ThrLeu: 5.685 ± 0.831
1.166ThrMet: 1.166 ± 0.325
4.3ThrAsn: 4.3 ± 0.613
3.207ThrPro: 3.207 ± 0.525
1.603ThrGln: 1.603 ± 0.371
1.895ThrArg: 1.895 ± 0.405
3.28ThrSer: 3.28 ± 0.52
3.426ThrThr: 3.426 ± 0.507
4.082ThrVal: 4.082 ± 0.471
0.656ThrTrp: 0.656 ± 0.187
1.895ThrTyr: 1.895 ± 0.395
0.0ThrXaa: 0.0 ± 0.0
Val
4.227ValAla: 4.227 ± 0.794
0.437ValCys: 0.437 ± 0.175
4.519ValAsp: 4.519 ± 0.685
5.394ValGlu: 5.394 ± 0.682
1.895ValPhe: 1.895 ± 0.327
4.009ValGly: 4.009 ± 0.681
0.875ValHis: 0.875 ± 0.277
4.738ValIle: 4.738 ± 0.586
5.904ValLys: 5.904 ± 0.571
3.426ValLeu: 3.426 ± 0.51
2.041ValMet: 2.041 ± 0.482
3.353ValAsn: 3.353 ± 0.493
2.259ValPro: 2.259 ± 0.371
2.114ValGln: 2.114 ± 0.439
2.77ValArg: 2.77 ± 0.629
4.155ValSer: 4.155 ± 0.555
4.592ValThr: 4.592 ± 0.712
4.665ValVal: 4.665 ± 0.53
0.875ValTrp: 0.875 ± 0.238
1.895ValTyr: 1.895 ± 0.441
0.0ValXaa: 0.0 ± 0.0
Trp
0.948TrpAla: 0.948 ± 0.304
0.073TrpCys: 0.073 ± 0.07
1.02TrpAsp: 1.02 ± 0.315
1.531TrpGlu: 1.531 ± 0.379
0.583TrpPhe: 0.583 ± 0.186
0.437TrpGly: 0.437 ± 0.169
0.437TrpHis: 0.437 ± 0.193
0.948TrpIle: 0.948 ± 0.212
0.802TrpLys: 0.802 ± 0.26
0.948TrpLeu: 0.948 ± 0.257
0.219TrpMet: 0.219 ± 0.123
0.948TrpAsn: 0.948 ± 0.243
0.0TrpPro: 0.0 ± 0.0
0.146TrpGln: 0.146 ± 0.094
0.656TrpArg: 0.656 ± 0.212
0.51TrpSer: 0.51 ± 0.243
0.802TrpThr: 0.802 ± 0.207
0.583TrpVal: 0.583 ± 0.245
0.146TrpTrp: 0.146 ± 0.1
0.219TrpTyr: 0.219 ± 0.111
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.697TyrAla: 2.697 ± 0.411
0.292TyrCys: 0.292 ± 0.183
2.259TyrAsp: 2.259 ± 0.387
3.644TyrGlu: 3.644 ± 0.607
1.968TyrPhe: 1.968 ± 0.453
2.405TyrGly: 2.405 ± 0.39
0.292TyrHis: 0.292 ± 0.139
3.061TyrIle: 3.061 ± 0.465
3.061TyrLys: 3.061 ± 0.458
3.717TyrLeu: 3.717 ± 0.515
1.312TyrMet: 1.312 ± 0.3
2.114TyrAsn: 2.114 ± 0.416
0.583TyrPro: 0.583 ± 0.235
1.458TyrGln: 1.458 ± 0.33
1.822TyrArg: 1.822 ± 0.358
2.041TyrSer: 2.041 ± 0.427
2.405TyrThr: 2.405 ± 0.475
2.478TyrVal: 2.478 ± 0.461
0.364TyrTrp: 0.364 ± 0.164
2.259TyrTyr: 2.259 ± 0.478
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 80 proteins (13721 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski