Amino acid dipepetide frequency for Bacillus phage vB_BcM_Sam112

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.629AlaAla: 11.629 ± 1.597
0.214AlaCys: 0.214 ± 0.13
4.066AlaAsp: 4.066 ± 0.474
7.776AlaGlu: 7.776 ± 0.901
2.711AlaPhe: 2.711 ± 0.612
6.421AlaGly: 6.421 ± 1.026
1.213AlaHis: 1.213 ± 0.311
6.706AlaIle: 6.706 ± 1.094
7.491AlaLys: 7.491 ± 0.665
5.85AlaLeu: 5.85 ± 0.7
2.64AlaMet: 2.64 ± 0.567
4.994AlaAsn: 4.994 ± 0.569
2.14AlaPro: 2.14 ± 0.425
3.139AlaGln: 3.139 ± 0.435
2.212AlaArg: 2.212 ± 0.373
3.282AlaSer: 3.282 ± 0.522
5.779AlaThr: 5.779 ± 1.117
4.851AlaVal: 4.851 ± 0.845
1.355AlaTrp: 1.355 ± 0.352
2.854AlaTyr: 2.854 ± 0.529
0.0AlaXaa: 0.0 ± 0.0
Cys
0.357CysAla: 0.357 ± 0.189
0.0CysCys: 0.0 ± 0.0
0.571CysAsp: 0.571 ± 0.202
0.428CysGlu: 0.428 ± 0.184
0.214CysPhe: 0.214 ± 0.12
0.499CysGly: 0.499 ± 0.205
0.0CysHis: 0.0 ± 0.0
0.428CysIle: 0.428 ± 0.17
0.428CysLys: 0.428 ± 0.212
0.357CysLeu: 0.357 ± 0.154
0.214CysMet: 0.214 ± 0.123
0.428CysAsn: 0.428 ± 0.205
0.285CysPro: 0.285 ± 0.149
0.143CysGln: 0.143 ± 0.095
0.285CysArg: 0.285 ± 0.117
0.285CysSer: 0.285 ± 0.148
0.357CysThr: 0.357 ± 0.172
0.357CysVal: 0.357 ± 0.161
0.0CysTrp: 0.0 ± 0.0
0.285CysTyr: 0.285 ± 0.125
0.0CysXaa: 0.0 ± 0.0
Asp
4.352AspAla: 4.352 ± 0.558
0.214AspCys: 0.214 ± 0.118
3.567AspAsp: 3.567 ± 0.561
3.995AspGlu: 3.995 ± 0.608
2.925AspPhe: 2.925 ± 0.458
4.994AspGly: 4.994 ± 0.615
1.07AspHis: 1.07 ± 0.287
4.709AspIle: 4.709 ± 0.623
4.78AspLys: 4.78 ± 0.575
3.567AspLeu: 3.567 ± 0.539
1.498AspMet: 1.498 ± 0.3
3.852AspAsn: 3.852 ± 0.611
1.784AspPro: 1.784 ± 0.367
1.855AspGln: 1.855 ± 0.337
1.998AspArg: 1.998 ± 0.353
3.139AspSer: 3.139 ± 0.533
2.354AspThr: 2.354 ± 0.501
3.353AspVal: 3.353 ± 0.429
0.927AspTrp: 0.927 ± 0.278
1.712AspTyr: 1.712 ± 0.344
0.0AspXaa: 0.0 ± 0.0
Glu
6.278GluAla: 6.278 ± 0.897
0.499GluCys: 0.499 ± 0.186
3.496GluAsp: 3.496 ± 0.626
5.707GluGlu: 5.707 ± 0.944
3.852GluPhe: 3.852 ± 0.54
3.852GluGly: 3.852 ± 0.452
1.284GluHis: 1.284 ± 0.304
5.422GluIle: 5.422 ± 0.877
6.777GluLys: 6.777 ± 1.007
6.207GluLeu: 6.207 ± 0.645
3.567GluMet: 3.567 ± 0.673
3.139GluAsn: 3.139 ± 0.462
2.426GluPro: 2.426 ± 0.435
3.496GluGln: 3.496 ± 0.806
3.567GluArg: 3.567 ± 0.673
2.782GluSer: 2.782 ± 0.4
4.281GluThr: 4.281 ± 0.735
4.851GluVal: 4.851 ± 0.502
1.355GluTrp: 1.355 ± 0.24
2.711GluTyr: 2.711 ± 0.393
0.0GluXaa: 0.0 ± 0.0
Phe
2.283PheAla: 2.283 ± 0.342
0.214PheCys: 0.214 ± 0.132
3.353PheAsp: 3.353 ± 0.563
3.995PheGlu: 3.995 ± 0.715
1.355PhePhe: 1.355 ± 0.323
2.925PheGly: 2.925 ± 0.485
0.285PheHis: 0.285 ± 0.134
2.64PheIle: 2.64 ± 0.348
3.567PheLys: 3.567 ± 0.471
2.283PheLeu: 2.283 ± 0.452
1.07PheMet: 1.07 ± 0.355
1.998PheAsn: 1.998 ± 0.378
1.141PhePro: 1.141 ± 0.259
1.284PheGln: 1.284 ± 0.335
0.856PheArg: 0.856 ± 0.206
2.354PheSer: 2.354 ± 0.616
2.854PheThr: 2.854 ± 0.667
1.926PheVal: 1.926 ± 0.37
0.357PheTrp: 0.357 ± 0.138
1.784PheTyr: 1.784 ± 0.397
0.0PheXaa: 0.0 ± 0.0
Gly
5.707GlyAla: 5.707 ± 1.168
0.499GlyCys: 0.499 ± 0.203
3.638GlyAsp: 3.638 ± 0.564
5.137GlyGlu: 5.137 ± 0.519
2.782GlyPhe: 2.782 ± 0.386
4.495GlyGly: 4.495 ± 0.736
1.213GlyHis: 1.213 ± 0.306
4.566GlyIle: 4.566 ± 0.543
4.637GlyLys: 4.637 ± 0.67
4.851GlyLeu: 4.851 ± 0.955
2.14GlyMet: 2.14 ± 0.454
2.711GlyAsn: 2.711 ± 0.484
1.213GlyPro: 1.213 ± 0.265
2.854GlyGln: 2.854 ± 0.489
2.568GlyArg: 2.568 ± 0.444
4.566GlySer: 4.566 ± 0.484
4.851GlyThr: 4.851 ± 0.829
6.278GlyVal: 6.278 ± 0.545
0.499GlyTrp: 0.499 ± 0.201
3.71GlyTyr: 3.71 ± 0.615
0.0GlyXaa: 0.0 ± 0.0
His
1.355HisAla: 1.355 ± 0.344
0.0HisCys: 0.0 ± 0.0
0.713HisAsp: 0.713 ± 0.233
1.284HisGlu: 1.284 ± 0.361
0.499HisPhe: 0.499 ± 0.204
1.427HisGly: 1.427 ± 0.296
0.214HisHis: 0.214 ± 0.12
0.999HisIle: 0.999 ± 0.254
1.784HisLys: 1.784 ± 0.363
1.141HisLeu: 1.141 ± 0.219
0.642HisMet: 0.642 ± 0.247
0.927HisAsn: 0.927 ± 0.25
0.499HisPro: 0.499 ± 0.235
0.642HisGln: 0.642 ± 0.217
0.856HisArg: 0.856 ± 0.22
0.785HisSer: 0.785 ± 0.246
1.07HisThr: 1.07 ± 0.27
1.07HisVal: 1.07 ± 0.292
0.428HisTrp: 0.428 ± 0.178
0.856HisTyr: 0.856 ± 0.395
0.0HisXaa: 0.0 ± 0.0
Ile
5.921IleAla: 5.921 ± 0.778
0.499IleCys: 0.499 ± 0.181
4.352IleAsp: 4.352 ± 0.603
5.493IleGlu: 5.493 ± 0.649
2.354IlePhe: 2.354 ± 0.481
4.495IleGly: 4.495 ± 0.499
1.213IleHis: 1.213 ± 0.352
4.495IleIle: 4.495 ± 0.437
6.064IleLys: 6.064 ± 0.713
4.281IleLeu: 4.281 ± 0.53
1.355IleMet: 1.355 ± 0.329
3.353IleAsn: 3.353 ± 0.501
2.568IlePro: 2.568 ± 0.647
3.068IleGln: 3.068 ± 0.547
2.996IleArg: 2.996 ± 0.439
3.71IleSer: 3.71 ± 0.77
4.495IleThr: 4.495 ± 0.593
4.352IleVal: 4.352 ± 0.597
1.07IleTrp: 1.07 ± 0.647
2.212IleTyr: 2.212 ± 0.409
0.0IleXaa: 0.0 ± 0.0
Lys
7.562LysAla: 7.562 ± 1.065
0.499LysCys: 0.499 ± 0.275
5.137LysAsp: 5.137 ± 0.635
7.919LysGlu: 7.919 ± 0.932
2.354LysPhe: 2.354 ± 0.346
5.351LysGly: 5.351 ± 0.705
2.14LysHis: 2.14 ± 0.365
6.278LysIle: 6.278 ± 0.87
8.204LysLys: 8.204 ± 1.211
6.92LysLeu: 6.92 ± 1.054
2.212LysMet: 2.212 ± 0.438
3.496LysAsn: 3.496 ± 0.544
2.711LysPro: 2.711 ± 0.591
2.64LysGln: 2.64 ± 0.397
4.78LysArg: 4.78 ± 0.81
3.282LysSer: 3.282 ± 0.421
3.995LysThr: 3.995 ± 0.597
5.779LysVal: 5.779 ± 0.729
1.213LysTrp: 1.213 ± 0.314
2.568LysTyr: 2.568 ± 0.514
0.0LysXaa: 0.0 ± 0.0
Leu
5.779LeuAla: 5.779 ± 0.57
1.07LeuCys: 1.07 ± 0.37
3.567LeuAsp: 3.567 ± 0.439
6.135LeuGlu: 6.135 ± 0.958
2.64LeuPhe: 2.64 ± 0.443
4.209LeuGly: 4.209 ± 0.723
0.927LeuHis: 0.927 ± 0.242
4.281LeuIle: 4.281 ± 0.633
5.779LeuLys: 5.779 ± 0.832
4.352LeuLeu: 4.352 ± 0.752
2.069LeuMet: 2.069 ± 0.462
4.066LeuAsn: 4.066 ± 0.567
2.782LeuPro: 2.782 ± 0.618
3.282LeuGln: 3.282 ± 0.607
3.139LeuArg: 3.139 ± 0.525
4.637LeuSer: 4.637 ± 0.716
5.208LeuThr: 5.208 ± 0.568
4.994LeuVal: 4.994 ± 0.83
0.927LeuTrp: 0.927 ± 0.314
2.283LeuTyr: 2.283 ± 0.326
0.0LeuXaa: 0.0 ± 0.0
Met
2.354MetAla: 2.354 ± 0.409
0.0MetCys: 0.0 ± 0.0
1.998MetAsp: 1.998 ± 0.326
1.641MetGlu: 1.641 ± 0.387
1.141MetPhe: 1.141 ± 0.242
2.14MetGly: 2.14 ± 0.44
0.499MetHis: 0.499 ± 0.148
1.641MetIle: 1.641 ± 0.424
2.925MetLys: 2.925 ± 0.494
3.068MetLeu: 3.068 ± 0.494
0.999MetMet: 0.999 ± 0.262
1.998MetAsn: 1.998 ± 0.362
0.856MetPro: 0.856 ± 0.225
1.57MetGln: 1.57 ± 0.34
1.07MetArg: 1.07 ± 0.263
2.354MetSer: 2.354 ± 0.435
1.355MetThr: 1.355 ± 0.328
1.784MetVal: 1.784 ± 0.395
0.428MetTrp: 0.428 ± 0.189
0.428MetTyr: 0.428 ± 0.188
0.0MetXaa: 0.0 ± 0.0
Asn
5.422AsnAla: 5.422 ± 0.903
0.357AsnCys: 0.357 ± 0.167
3.282AsnAsp: 3.282 ± 0.375
2.711AsnGlu: 2.711 ± 0.399
2.069AsnPhe: 2.069 ± 0.442
4.066AsnGly: 4.066 ± 0.563
0.713AsnHis: 0.713 ± 0.232
2.782AsnIle: 2.782 ± 0.407
4.066AsnLys: 4.066 ± 0.641
3.068AsnLeu: 3.068 ± 0.575
1.498AsnMet: 1.498 ± 0.309
2.854AsnAsn: 2.854 ± 0.712
1.641AsnPro: 1.641 ± 0.314
1.998AsnGln: 1.998 ± 0.396
2.782AsnArg: 2.782 ± 0.447
2.782AsnSer: 2.782 ± 0.542
3.139AsnThr: 3.139 ± 0.446
4.138AsnVal: 4.138 ± 0.533
1.284AsnTrp: 1.284 ± 0.367
1.07AsnTyr: 1.07 ± 0.326
0.0AsnXaa: 0.0 ± 0.0
Pro
2.568ProAla: 2.568 ± 0.436
0.071ProCys: 0.071 ± 0.069
2.354ProAsp: 2.354 ± 0.437
2.497ProGlu: 2.497 ± 0.454
1.07ProPhe: 1.07 ± 0.3
2.14ProGly: 2.14 ± 0.427
0.713ProHis: 0.713 ± 0.221
2.212ProIle: 2.212 ± 0.395
2.854ProLys: 2.854 ± 0.585
1.998ProLeu: 1.998 ± 0.395
0.785ProMet: 0.785 ± 0.24
1.855ProAsn: 1.855 ± 0.392
1.498ProPro: 1.498 ± 0.379
0.713ProGln: 0.713 ± 0.248
1.427ProArg: 1.427 ± 0.227
1.712ProSer: 1.712 ± 0.476
1.784ProThr: 1.784 ± 0.333
2.212ProVal: 2.212 ± 0.371
0.285ProTrp: 0.285 ± 0.153
0.999ProTyr: 0.999 ± 0.291
0.0ProXaa: 0.0 ± 0.0
Gln
3.282GlnAla: 3.282 ± 0.459
0.214GlnCys: 0.214 ± 0.143
1.498GlnAsp: 1.498 ± 0.321
2.568GlnGlu: 2.568 ± 0.478
1.57GlnPhe: 1.57 ± 0.306
2.996GlnGly: 2.996 ± 0.498
0.642GlnHis: 0.642 ± 0.181
1.784GlnIle: 1.784 ± 0.352
3.139GlnLys: 3.139 ± 0.448
2.925GlnLeu: 2.925 ± 0.428
1.427GlnMet: 1.427 ± 0.272
1.57GlnAsn: 1.57 ± 0.303
1.213GlnPro: 1.213 ± 0.339
2.14GlnGln: 2.14 ± 0.39
1.213GlnArg: 1.213 ± 0.34
2.283GlnSer: 2.283 ± 0.519
3.068GlnThr: 3.068 ± 0.558
3.21GlnVal: 3.21 ± 0.466
0.285GlnTrp: 0.285 ± 0.135
1.712GlnTyr: 1.712 ± 0.444
0.0GlnXaa: 0.0 ± 0.0
Arg
1.998ArgAla: 1.998 ± 0.365
0.214ArgCys: 0.214 ± 0.126
2.426ArgAsp: 2.426 ± 0.436
3.282ArgGlu: 3.282 ± 0.395
1.57ArgPhe: 1.57 ± 0.323
2.782ArgGly: 2.782 ± 0.454
0.856ArgHis: 0.856 ± 0.225
2.212ArgIle: 2.212 ± 0.423
4.495ArgLys: 4.495 ± 0.743
3.924ArgLeu: 3.924 ± 0.719
1.712ArgMet: 1.712 ± 0.391
1.784ArgAsn: 1.784 ± 0.432
1.284ArgPro: 1.284 ± 0.339
1.784ArgGln: 1.784 ± 0.338
2.782ArgArg: 2.782 ± 0.631
1.712ArgSer: 1.712 ± 0.363
1.855ArgThr: 1.855 ± 0.479
2.354ArgVal: 2.354 ± 0.403
0.428ArgTrp: 0.428 ± 0.161
2.212ArgTyr: 2.212 ± 0.372
0.0ArgXaa: 0.0 ± 0.0
Ser
4.709SerAla: 4.709 ± 1.151
0.285SerCys: 0.285 ± 0.139
2.711SerAsp: 2.711 ± 0.324
3.139SerGlu: 3.139 ± 0.625
2.925SerPhe: 2.925 ± 0.429
4.423SerGly: 4.423 ± 0.778
1.07SerHis: 1.07 ± 0.271
3.496SerIle: 3.496 ± 0.511
4.209SerLys: 4.209 ± 0.581
3.71SerLeu: 3.71 ± 0.458
1.784SerMet: 1.784 ± 0.407
2.925SerAsn: 2.925 ± 0.667
1.213SerPro: 1.213 ± 0.312
1.427SerGln: 1.427 ± 0.268
1.213SerArg: 1.213 ± 0.322
2.568SerSer: 2.568 ± 0.494
3.282SerThr: 3.282 ± 0.499
3.139SerVal: 3.139 ± 0.616
0.713SerTrp: 0.713 ± 0.204
1.784SerTyr: 1.784 ± 0.367
0.0SerXaa: 0.0 ± 0.0
Thr
6.635ThrAla: 6.635 ± 1.112
0.285ThrCys: 0.285 ± 0.174
3.496ThrAsp: 3.496 ± 0.493
4.423ThrGlu: 4.423 ± 0.716
2.354ThrPhe: 2.354 ± 0.408
4.281ThrGly: 4.281 ± 0.554
0.927ThrHis: 0.927 ± 0.238
5.351ThrIle: 5.351 ± 0.909
4.066ThrLys: 4.066 ± 0.567
5.065ThrLeu: 5.065 ± 0.75
1.641ThrMet: 1.641 ± 0.554
2.354ThrAsn: 2.354 ± 0.4
2.782ThrPro: 2.782 ± 0.384
2.568ThrGln: 2.568 ± 0.43
2.782ThrArg: 2.782 ± 0.352
3.21ThrSer: 3.21 ± 0.711
4.209ThrThr: 4.209 ± 0.893
3.496ThrVal: 3.496 ± 0.395
0.999ThrTrp: 0.999 ± 0.231
2.426ThrTyr: 2.426 ± 0.491
0.0ThrXaa: 0.0 ± 0.0
Val
5.85ValAla: 5.85 ± 0.853
0.428ValCys: 0.428 ± 0.192
3.638ValAsp: 3.638 ± 0.372
4.281ValGlu: 4.281 ± 0.592
2.568ValPhe: 2.568 ± 0.431
3.781ValGly: 3.781 ± 0.603
1.57ValHis: 1.57 ± 0.319
4.923ValIle: 4.923 ± 0.599
4.78ValLys: 4.78 ± 0.617
4.851ValLeu: 4.851 ± 0.722
1.213ValMet: 1.213 ± 0.288
4.709ValAsn: 4.709 ± 0.692
2.568ValPro: 2.568 ± 0.46
2.996ValGln: 2.996 ± 0.39
2.354ValArg: 2.354 ± 0.441
2.996ValSer: 2.996 ± 0.62
5.065ValThr: 5.065 ± 0.843
4.637ValVal: 4.637 ± 0.742
0.785ValTrp: 0.785 ± 0.243
1.926ValTyr: 1.926 ± 0.396
0.0ValXaa: 0.0 ± 0.0
Trp
0.642TrpAla: 0.642 ± 0.194
0.0TrpCys: 0.0 ± 0.0
0.642TrpAsp: 0.642 ± 0.323
0.785TrpGlu: 0.785 ± 0.172
0.499TrpPhe: 0.499 ± 0.203
1.141TrpGly: 1.141 ± 0.224
0.285TrpHis: 0.285 ± 0.181
0.785TrpIle: 0.785 ± 0.231
1.07TrpLys: 1.07 ± 0.303
1.284TrpLeu: 1.284 ± 0.232
0.285TrpMet: 0.285 ± 0.158
0.927TrpAsn: 0.927 ± 0.569
0.0TrpPro: 0.0 ± 0.0
0.285TrpGln: 0.285 ± 0.117
0.713TrpArg: 0.713 ± 0.21
0.713TrpSer: 0.713 ± 0.204
2.069TrpThr: 2.069 ± 0.617
1.07TrpVal: 1.07 ± 0.266
0.285TrpTrp: 0.285 ± 0.144
0.856TrpTyr: 0.856 ± 0.267
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.64TyrAla: 2.64 ± 0.498
0.357TyrCys: 0.357 ± 0.16
2.14TyrAsp: 2.14 ± 0.318
2.354TyrGlu: 2.354 ± 0.452
1.07TyrPhe: 1.07 ± 0.296
2.497TyrGly: 2.497 ± 0.456
0.428TyrHis: 0.428 ± 0.215
2.64TyrIle: 2.64 ± 0.47
3.924TyrLys: 3.924 ± 0.489
2.426TyrLeu: 2.426 ± 0.378
1.427TyrMet: 1.427 ± 0.407
1.784TyrAsn: 1.784 ± 0.345
1.141TyrPro: 1.141 ± 0.342
0.856TyrGln: 0.856 ± 0.221
2.069TyrArg: 2.069 ± 0.475
1.57TyrSer: 1.57 ± 0.347
2.283TyrThr: 2.283 ± 0.458
2.069TyrVal: 2.069 ± 0.392
0.713TyrTrp: 0.713 ± 0.261
1.57TyrTyr: 1.57 ± 0.423
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 75 proteins (14018 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski