Amino acid dipeptide frequency for Bacillus phage vB_BthP-Goe4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
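The counting described above can be sketched in a few lines of Python. This is only an illustrative sketch, not the code used by the database: the function name `dipeptide_permille`, the one-letter alphabet with `X` standing in for Xaa, and the choice to count overlapping pairs within each protein separately are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

# 20 standard amino acids (one-letter codes) plus X for Xaa (assumed encoding)
ALPHABET = "ACDEFGHIKLMNPQRSTVWY" + "X"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 ordered pairs.

    Overlapping pairs are counted within each protein; pairs spanning two
    proteins are not counted (an assumption of this sketch).
    """
    allowed = set(ALPHABET)
    counts = Counter()
    for seq in proteins:
        # Map any residue outside the alphabet to X (Xaa)
        clean = [aa if aa in allowed else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(clean, clean[1:]))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


# Toy input (hypothetical sequences, not the phage proteome)
freqs = dipeptide_permille(["MKKLE", "AKK"])
print(len(freqs))             # number of ordered pairs considered
print(round(freqs["KK"], 3))  # KK occurs in 2 of the 6 pairs
```

With 20 standard residues there are 20 x 20 = 400 ordered pairs, so a uniform distribution gives 1/400 = 0.25% per dipeptide; adding Xaa raises the count to 21 x 21 = 441.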

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
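As a small worked example of this unit conversion, the sketch below turns a per-mille entry into a fraction and compares it with the uniform expectation of 1/400. The helper names are hypothetical; the LysLys value is taken from the table.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def fold_over_uniform(value_permille, n_dipeptides=400):
    """Ratio of an observed frequency to the uniform expectation 1/n_dipeptides."""
    return permille_to_fraction(value_permille) * n_dipeptides


lyslys = 10.895  # LysLys entry from the table, in per mille
print(permille_to_fraction(lyslys))  # roughly 0.0109, i.e. about 1.09%
print(fold_over_uniform(lyslys))     # roughly 4.4 times the uniform 0.25%
```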

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.409 ± 0.212
AlaCys: 0.953 ± 0.34
AlaAsp: 2.724 ± 0.603
AlaGlu: 3.813 ± 1.145
AlaPhe: 3.405 ± 0.789
AlaGly: 3.405 ± 0.832
AlaHis: 0.136 ± 0.112
AlaIle: 2.315 ± 0.543
AlaLys: 3.541 ± 0.664
AlaLeu: 2.043 ± 0.45
AlaMet: 1.907 ± 0.494
AlaAsn: 2.179 ± 0.612
AlaPro: 1.362 ± 0.507
AlaGln: 1.907 ± 0.379
AlaArg: 1.498 ± 0.336
AlaSer: 2.043 ± 0.539
AlaThr: 2.86 ± 0.516
AlaVal: 3.541 ± 0.81
AlaTrp: 0.681 ± 0.304
AlaTyr: 2.179 ± 0.588
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.681 ± 0.299
CysCys: 0.136 ± 0.125
CysAsp: 1.498 ± 0.514
CysGlu: 1.226 ± 0.357
CysPhe: 0.409 ± 0.26
CysGly: 0.817 ± 0.414
CysHis: 0.272 ± 0.224
CysIle: 0.817 ± 0.318
CysLys: 1.907 ± 0.503
CysLeu: 0.545 ± 0.253
CysMet: 0.409 ± 0.214
CysAsn: 0.272 ± 0.16
CysPro: 0.681 ± 0.35
CysGln: 0.136 ± 0.124
CysArg: 0.409 ± 0.224
CysSer: 0.272 ± 0.182
CysThr: 0.0 ± 0.0
CysVal: 0.681 ± 0.291
CysTrp: 0.409 ± 0.244
CysTyr: 0.681 ± 0.258
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.315 ± 0.507
AspCys: 1.089 ± 0.515
AspAsp: 2.996 ± 0.621
AspGlu: 5.856 ± 0.851
AspPhe: 4.086 ± 0.693
AspGly: 5.72 ± 1.071
AspHis: 0.953 ± 0.338
AspIle: 5.311 ± 0.793
AspLys: 5.72 ± 0.645
AspLeu: 4.086 ± 1.021
AspMet: 2.179 ± 0.531
AspAsn: 5.039 ± 0.679
AspPro: 2.724 ± 0.691
AspGln: 2.179 ± 0.42
AspArg: 2.043 ± 0.598
AspSer: 1.634 ± 0.595
AspThr: 2.996 ± 0.636
AspVal: 5.311 ± 0.699
AspTrp: 0.409 ± 0.195
AspTyr: 2.996 ± 0.594
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.996 ± 0.72
GluCys: 0.681 ± 0.285
GluAsp: 3.949 ± 0.655
GluGlu: 6.673 ± 1.651
GluPhe: 3.949 ± 0.837
GluGly: 5.311 ± 0.766
GluHis: 0.953 ± 0.263
GluIle: 5.992 ± 1.036
GluLys: 7.218 ± 1.664
GluLeu: 10.214 ± 1.33
GluMet: 1.907 ± 0.492
GluAsn: 6.128 ± 0.785
GluPro: 0.953 ± 0.333
GluGln: 2.724 ± 0.607
GluArg: 3.813 ± 0.798
GluSer: 5.039 ± 0.874
GluThr: 4.358 ± 1.069
GluVal: 4.903 ± 1.041
GluTrp: 1.226 ± 0.328
GluTyr: 2.996 ± 0.705
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.043 ± 0.376
PheCys: 0.817 ± 0.341
PheAsp: 4.766 ± 0.794
PheGlu: 3.268 ± 0.575
PhePhe: 1.498 ± 0.428
PheGly: 1.77 ± 0.625
PheHis: 0.953 ± 0.375
PheIle: 4.63 ± 0.809
PheLys: 4.494 ± 0.822
PheLeu: 2.587 ± 0.57
PheMet: 1.634 ± 0.464
PheAsn: 2.315 ± 0.524
PhePro: 0.681 ± 0.286
PheGln: 0.817 ± 0.388
PheArg: 0.681 ± 0.336
PheSer: 3.405 ± 0.706
PheThr: 2.86 ± 0.718
PheVal: 2.86 ± 0.563
PheTrp: 0.409 ± 0.176
PheTyr: 2.451 ± 0.627
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.86 ± 0.665
GlyCys: 0.817 ± 0.323
GlyAsp: 2.996 ± 0.915
GlyGlu: 5.72 ± 0.688
GlyPhe: 3.405 ± 0.467
GlyGly: 3.405 ± 0.7
GlyHis: 0.545 ± 0.328
GlyIle: 4.358 ± 0.963
GlyLys: 6.537 ± 1.324
GlyLeu: 4.766 ± 0.535
GlyMet: 2.86 ± 0.564
GlyAsn: 4.494 ± 0.907
GlyPro: 0.545 ± 0.235
GlyGln: 1.498 ± 0.345
GlyArg: 1.907 ± 0.5
GlySer: 4.222 ± 0.762
GlyThr: 5.72 ± 1.049
GlyVal: 3.541 ± 0.491
GlyTrp: 0.272 ± 0.161
GlyTyr: 3.677 ± 0.73
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.409 ± 0.191
HisCys: 0.136 ± 0.132
HisAsp: 0.953 ± 0.387
HisGlu: 0.409 ± 0.281
HisPhe: 0.681 ± 0.258
HisGly: 0.272 ± 0.183
HisHis: 0.545 ± 0.29
HisIle: 1.498 ± 0.32
HisLys: 0.953 ± 0.315
HisLeu: 1.226 ± 0.412
HisMet: 0.272 ± 0.211
HisAsn: 0.681 ± 0.222
HisPro: 0.136 ± 0.126
HisGln: 0.409 ± 0.215
HisArg: 0.409 ± 0.224
HisSer: 0.953 ± 0.358
HisThr: 1.226 ± 0.449
HisVal: 0.953 ± 0.402
HisTrp: 0.136 ± 0.138
HisTyr: 0.953 ± 0.337
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.268 ± 0.644
IleCys: 0.272 ± 0.167
IleAsp: 6.128 ± 0.754
IleGlu: 7.218 ± 1.247
IlePhe: 1.634 ± 0.454
IleGly: 3.949 ± 0.55
IleHis: 0.817 ± 0.321
IleIle: 3.677 ± 0.586
IleLys: 7.626 ± 1.081
IleLeu: 3.268 ± 0.851
IleMet: 2.315 ± 0.532
IleAsn: 5.584 ± 0.7
IlePro: 1.362 ± 0.397
IleGln: 2.451 ± 0.511
IleArg: 4.222 ± 0.87
IleSer: 3.132 ± 0.6
IleThr: 3.541 ± 0.742
IleVal: 5.039 ± 1.127
IleTrp: 0.681 ± 0.265
IleTyr: 3.405 ± 0.82
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.494 ± 1.156
LysCys: 1.226 ± 0.407
LysAsp: 7.082 ± 1.039
LysGlu: 9.261 ± 1.845
LysPhe: 3.677 ± 0.592
LysGly: 6.537 ± 1.008
LysHis: 1.226 ± 0.329
LysIle: 5.856 ± 0.791
LysLys: 10.895 ± 1.381
LysLeu: 7.762 ± 0.991
LysMet: 3.405 ± 0.666
LysAsn: 4.63 ± 0.717
LysPro: 1.498 ± 0.516
LysGln: 3.268 ± 0.767
LysArg: 4.63 ± 0.906
LysSer: 3.268 ± 0.694
LysThr: 5.584 ± 1.01
LysVal: 7.082 ± 0.902
LysTrp: 1.498 ± 0.389
LysTyr: 4.494 ± 0.723
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.268 ± 0.784
LeuCys: 0.681 ± 0.274
LeuAsp: 5.311 ± 0.689
LeuGlu: 5.311 ± 0.986
LeuPhe: 2.179 ± 0.509
LeuGly: 4.494 ± 0.896
LeuHis: 1.634 ± 0.473
LeuIle: 4.766 ± 0.881
LeuLys: 8.443 ± 1.49
LeuLeu: 3.813 ± 0.531
LeuMet: 2.315 ± 0.491
LeuAsn: 5.175 ± 0.74
LeuPro: 2.587 ± 0.615
LeuGln: 2.996 ± 0.72
LeuArg: 2.86 ± 0.739
LeuSer: 3.268 ± 0.626
LeuThr: 4.766 ± 0.695
LeuVal: 3.268 ± 0.715
LeuTrp: 0.953 ± 0.342
LeuTyr: 3.677 ± 0.525
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.681 ± 0.254
MetCys: 0.817 ± 0.326
MetAsp: 1.634 ± 0.467
MetGlu: 2.179 ± 0.547
MetPhe: 2.179 ± 0.549
MetGly: 2.315 ± 0.396
MetHis: 0.136 ± 0.132
MetIle: 1.634 ± 0.436
MetLys: 1.907 ± 0.394
MetLeu: 2.587 ± 0.71
MetMet: 1.362 ± 0.526
MetAsn: 3.268 ± 0.54
MetPro: 0.817 ± 0.232
MetGln: 0.817 ± 0.276
MetArg: 1.634 ± 0.441
MetSer: 2.451 ± 0.48
MetThr: 1.498 ± 0.409
MetVal: 0.953 ± 0.33
MetTrp: 1.226 ± 0.376
MetTyr: 1.634 ± 0.438
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.315 ± 0.647
AsnCys: 0.681 ± 0.299
AsnAsp: 4.766 ± 0.623
AsnGlu: 5.311 ± 0.918
AsnPhe: 3.405 ± 0.721
AsnGly: 4.086 ± 0.713
AsnHis: 0.545 ± 0.246
AsnIle: 4.766 ± 0.636
AsnLys: 6.128 ± 0.996
AsnLeu: 4.903 ± 0.788
AsnMet: 1.498 ± 0.537
AsnAsn: 7.354 ± 1.408
AsnPro: 2.451 ± 0.648
AsnGln: 2.179 ± 0.461
AsnArg: 3.132 ± 0.8
AsnSer: 5.039 ± 0.805
AsnThr: 3.949 ± 0.85
AsnVal: 4.086 ± 0.71
AsnTrp: 0.953 ± 0.304
AsnTyr: 3.268 ± 0.899
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.089 ± 0.374
ProCys: 0.272 ± 0.17
ProAsp: 1.77 ± 0.509
ProGlu: 1.77 ± 0.368
ProPhe: 1.089 ± 0.52
ProGly: 0.817 ± 0.279
ProHis: 0.272 ± 0.175
ProIle: 1.634 ± 0.398
ProLys: 2.315 ± 0.565
ProLeu: 0.953 ± 0.37
ProMet: 0.953 ± 0.484
ProAsn: 1.77 ± 0.523
ProPro: 1.089 ± 0.306
ProGln: 0.817 ± 0.249
ProArg: 0.817 ± 0.371
ProSer: 1.634 ± 0.384
ProThr: 1.498 ± 0.45
ProVal: 1.634 ± 0.458
ProTrp: 0.409 ± 0.2
ProTyr: 2.043 ± 0.731
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.362 ± 0.352
GlnCys: 0.272 ± 0.19
GlnAsp: 1.226 ± 0.493
GlnGlu: 2.043 ± 0.645
GlnPhe: 0.953 ± 0.427
GlnGly: 2.043 ± 0.664
GlnHis: 0.136 ± 0.151
GlnIle: 1.907 ± 0.451
GlnLys: 2.315 ± 0.451
GlnLeu: 4.086 ± 0.788
GlnMet: 0.409 ± 0.216
GlnAsn: 2.724 ± 0.526
GlnPro: 1.226 ± 0.253
GlnGln: 0.817 ± 0.284
GlnArg: 1.089 ± 0.32
GlnSer: 1.498 ± 0.586
GlnThr: 2.043 ± 0.423
GlnVal: 2.315 ± 0.612
GlnTrp: 0.409 ± 0.202
GlnTyr: 2.587 ± 0.563
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 2.996 ± 0.535
ArgCys: 0.545 ± 0.268
ArgAsp: 2.179 ± 0.592
ArgGlu: 4.086 ± 0.788
ArgPhe: 2.451 ± 0.502
ArgGly: 2.451 ± 0.614
ArgHis: 0.272 ± 0.152
ArgIle: 2.179 ± 0.52
ArgLys: 4.63 ± 0.872
ArgLeu: 2.315 ± 0.672
ArgMet: 1.089 ± 0.422
ArgAsn: 2.043 ± 0.457
ArgPro: 0.681 ± 0.242
ArgGln: 1.634 ± 0.551
ArgArg: 2.043 ± 0.408
ArgSer: 2.315 ± 0.631
ArgThr: 2.179 ± 0.502
ArgVal: 1.77 ± 0.588
ArgTrp: 0.272 ± 0.167
ArgTyr: 2.587 ± 0.857
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.587 ± 0.5
SerCys: 0.681 ± 0.246
SerAsp: 3.132 ± 0.551
SerGlu: 3.813 ± 0.684
SerPhe: 2.587 ± 0.656
SerGly: 2.587 ± 0.474
SerHis: 0.681 ± 0.292
SerIle: 3.813 ± 0.771
SerLys: 4.494 ± 0.752
SerLeu: 4.358 ± 0.831
SerMet: 1.907 ± 0.432
SerAsn: 5.175 ± 0.905
SerPro: 0.953 ± 0.305
SerGln: 1.634 ± 0.398
SerArg: 2.315 ± 0.518
SerSer: 2.587 ± 0.651
SerThr: 3.268 ± 0.737
SerVal: 2.587 ± 0.568
SerTrp: 0.136 ± 0.133
SerTyr: 2.587 ± 0.456
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.405 ± 0.85
ThrCys: 0.545 ± 0.23
ThrAsp: 3.132 ± 0.724
ThrGlu: 3.813 ± 0.664
ThrPhe: 2.315 ± 0.568
ThrGly: 4.766 ± 0.994
ThrHis: 0.545 ± 0.284
ThrIle: 6.264 ± 0.842
ThrLys: 5.72 ± 0.76
ThrLeu: 5.311 ± 1.004
ThrMet: 1.907 ± 0.41
ThrAsn: 3.541 ± 0.606
ThrPro: 1.362 ± 0.392
ThrGln: 1.498 ± 0.389
ThrArg: 2.315 ± 0.549
ThrSer: 3.677 ± 0.699
ThrThr: 4.086 ± 1.053
ThrVal: 4.086 ± 0.705
ThrTrp: 0.136 ± 0.132
ThrTyr: 1.77 ± 0.504
ThrXaa: 0.0 ± 0.0
Val
ValAla: 2.451 ± 0.73
ValCys: 0.409 ± 0.258
ValAsp: 5.447 ± 0.783
ValGlu: 5.311 ± 0.795
ValPhe: 2.043 ± 0.437
ValGly: 5.584 ± 0.829
ValHis: 1.498 ± 0.459
ValIle: 4.358 ± 1.038
ValLys: 6.264 ± 0.826
ValLeu: 3.677 ± 0.555
ValMet: 1.498 ± 0.457
ValAsn: 3.132 ± 0.673
ValPro: 2.179 ± 0.49
ValGln: 1.634 ± 0.702
ValArg: 2.179 ± 0.538
ValSer: 2.86 ± 0.485
ValThr: 4.766 ± 1.076
ValVal: 3.813 ± 0.762
ValTrp: 0.953 ± 0.345
ValTyr: 2.587 ± 0.663
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.136 ± 0.124
TrpCys: 0.136 ± 0.126
TrpAsp: 0.409 ± 0.192
TrpGlu: 0.817 ± 0.319
TrpPhe: 1.362 ± 0.393
TrpGly: 0.681 ± 0.314
TrpHis: 0.545 ± 0.191
TrpIle: 0.545 ± 0.22
TrpLys: 1.362 ± 0.394
TrpLeu: 0.817 ± 0.335
TrpMet: 0.545 ± 0.272
TrpAsn: 1.362 ± 0.348
TrpPro: 0.0 ± 0.0
TrpGln: 0.681 ± 0.25
TrpArg: 0.545 ± 0.219
TrpSer: 0.545 ± 0.287
TrpThr: 0.136 ± 0.124
TrpVal: 0.545 ± 0.253
TrpTrp: 0.272 ± 0.193
TrpTyr: 0.953 ± 0.358
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.132 ± 0.572
TyrCys: 1.089 ± 0.456
TyrAsp: 3.541 ± 0.556
TyrGlu: 4.086 ± 0.688
TyrPhe: 1.634 ± 0.363
TyrGly: 3.132 ± 0.651
TyrHis: 0.545 ± 0.236
TyrIle: 3.268 ± 0.694
TyrLys: 4.903 ± 0.785
TyrLeu: 2.451 ± 0.494
TyrMet: 1.226 ± 0.37
TyrAsn: 3.949 ± 0.714
TyrPro: 1.362 ± 0.334
TyrGln: 1.362 ± 0.466
TyrArg: 2.179 ± 0.487
TyrSer: 2.043 ± 0.487
TyrThr: 2.86 ± 0.455
TyrVal: 3.677 ± 0.801
TyrTrp: 1.089 ± 0.479
TyrTyr: 2.179 ± 0.544
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 43 proteins (7344 amino acids)
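For a rough sense of scale, a per-mille entry can be converted back to an approximate number of occurrences. The sketch below is a hedged back-of-the-envelope check, not part of the database: the helper `approx_count` is hypothetical, and it assumes the 7344 residues quoted above as an approximate denominator, since the exact number of dipeptide windows counted is not stated.

```python
def approx_count(value_permille, n_windows=7344):
    """Estimate how many times a dipeptide occurs from its per-mille frequency.

    n_windows is an assumed denominator (the residue count quoted in the
    statistics line); the true number of counted windows may differ slightly.
    """
    return round(value_permille * 1e-3 * n_windows)


print(approx_count(0.136))   # smallest non-zero entries: about one occurrence
print(approx_count(10.895))  # LysLys: about 80 occurrences
```

Entries such as 0.136‰ therefore rest on very few observations, which is worth keeping in mind when reading the error estimates.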

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
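A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's implementation: the function names are hypothetical, whole proteins are resampled with replacement, and the spread is reported as the sample standard deviation of the resampled estimates (the source does not say which spread measure the ± values represent).

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide, counting overlapping pairs per protein."""
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Spread of the dipeptide frequency over protein-level bootstrap resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Toy input (hypothetical sequences, not the phage proteome)
toy = ["MKKLE", "AKKKG", "MSTLE", "GKKA"]
print(round(pair_permille(toy, "KK"), 3), "±", round(bootstrap_error(toy, "KK"), 3))
```

Resampling whole proteins rather than individual residues keeps the pairs within each protein intact, so the error reflects variation between proteins.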

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski