Amino acid dipepetide frequency for Bacillus phage vB_BveP-Goe6

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.989AlaAla: 2.989 ± 0.719
0.83AlaCys: 0.83 ± 0.411
3.321AlaAsp: 3.321 ± 0.696
3.819AlaGlu: 3.819 ± 0.859
2.657AlaPhe: 2.657 ± 0.603
3.321AlaGly: 3.321 ± 0.754
0.83AlaHis: 0.83 ± 0.318
2.823AlaIle: 2.823 ± 0.688
3.985AlaLys: 3.985 ± 0.637
4.484AlaLeu: 4.484 ± 0.647
0.996AlaMet: 0.996 ± 0.429
2.491AlaAsn: 2.491 ± 0.617
1.495AlaPro: 1.495 ± 0.539
2.657AlaGln: 2.657 ± 0.813
2.491AlaArg: 2.491 ± 0.53
4.65AlaSer: 4.65 ± 0.721
3.653AlaThr: 3.653 ± 0.902
4.816AlaVal: 4.816 ± 1.355
0.996AlaTrp: 0.996 ± 0.431
2.989AlaTyr: 2.989 ± 0.483
0.0AlaXaa: 0.0 ± 0.0
Cys
0.332CysAla: 0.332 ± 0.204
0.0CysCys: 0.0 ± 0.0
0.498CysAsp: 0.498 ± 0.341
0.83CysGlu: 0.83 ± 0.308
0.166CysPhe: 0.166 ± 0.163
0.498CysGly: 0.498 ± 0.26
0.332CysHis: 0.332 ± 0.235
0.332CysIle: 0.332 ± 0.228
0.0CysLys: 0.0 ± 0.0
0.498CysLeu: 0.498 ± 0.251
0.166CysMet: 0.166 ± 0.143
0.332CysAsn: 0.332 ± 0.195
0.166CysPro: 0.166 ± 0.207
0.0CysGln: 0.0 ± 0.0
0.332CysArg: 0.332 ± 0.242
0.332CysSer: 0.332 ± 0.235
0.332CysThr: 0.332 ± 0.196
0.664CysVal: 0.664 ± 0.297
0.0CysTrp: 0.0 ± 0.0
0.83CysTyr: 0.83 ± 0.339
0.0CysXaa: 0.0 ± 0.0
Asp
2.657AspAla: 2.657 ± 0.638
0.664AspCys: 0.664 ± 0.23
4.816AspAsp: 4.816 ± 0.705
4.151AspGlu: 4.151 ± 0.69
3.653AspPhe: 3.653 ± 1.191
5.978AspGly: 5.978 ± 1.234
0.83AspHis: 0.83 ± 0.331
5.314AspIle: 5.314 ± 0.974
5.314AspLys: 5.314 ± 1.569
5.48AspLeu: 5.48 ± 1.064
1.495AspMet: 1.495 ± 0.612
4.65AspAsn: 4.65 ± 0.617
1.993AspPro: 1.993 ± 0.499
0.664AspGln: 0.664 ± 0.278
1.827AspArg: 1.827 ± 0.736
3.321AspSer: 3.321 ± 0.518
2.989AspThr: 2.989 ± 0.788
4.318AspVal: 4.318 ± 0.796
0.996AspTrp: 0.996 ± 0.539
3.321AspTyr: 3.321 ± 1.178
0.0AspXaa: 0.0 ± 0.0
Glu
3.321GluAla: 3.321 ± 0.83
0.166GluCys: 0.166 ± 0.168
4.318GluAsp: 4.318 ± 1.001
4.318GluGlu: 4.318 ± 0.782
1.661GluPhe: 1.661 ± 0.495
3.985GluGly: 3.985 ± 0.776
1.162GluHis: 1.162 ± 0.416
5.148GluIle: 5.148 ± 0.716
5.48GluLys: 5.48 ± 1.057
6.144GluLeu: 6.144 ± 1.044
1.827GluMet: 1.827 ± 0.5
4.65GluAsn: 4.65 ± 0.738
0.996GluPro: 0.996 ± 0.433
3.155GluGln: 3.155 ± 0.894
2.657GluArg: 2.657 ± 0.596
3.653GluSer: 3.653 ± 0.755
4.318GluThr: 4.318 ± 0.858
4.151GluVal: 4.151 ± 1.127
0.996GluTrp: 0.996 ± 0.529
4.151GluTyr: 4.151 ± 0.797
0.0GluXaa: 0.0 ± 0.0
Phe
2.657PheAla: 2.657 ± 0.525
0.166PheCys: 0.166 ± 0.207
3.985PheAsp: 3.985 ± 0.95
3.819PheGlu: 3.819 ± 0.754
1.328PhePhe: 1.328 ± 0.541
1.993PheGly: 1.993 ± 0.558
0.83PheHis: 0.83 ± 0.286
3.487PheIle: 3.487 ± 0.973
3.985PheLys: 3.985 ± 1.633
2.989PheLeu: 2.989 ± 0.765
1.162PheMet: 1.162 ± 0.354
3.653PheAsn: 3.653 ± 0.8
1.495PhePro: 1.495 ± 0.409
0.996PheGln: 0.996 ± 0.478
1.827PheArg: 1.827 ± 0.527
2.823PheSer: 2.823 ± 0.524
2.325PheThr: 2.325 ± 0.507
2.159PheVal: 2.159 ± 0.683
0.166PheTrp: 0.166 ± 0.168
2.159PheTyr: 2.159 ± 0.53
0.0PheXaa: 0.0 ± 0.0
Gly
3.653GlyAla: 3.653 ± 0.757
0.332GlyCys: 0.332 ± 0.266
4.982GlyAsp: 4.982 ± 1.27
3.653GlyGlu: 3.653 ± 0.63
3.985GlyPhe: 3.985 ± 0.909
6.144GlyGly: 6.144 ± 1.329
0.996GlyHis: 0.996 ± 0.41
4.484GlyIle: 4.484 ± 1.626
4.151GlyLys: 4.151 ± 0.76
5.646GlyLeu: 5.646 ± 0.835
1.827GlyMet: 1.827 ± 0.718
5.812GlyAsn: 5.812 ± 1.41
0.0GlyPro: 0.0 ± 0.0
2.491GlyGln: 2.491 ± 0.546
1.495GlyArg: 1.495 ± 0.543
4.151GlySer: 4.151 ± 0.928
4.982GlyThr: 4.982 ± 0.604
4.65GlyVal: 4.65 ± 1.077
0.83GlyTrp: 0.83 ± 0.524
3.155GlyTyr: 3.155 ± 0.704
0.0GlyXaa: 0.0 ± 0.0
His
0.664HisAla: 0.664 ± 0.299
0.0HisCys: 0.0 ± 0.0
0.83HisAsp: 0.83 ± 0.39
1.495HisGlu: 1.495 ± 0.584
0.83HisPhe: 0.83 ± 0.288
1.162HisGly: 1.162 ± 0.506
0.498HisHis: 0.498 ± 0.296
0.83HisIle: 0.83 ± 0.41
1.661HisLys: 1.661 ± 0.557
0.83HisLeu: 0.83 ± 0.357
0.332HisMet: 0.332 ± 0.211
0.996HisAsn: 0.996 ± 0.308
0.166HisPro: 0.166 ± 0.163
0.332HisGln: 0.332 ± 0.214
0.0HisArg: 0.0 ± 0.0
0.498HisSer: 0.498 ± 0.286
0.664HisThr: 0.664 ± 0.255
1.495HisVal: 1.495 ± 0.42
0.166HisTrp: 0.166 ± 0.157
0.996HisTyr: 0.996 ± 0.4
0.0HisXaa: 0.0 ± 0.0
Ile
3.487IleAla: 3.487 ± 0.589
0.664IleCys: 0.664 ± 0.321
4.816IleAsp: 4.816 ± 0.772
5.148IleGlu: 5.148 ± 1.1
2.823IlePhe: 2.823 ± 0.723
4.65IleGly: 4.65 ± 0.941
1.162IleHis: 1.162 ± 0.43
4.65IleIle: 4.65 ± 1.124
6.144IleLys: 6.144 ± 1.268
3.321IleLeu: 3.321 ± 0.662
1.162IleMet: 1.162 ± 0.555
6.144IleAsn: 6.144 ± 1.043
2.325IlePro: 2.325 ± 0.56
2.159IleGln: 2.159 ± 0.765
3.487IleArg: 3.487 ± 0.661
4.484IleSer: 4.484 ± 0.948
4.982IleThr: 4.982 ± 0.638
3.487IleVal: 3.487 ± 0.681
0.332IleTrp: 0.332 ± 0.196
2.159IleTyr: 2.159 ± 0.623
0.0IleXaa: 0.0 ± 0.0
Lys
4.65LysAla: 4.65 ± 0.943
0.166LysCys: 0.166 ± 0.17
4.484LysAsp: 4.484 ± 0.866
5.314LysGlu: 5.314 ± 1.559
3.155LysPhe: 3.155 ± 0.862
4.151LysGly: 4.151 ± 0.659
0.498LysHis: 0.498 ± 0.354
4.982LysIle: 4.982 ± 0.993
5.48LysLys: 5.48 ± 1.243
5.812LysLeu: 5.812 ± 1.145
3.653LysMet: 3.653 ± 0.84
4.318LysAsn: 4.318 ± 0.73
2.491LysPro: 2.491 ± 0.624
2.989LysGln: 2.989 ± 0.473
4.151LysArg: 4.151 ± 0.83
4.65LysSer: 4.65 ± 0.793
4.982LysThr: 4.982 ± 0.765
5.48LysVal: 5.48 ± 0.858
1.328LysTrp: 1.328 ± 0.503
1.827LysTyr: 1.827 ± 0.444
0.0LysXaa: 0.0 ± 0.0
Leu
3.985LeuAla: 3.985 ± 1.036
0.664LeuCys: 0.664 ± 0.297
4.484LeuAsp: 4.484 ± 0.741
6.476LeuGlu: 6.476 ± 1.235
3.653LeuPhe: 3.653 ± 0.89
4.151LeuGly: 4.151 ± 0.746
1.162LeuHis: 1.162 ± 0.369
4.65LeuIle: 4.65 ± 0.649
6.476LeuLys: 6.476 ± 1.254
4.65LeuLeu: 4.65 ± 0.785
2.657LeuMet: 2.657 ± 0.54
5.812LeuAsn: 5.812 ± 0.747
2.657LeuPro: 2.657 ± 0.674
2.989LeuGln: 2.989 ± 0.844
3.321LeuArg: 3.321 ± 0.703
7.14LeuSer: 7.14 ± 0.841
5.148LeuThr: 5.148 ± 0.835
4.816LeuVal: 4.816 ± 0.938
0.664LeuTrp: 0.664 ± 0.326
2.657LeuTyr: 2.657 ± 0.832
0.0LeuXaa: 0.0 ± 0.0
Met
1.495MetAla: 1.495 ± 0.437
0.166MetCys: 0.166 ± 0.157
1.328MetAsp: 1.328 ± 0.515
1.661MetGlu: 1.661 ± 0.537
0.664MetPhe: 0.664 ± 0.651
1.495MetGly: 1.495 ± 0.392
0.332MetHis: 0.332 ± 0.228
2.491MetIle: 2.491 ± 0.698
1.827MetLys: 1.827 ± 0.593
1.827MetLeu: 1.827 ± 0.604
0.996MetMet: 0.996 ± 0.378
1.328MetAsn: 1.328 ± 0.325
0.996MetPro: 0.996 ± 0.331
1.162MetGln: 1.162 ± 0.425
1.495MetArg: 1.495 ± 0.612
1.495MetSer: 1.495 ± 0.431
1.495MetThr: 1.495 ± 0.443
2.491MetVal: 2.491 ± 0.724
0.166MetTrp: 0.166 ± 0.14
1.162MetTyr: 1.162 ± 0.358
0.0MetXaa: 0.0 ± 0.0
Asn
4.982AsnAla: 4.982 ± 1.165
0.664AsnCys: 0.664 ± 0.532
4.982AsnAsp: 4.982 ± 0.944
3.487AsnGlu: 3.487 ± 0.81
1.661AsnPhe: 1.661 ± 0.465
4.484AsnGly: 4.484 ± 1.107
0.498AsnHis: 0.498 ± 0.353
4.484AsnIle: 4.484 ± 1.139
5.812AsnLys: 5.812 ± 0.971
4.982AsnLeu: 4.982 ± 0.832
1.993AsnMet: 1.993 ± 0.481
4.151AsnAsn: 4.151 ± 0.931
3.155AsnPro: 3.155 ± 0.635
2.989AsnGln: 2.989 ± 0.848
2.159AsnArg: 2.159 ± 0.577
3.985AsnSer: 3.985 ± 0.472
4.982AsnThr: 4.982 ± 0.784
5.148AsnVal: 5.148 ± 0.743
0.664AsnTrp: 0.664 ± 0.23
2.989AsnTyr: 2.989 ± 0.811
0.0AsnXaa: 0.0 ± 0.0
Pro
1.495ProAla: 1.495 ± 0.403
0.166ProCys: 0.166 ± 0.157
2.325ProAsp: 2.325 ± 0.672
1.661ProGlu: 1.661 ± 0.501
1.328ProPhe: 1.328 ± 0.46
0.498ProGly: 0.498 ± 0.218
0.498ProHis: 0.498 ± 0.336
0.83ProIle: 0.83 ± 0.337
1.328ProLys: 1.328 ± 0.528
2.325ProLeu: 2.325 ± 0.507
0.332ProMet: 0.332 ± 0.204
3.321ProAsn: 3.321 ± 0.736
0.498ProPro: 0.498 ± 0.266
1.162ProGln: 1.162 ± 0.467
1.661ProArg: 1.661 ± 0.405
1.827ProSer: 1.827 ± 0.446
2.325ProThr: 2.325 ± 0.526
2.491ProVal: 2.491 ± 1.118
0.166ProTrp: 0.166 ± 0.169
2.159ProTyr: 2.159 ± 0.45
0.0ProXaa: 0.0 ± 0.0
Gln
2.823GlnAla: 2.823 ± 0.669
0.0GlnCys: 0.0 ± 0.0
1.661GlnAsp: 1.661 ± 0.454
2.823GlnGlu: 2.823 ± 0.713
2.325GlnPhe: 2.325 ± 0.634
3.321GlnGly: 3.321 ± 0.639
0.332GlnHis: 0.332 ± 0.258
2.159GlnIle: 2.159 ± 0.597
2.325GlnLys: 2.325 ± 0.77
3.653GlnLeu: 3.653 ± 0.736
0.83GlnMet: 0.83 ± 0.307
1.162GlnAsn: 1.162 ± 0.388
0.332GlnPro: 0.332 ± 0.21
0.83GlnGln: 0.83 ± 0.433
1.827GlnArg: 1.827 ± 0.752
1.827GlnSer: 1.827 ± 0.496
1.827GlnThr: 1.827 ± 0.754
2.823GlnVal: 2.823 ± 0.61
0.83GlnTrp: 0.83 ± 0.401
1.661GlnTyr: 1.661 ± 0.526
0.0GlnXaa: 0.0 ± 0.0
Arg
2.325ArgAla: 2.325 ± 0.565
0.332ArgCys: 0.332 ± 0.237
1.495ArgAsp: 1.495 ± 0.46
2.989ArgGlu: 2.989 ± 0.657
1.993ArgPhe: 1.993 ± 0.956
3.321ArgGly: 3.321 ± 0.582
0.498ArgHis: 0.498 ± 0.28
3.321ArgIle: 3.321 ± 0.82
2.491ArgLys: 2.491 ± 0.735
3.653ArgLeu: 3.653 ± 0.732
1.495ArgMet: 1.495 ± 0.541
2.491ArgAsn: 2.491 ± 0.672
1.162ArgPro: 1.162 ± 0.298
1.162ArgGln: 1.162 ± 0.467
2.159ArgArg: 2.159 ± 0.679
2.657ArgSer: 2.657 ± 0.445
3.155ArgThr: 3.155 ± 1.045
2.491ArgVal: 2.491 ± 0.512
0.664ArgTrp: 0.664 ± 0.265
1.495ArgTyr: 1.495 ± 0.414
0.0ArgXaa: 0.0 ± 0.0
Ser
2.823SerAla: 2.823 ± 0.883
0.332SerCys: 0.332 ± 0.204
3.155SerAsp: 3.155 ± 0.849
5.148SerGlu: 5.148 ± 0.989
3.985SerPhe: 3.985 ± 1.075
5.646SerGly: 5.646 ± 1.189
0.996SerHis: 0.996 ± 0.413
3.653SerIle: 3.653 ± 0.549
3.487SerLys: 3.487 ± 0.871
7.14SerLeu: 7.14 ± 0.981
0.83SerMet: 0.83 ± 0.336
5.314SerAsn: 5.314 ± 1.066
1.661SerPro: 1.661 ± 0.408
2.657SerGln: 2.657 ± 0.602
3.487SerArg: 3.487 ± 0.646
5.978SerSer: 5.978 ± 1.176
3.487SerThr: 3.487 ± 0.849
4.982SerVal: 4.982 ± 0.882
0.332SerTrp: 0.332 ± 0.202
3.321SerTyr: 3.321 ± 0.749
0.0SerXaa: 0.0 ± 0.0
Thr
4.816ThrAla: 4.816 ± 1.035
0.498ThrCys: 0.498 ± 0.24
5.148ThrAsp: 5.148 ± 0.673
3.155ThrGlu: 3.155 ± 0.584
3.321ThrPhe: 3.321 ± 0.798
4.65ThrGly: 4.65 ± 0.894
0.83ThrHis: 0.83 ± 0.323
4.484ThrIle: 4.484 ± 0.869
4.816ThrLys: 4.816 ± 0.699
4.816ThrLeu: 4.816 ± 0.886
0.996ThrMet: 0.996 ± 0.428
2.823ThrAsn: 2.823 ± 0.61
2.657ThrPro: 2.657 ± 0.637
1.495ThrGln: 1.495 ± 0.591
1.993ThrArg: 1.993 ± 0.598
4.816ThrSer: 4.816 ± 0.945
6.642ThrThr: 6.642 ± 1.582
4.816ThrVal: 4.816 ± 0.974
1.328ThrTrp: 1.328 ± 0.337
2.159ThrTyr: 2.159 ± 0.537
0.0ThrXaa: 0.0 ± 0.0
Val
4.151ValAla: 4.151 ± 0.955
0.332ValCys: 0.332 ± 0.252
4.65ValAsp: 4.65 ± 1.213
3.155ValGlu: 3.155 ± 0.701
3.155ValPhe: 3.155 ± 1.109
3.321ValGly: 3.321 ± 0.555
0.498ValHis: 0.498 ± 0.257
4.484ValIle: 4.484 ± 0.699
5.148ValLys: 5.148 ± 0.969
5.48ValLeu: 5.48 ± 1.133
1.162ValMet: 1.162 ± 0.471
4.151ValAsn: 4.151 ± 0.726
2.491ValPro: 2.491 ± 0.587
3.155ValGln: 3.155 ± 0.571
2.491ValArg: 2.491 ± 0.583
6.974ValSer: 6.974 ± 1.504
4.982ValThr: 4.982 ± 0.724
3.985ValVal: 3.985 ± 0.696
1.328ValTrp: 1.328 ± 0.514
3.487ValTyr: 3.487 ± 0.761
0.0ValXaa: 0.0 ± 0.0
Trp
0.83TrpAla: 0.83 ± 0.51
0.0TrpCys: 0.0 ± 0.0
0.332TrpAsp: 0.332 ± 0.238
0.498TrpGlu: 0.498 ± 0.259
0.83TrpPhe: 0.83 ± 0.477
0.498TrpGly: 0.498 ± 0.264
0.498TrpHis: 0.498 ± 0.242
0.498TrpIle: 0.498 ± 0.297
0.83TrpLys: 0.83 ± 0.408
0.996TrpLeu: 0.996 ± 0.588
0.498TrpMet: 0.498 ± 0.254
1.328TrpAsn: 1.328 ± 0.371
0.0TrpPro: 0.0 ± 0.0
1.162TrpGln: 1.162 ± 0.432
0.664TrpArg: 0.664 ± 0.308
0.996TrpSer: 0.996 ± 0.295
0.996TrpThr: 0.996 ± 0.633
0.83TrpVal: 0.83 ± 0.341
0.0TrpTrp: 0.0 ± 0.0
0.498TrpTyr: 0.498 ± 0.215
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.159TyrAla: 2.159 ± 0.529
0.498TyrCys: 0.498 ± 0.319
2.823TyrAsp: 2.823 ± 0.681
2.657TyrGlu: 2.657 ± 0.527
0.83TyrPhe: 0.83 ± 0.437
4.151TyrGly: 4.151 ± 0.937
1.162TyrHis: 1.162 ± 0.526
4.484TyrIle: 4.484 ± 1.025
3.819TyrLys: 3.819 ± 0.9
3.487TyrLeu: 3.487 ± 0.625
1.495TyrMet: 1.495 ± 0.697
3.321TyrAsn: 3.321 ± 0.731
1.661TyrPro: 1.661 ± 0.428
1.162TyrGln: 1.162 ± 0.499
1.827TyrArg: 1.827 ± 0.497
2.159TyrSer: 2.159 ± 0.439
1.993TyrThr: 1.993 ± 0.616
2.657TyrVal: 2.657 ± 0.541
0.83TyrTrp: 0.83 ± 0.339
1.495TyrTyr: 1.495 ± 0.593
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (6023 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski