Amino acid dipepetide frequency for Arthrobacter phage BlueFeather

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
21.272AlaAla: 21.272 ± 2.124
0.197AlaCys: 0.197 ± 0.172
7.288AlaAsp: 7.288 ± 0.975
8.076AlaGlu: 8.076 ± 1.59
4.136AlaPhe: 4.136 ± 0.715
14.969AlaGly: 14.969 ± 1.933
2.167AlaHis: 2.167 ± 0.589
4.727AlaIle: 4.727 ± 0.819
4.727AlaLys: 4.727 ± 1.649
11.621AlaLeu: 11.621 ± 1.548
3.545AlaMet: 3.545 ± 1.005
3.939AlaAsn: 3.939 ± 0.64
6.106AlaPro: 6.106 ± 1.193
3.939AlaGln: 3.939 ± 1.096
7.485AlaArg: 7.485 ± 1.547
7.682AlaSer: 7.682 ± 1.194
7.682AlaThr: 7.682 ± 1.024
10.439AlaVal: 10.439 ± 0.869
2.364AlaTrp: 2.364 ± 0.952
2.561AlaTyr: 2.561 ± 0.8
0.0AlaXaa: 0.0 ± 0.0
Cys
0.394CysAla: 0.394 ± 0.269
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.591CysGlu: 0.591 ± 0.31
0.0CysPhe: 0.0 ± 0.0
0.591CysGly: 0.591 ± 0.291
0.394CysHis: 0.394 ± 0.344
0.197CysIle: 0.197 ± 0.172
0.0CysLys: 0.0 ± 0.0
1.182CysLeu: 1.182 ± 0.474
0.0CysMet: 0.0 ± 0.0
0.197CysAsn: 0.197 ± 0.2
0.197CysPro: 0.197 ± 0.257
0.394CysGln: 0.394 ± 0.284
0.788CysArg: 0.788 ± 0.433
0.197CysSer: 0.197 ± 0.172
0.394CysThr: 0.394 ± 0.268
0.197CysVal: 0.197 ± 0.182
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
10.833AspAla: 10.833 ± 1.671
0.197AspCys: 0.197 ± 0.172
3.151AspAsp: 3.151 ± 0.719
2.955AspGlu: 2.955 ± 0.579
2.561AspPhe: 2.561 ± 0.813
5.712AspGly: 5.712 ± 0.939
1.576AspHis: 1.576 ± 0.62
1.97AspIle: 1.97 ± 0.469
3.545AspLys: 3.545 ± 1.152
6.5AspLeu: 6.5 ± 0.964
0.394AspMet: 0.394 ± 0.313
2.167AspAsn: 2.167 ± 0.524
2.561AspPro: 2.561 ± 0.876
0.788AspGln: 0.788 ± 0.304
3.742AspArg: 3.742 ± 0.823
3.348AspSer: 3.348 ± 0.67
5.318AspThr: 5.318 ± 0.99
3.742AspVal: 3.742 ± 0.925
1.182AspTrp: 1.182 ± 0.529
2.561AspTyr: 2.561 ± 0.756
0.0AspXaa: 0.0 ± 0.0
Glu
13.0GluAla: 13.0 ± 3.056
0.394GluCys: 0.394 ± 0.237
3.742GluAsp: 3.742 ± 1.078
3.742GluGlu: 3.742 ± 1.394
1.182GluPhe: 1.182 ± 0.449
2.561GluGly: 2.561 ± 0.592
0.197GluHis: 0.197 ± 0.19
1.576GluIle: 1.576 ± 0.58
0.788GluLys: 0.788 ± 0.388
5.909GluLeu: 5.909 ± 0.998
1.182GluMet: 1.182 ± 0.443
2.167GluAsn: 2.167 ± 0.747
1.97GluPro: 1.97 ± 0.481
1.379GluGln: 1.379 ± 0.59
4.136GluArg: 4.136 ± 0.914
3.939GluSer: 3.939 ± 1.007
1.773GluThr: 1.773 ± 0.556
3.545GluVal: 3.545 ± 0.756
1.182GluTrp: 1.182 ± 0.483
1.379GluTyr: 1.379 ± 0.408
0.0GluXaa: 0.0 ± 0.0
Phe
2.364PheAla: 2.364 ± 0.679
0.197PheCys: 0.197 ± 0.172
3.151PheAsp: 3.151 ± 0.81
1.576PheGlu: 1.576 ± 0.465
0.197PhePhe: 0.197 ± 0.172
3.742PheGly: 3.742 ± 1.072
0.591PheHis: 0.591 ± 0.536
1.773PheIle: 1.773 ± 0.74
1.379PheLys: 1.379 ± 0.463
1.182PheLeu: 1.182 ± 0.225
0.394PheMet: 0.394 ± 0.241
1.576PheAsn: 1.576 ± 0.428
1.182PhePro: 1.182 ± 0.411
1.379PheGln: 1.379 ± 0.606
0.985PheArg: 0.985 ± 0.501
0.788PheSer: 0.788 ± 0.346
2.561PheThr: 2.561 ± 0.859
1.97PheVal: 1.97 ± 0.658
0.591PheTrp: 0.591 ± 0.246
0.591PheTyr: 0.591 ± 0.341
0.0PheXaa: 0.0 ± 0.0
Gly
7.485GlyAla: 7.485 ± 0.947
0.394GlyCys: 0.394 ± 0.259
5.712GlyAsp: 5.712 ± 1.242
5.318GlyGlu: 5.318 ± 0.919
3.939GlyPhe: 3.939 ± 0.719
7.879GlyGly: 7.879 ± 2.029
0.788GlyHis: 0.788 ± 0.557
3.348GlyIle: 3.348 ± 0.792
3.151GlyLys: 3.151 ± 0.846
6.303GlyLeu: 6.303 ± 1.154
2.561GlyMet: 2.561 ± 0.745
2.758GlyAsn: 2.758 ± 0.739
4.53GlyPro: 4.53 ± 1.016
3.151GlyGln: 3.151 ± 0.831
5.712GlyArg: 5.712 ± 1.08
4.136GlySer: 4.136 ± 1.216
7.091GlyThr: 7.091 ± 1.277
5.515GlyVal: 5.515 ± 0.913
2.167GlyTrp: 2.167 ± 0.815
2.167GlyTyr: 2.167 ± 0.663
0.0GlyXaa: 0.0 ± 0.0
His
2.364HisAla: 2.364 ± 0.614
0.197HisCys: 0.197 ± 0.183
0.788HisAsp: 0.788 ± 0.438
0.394HisGlu: 0.394 ± 0.319
0.197HisPhe: 0.197 ± 0.197
1.576HisGly: 1.576 ± 0.398
0.394HisHis: 0.394 ± 0.279
1.576HisIle: 1.576 ± 0.487
0.394HisLys: 0.394 ± 0.256
2.364HisLeu: 2.364 ± 0.676
0.197HisMet: 0.197 ± 0.171
0.394HisAsn: 0.394 ± 0.284
0.985HisPro: 0.985 ± 0.526
0.197HisGln: 0.197 ± 0.179
1.773HisArg: 1.773 ± 0.611
0.591HisSer: 0.591 ± 0.311
0.985HisThr: 0.985 ± 0.455
0.985HisVal: 0.985 ± 0.439
0.0HisTrp: 0.0 ± 0.0
0.394HisTyr: 0.394 ± 0.211
0.0HisXaa: 0.0 ± 0.0
Ile
3.545IleAla: 3.545 ± 0.788
0.197IleCys: 0.197 ± 0.182
1.379IleAsp: 1.379 ± 0.563
2.167IleGlu: 2.167 ± 0.742
0.788IlePhe: 0.788 ± 0.365
3.545IleGly: 3.545 ± 0.516
0.985IleHis: 0.985 ± 0.376
0.985IleIle: 0.985 ± 0.44
2.364IleLys: 2.364 ± 0.631
2.561IleLeu: 2.561 ± 0.734
0.0IleMet: 0.0 ± 0.0
2.167IleAsn: 2.167 ± 0.517
3.151IlePro: 3.151 ± 0.706
2.364IleGln: 2.364 ± 0.614
2.758IleArg: 2.758 ± 0.812
2.364IleSer: 2.364 ± 0.529
3.742IleThr: 3.742 ± 0.718
1.97IleVal: 1.97 ± 0.61
0.788IleTrp: 0.788 ± 0.234
2.167IleTyr: 2.167 ± 0.612
0.0IleXaa: 0.0 ± 0.0
Lys
3.742LysAla: 3.742 ± 0.822
0.394LysCys: 0.394 ± 0.28
3.545LysAsp: 3.545 ± 1.273
3.348LysGlu: 3.348 ± 1.175
0.985LysPhe: 0.985 ± 0.37
3.545LysGly: 3.545 ± 0.813
0.197LysHis: 0.197 ± 0.197
1.97LysIle: 1.97 ± 0.654
0.591LysLys: 0.591 ± 0.31
4.136LysLeu: 4.136 ± 0.841
0.197LysMet: 0.197 ± 0.189
1.379LysAsn: 1.379 ± 0.443
2.561LysPro: 2.561 ± 0.802
0.591LysGln: 0.591 ± 0.275
1.379LysArg: 1.379 ± 0.443
0.985LysSer: 0.985 ± 0.4
3.545LysThr: 3.545 ± 1.01
3.348LysVal: 3.348 ± 1.09
0.788LysTrp: 0.788 ± 0.31
0.591LysTyr: 0.591 ± 0.321
0.0LysXaa: 0.0 ± 0.0
Leu
9.454LeuAla: 9.454 ± 0.986
0.591LeuCys: 0.591 ± 0.33
6.106LeuAsp: 6.106 ± 1.121
5.712LeuGlu: 5.712 ± 1.236
1.97LeuPhe: 1.97 ± 0.587
5.909LeuGly: 5.909 ± 1.074
0.985LeuHis: 0.985 ± 0.531
2.364LeuIle: 2.364 ± 0.425
3.545LeuLys: 3.545 ± 0.74
3.545LeuLeu: 3.545 ± 1.066
1.576LeuMet: 1.576 ± 0.541
2.561LeuAsn: 2.561 ± 0.591
4.53LeuPro: 4.53 ± 0.928
3.939LeuGln: 3.939 ± 0.81
6.5LeuArg: 6.5 ± 1.12
3.939LeuSer: 3.939 ± 0.907
9.651LeuThr: 9.651 ± 1.016
4.727LeuVal: 4.727 ± 1.091
1.576LeuTrp: 1.576 ± 0.647
2.758LeuTyr: 2.758 ± 0.718
0.0LeuXaa: 0.0 ± 0.0
Met
2.364MetAla: 2.364 ± 0.853
0.197MetCys: 0.197 ± 0.171
1.773MetAsp: 1.773 ± 0.632
0.788MetGlu: 0.788 ± 0.32
0.197MetPhe: 0.197 ± 0.188
1.379MetGly: 1.379 ± 0.442
0.0MetHis: 0.0 ± 0.0
0.0MetIle: 0.0 ± 0.0
0.591MetLys: 0.591 ± 0.275
2.167MetLeu: 2.167 ± 0.54
0.788MetMet: 0.788 ± 0.381
1.182MetAsn: 1.182 ± 0.41
0.985MetPro: 0.985 ± 0.437
0.591MetGln: 0.591 ± 0.408
1.379MetArg: 1.379 ± 0.722
1.576MetSer: 1.576 ± 0.494
1.773MetThr: 1.773 ± 0.519
2.758MetVal: 2.758 ± 0.856
0.394MetTrp: 0.394 ± 0.208
0.197MetTyr: 0.197 ± 0.206
0.0MetXaa: 0.0 ± 0.0
Asn
4.136AsnAla: 4.136 ± 0.617
0.0AsnCys: 0.0 ± 0.0
1.182AsnAsp: 1.182 ± 0.486
0.591AsnGlu: 0.591 ± 0.403
1.773AsnPhe: 1.773 ± 0.788
3.151AsnGly: 3.151 ± 0.681
0.394AsnHis: 0.394 ± 0.264
1.576AsnIle: 1.576 ± 0.608
1.773AsnLys: 1.773 ± 0.812
2.561AsnLeu: 2.561 ± 0.67
0.591AsnMet: 0.591 ± 0.321
0.591AsnAsn: 0.591 ± 0.275
2.955AsnPro: 2.955 ± 0.97
0.788AsnGln: 0.788 ± 0.328
0.985AsnArg: 0.985 ± 0.548
2.364AsnSer: 2.364 ± 0.722
3.348AsnThr: 3.348 ± 0.868
3.151AsnVal: 3.151 ± 0.685
0.0AsnTrp: 0.0 ± 0.0
0.591AsnTyr: 0.591 ± 0.263
0.0AsnXaa: 0.0 ± 0.0
Pro
8.273ProAla: 8.273 ± 1.444
0.591ProCys: 0.591 ± 0.387
3.939ProAsp: 3.939 ± 0.834
4.136ProGlu: 4.136 ± 0.82
1.576ProPhe: 1.576 ± 0.479
4.53ProGly: 4.53 ± 0.922
0.985ProHis: 0.985 ± 0.586
2.561ProIle: 2.561 ± 0.612
1.97ProLys: 1.97 ± 0.662
4.333ProLeu: 4.333 ± 0.709
0.197ProMet: 0.197 ± 0.186
0.985ProAsn: 0.985 ± 0.403
1.97ProPro: 1.97 ± 0.925
2.167ProGln: 2.167 ± 0.574
3.348ProArg: 3.348 ± 1.011
4.333ProSer: 4.333 ± 1.182
2.758ProThr: 2.758 ± 1.585
3.151ProVal: 3.151 ± 0.867
0.591ProTrp: 0.591 ± 0.28
1.576ProTyr: 1.576 ± 0.728
0.0ProXaa: 0.0 ± 0.0
Gln
3.348GlnAla: 3.348 ± 0.882
0.394GlnCys: 0.394 ± 0.28
3.742GlnAsp: 3.742 ± 0.786
2.758GlnGlu: 2.758 ± 0.54
0.985GlnPhe: 0.985 ± 0.347
2.955GlnGly: 2.955 ± 1.344
0.394GlnHis: 0.394 ± 0.236
1.576GlnIle: 1.576 ± 0.563
0.985GlnLys: 0.985 ± 0.429
1.576GlnLeu: 1.576 ± 0.493
1.182GlnMet: 1.182 ± 0.464
0.788GlnAsn: 0.788 ± 0.528
1.576GlnPro: 1.576 ± 0.551
0.197GlnGln: 0.197 ± 0.202
3.151GlnArg: 3.151 ± 0.665
1.182GlnSer: 1.182 ± 0.539
1.773GlnThr: 1.773 ± 0.65
2.364GlnVal: 2.364 ± 0.583
0.591GlnTrp: 0.591 ± 0.307
0.985GlnTyr: 0.985 ± 0.361
0.0GlnXaa: 0.0 ± 0.0
Arg
10.045ArgAla: 10.045 ± 1.473
0.591ArgCys: 0.591 ± 0.28
2.955ArgAsp: 2.955 ± 1.028
4.924ArgGlu: 4.924 ± 1.066
0.788ArgPhe: 0.788 ± 0.304
3.348ArgGly: 3.348 ± 0.809
1.379ArgHis: 1.379 ± 0.614
3.348ArgIle: 3.348 ± 0.919
2.758ArgLys: 2.758 ± 0.442
4.727ArgLeu: 4.727 ± 1.116
2.364ArgMet: 2.364 ± 0.655
1.773ArgAsn: 1.773 ± 0.5
5.318ArgPro: 5.318 ± 1.055
1.576ArgGln: 1.576 ± 0.619
5.712ArgArg: 5.712 ± 1.162
3.545ArgSer: 3.545 ± 0.831
2.758ArgThr: 2.758 ± 0.815
5.121ArgVal: 5.121 ± 0.878
1.379ArgTrp: 1.379 ± 0.604
1.773ArgTyr: 1.773 ± 0.879
0.0ArgXaa: 0.0 ± 0.0
Ser
5.712SerAla: 5.712 ± 1.091
0.394SerCys: 0.394 ± 0.262
2.364SerAsp: 2.364 ± 0.639
0.788SerGlu: 0.788 ± 0.421
1.97SerPhe: 1.97 ± 0.399
5.121SerGly: 5.121 ± 1.057
1.576SerHis: 1.576 ± 0.797
2.955SerIle: 2.955 ± 0.579
2.561SerLys: 2.561 ± 0.795
7.288SerLeu: 7.288 ± 0.652
1.773SerMet: 1.773 ± 0.602
0.985SerAsn: 0.985 ± 0.445
2.364SerPro: 2.364 ± 0.545
2.955SerGln: 2.955 ± 0.9
3.348SerArg: 3.348 ± 0.709
2.955SerSer: 2.955 ± 0.743
5.318SerThr: 5.318 ± 1.244
3.545SerVal: 3.545 ± 1.01
0.197SerTrp: 0.197 ± 0.218
1.379SerTyr: 1.379 ± 0.534
0.0SerXaa: 0.0 ± 0.0
Thr
12.015ThrAla: 12.015 ± 1.336
0.197ThrCys: 0.197 ± 0.171
5.909ThrAsp: 5.909 ± 1.592
3.151ThrGlu: 3.151 ± 0.696
2.364ThrPhe: 2.364 ± 0.68
6.106ThrGly: 6.106 ± 1.381
1.182ThrHis: 1.182 ± 0.451
2.955ThrIle: 2.955 ± 0.639
1.97ThrLys: 1.97 ± 0.658
6.106ThrLeu: 6.106 ± 0.989
1.182ThrMet: 1.182 ± 0.429
1.97ThrAsn: 1.97 ± 0.728
6.106ThrPro: 6.106 ± 1.071
1.379ThrGln: 1.379 ± 0.442
4.333ThrArg: 4.333 ± 0.978
3.348ThrSer: 3.348 ± 0.697
4.333ThrThr: 4.333 ± 0.708
7.091ThrVal: 7.091 ± 1.715
1.576ThrTrp: 1.576 ± 0.486
1.182ThrTyr: 1.182 ± 0.51
0.0ThrXaa: 0.0 ± 0.0
Val
7.879ValAla: 7.879 ± 1.132
0.197ValCys: 0.197 ± 0.164
5.909ValAsp: 5.909 ± 1.004
3.348ValGlu: 3.348 ± 0.757
1.182ValPhe: 1.182 ± 0.356
3.545ValGly: 3.545 ± 0.966
1.773ValHis: 1.773 ± 0.546
2.167ValIle: 2.167 ± 0.468
3.545ValLys: 3.545 ± 0.858
4.53ValLeu: 4.53 ± 0.773
1.773ValMet: 1.773 ± 0.65
3.545ValAsn: 3.545 ± 0.748
3.151ValPro: 3.151 ± 0.654
3.545ValGln: 3.545 ± 0.93
4.727ValArg: 4.727 ± 1.182
5.318ValSer: 5.318 ± 0.865
7.288ValThr: 7.288 ± 0.654
4.727ValVal: 4.727 ± 0.764
1.182ValTrp: 1.182 ± 0.471
1.97ValTyr: 1.97 ± 0.523
0.0ValXaa: 0.0 ± 0.0
Trp
2.364TrpAla: 2.364 ± 0.672
0.0TrpCys: 0.0 ± 0.0
0.985TrpAsp: 0.985 ± 0.378
0.197TrpGlu: 0.197 ± 0.217
0.788TrpPhe: 0.788 ± 0.364
0.788TrpGly: 0.788 ± 0.33
0.591TrpHis: 0.591 ± 0.375
0.985TrpIle: 0.985 ± 0.482
0.591TrpLys: 0.591 ± 0.328
1.97TrpLeu: 1.97 ± 0.531
0.788TrpMet: 0.788 ± 0.424
0.394TrpAsn: 0.394 ± 0.29
0.394TrpPro: 0.394 ± 0.248
0.394TrpGln: 0.394 ± 0.261
1.773TrpArg: 1.773 ± 0.656
0.985TrpSer: 0.985 ± 0.431
1.576TrpThr: 1.576 ± 0.543
1.379TrpVal: 1.379 ± 0.526
0.788TrpTrp: 0.788 ± 0.363
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.53TyrAla: 4.53 ± 0.979
0.394TyrCys: 0.394 ± 0.313
1.182TyrAsp: 1.182 ± 0.543
1.182TyrGlu: 1.182 ± 0.531
0.591TyrPhe: 0.591 ± 0.352
3.151TyrGly: 3.151 ± 0.506
0.197TyrHis: 0.197 ± 0.171
1.379TyrIle: 1.379 ± 0.471
0.591TyrLys: 0.591 ± 0.303
1.379TyrLeu: 1.379 ± 0.443
0.197TyrMet: 0.197 ± 0.188
0.985TyrAsn: 0.985 ± 0.386
1.379TyrPro: 1.379 ± 0.571
0.788TyrGln: 0.788 ± 0.438
2.167TyrArg: 2.167 ± 0.552
1.97TyrSer: 1.97 ± 0.594
0.788TyrThr: 0.788 ± 0.311
1.576TyrVal: 1.576 ± 0.432
0.394TyrTrp: 0.394 ± 0.236
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 25 proteins (5078 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski