Amino acid dipepetide frequency for Bacillus phage PBP180

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.583AlaAla: 5.583 ± 0.843
0.233AlaCys: 0.233 ± 0.182
3.722AlaAsp: 3.722 ± 0.777
5.117AlaGlu: 5.117 ± 0.888
2.675AlaPhe: 2.675 ± 0.468
5.234AlaGly: 5.234 ± 0.903
1.628AlaHis: 1.628 ± 0.471
3.838AlaIle: 3.838 ± 0.59
5.35AlaLys: 5.35 ± 0.852
5.466AlaLeu: 5.466 ± 0.757
1.047AlaMet: 1.047 ± 0.371
4.187AlaAsn: 4.187 ± 0.887
1.628AlaPro: 1.628 ± 0.496
2.559AlaGln: 2.559 ± 0.61
2.908AlaArg: 2.908 ± 0.601
5.117AlaSer: 5.117 ± 0.978
3.605AlaThr: 3.605 ± 0.951
4.42AlaVal: 4.42 ± 0.838
0.349AlaTrp: 0.349 ± 0.191
1.396AlaTyr: 1.396 ± 0.417
0.0AlaXaa: 0.0 ± 0.0
Cys
0.116CysAla: 0.116 ± 0.092
0.233CysCys: 0.233 ± 0.184
0.233CysAsp: 0.233 ± 0.166
0.349CysGlu: 0.349 ± 0.192
0.116CysPhe: 0.116 ± 0.126
0.116CysGly: 0.116 ± 0.119
0.116CysHis: 0.116 ± 0.117
0.233CysIle: 0.233 ± 0.154
0.233CysLys: 0.233 ± 0.165
0.233CysLeu: 0.233 ± 0.16
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.465CysPro: 0.465 ± 0.214
0.0CysGln: 0.0 ± 0.0
0.349CysArg: 0.349 ± 0.184
0.582CysSer: 0.582 ± 0.338
0.582CysThr: 0.582 ± 0.242
0.116CysVal: 0.116 ± 0.095
0.116CysTrp: 0.116 ± 0.117
0.349CysTyr: 0.349 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
4.187AspAla: 4.187 ± 0.877
0.116AspCys: 0.116 ± 0.115
3.838AspAsp: 3.838 ± 0.712
3.489AspGlu: 3.489 ± 0.779
2.21AspPhe: 2.21 ± 0.559
4.652AspGly: 4.652 ± 0.891
1.512AspHis: 1.512 ± 0.508
4.769AspIle: 4.769 ± 0.829
4.42AspLys: 4.42 ± 0.757
5.001AspLeu: 5.001 ± 0.853
1.279AspMet: 1.279 ± 0.43
1.977AspAsn: 1.977 ± 0.718
1.628AspPro: 1.628 ± 0.394
3.257AspGln: 3.257 ± 0.729
2.791AspArg: 2.791 ± 0.742
3.373AspSer: 3.373 ± 0.585
3.14AspThr: 3.14 ± 0.567
5.466AspVal: 5.466 ± 0.712
0.698AspTrp: 0.698 ± 0.326
1.745AspTyr: 1.745 ± 0.465
0.0AspXaa: 0.0 ± 0.0
Glu
4.42GluAla: 4.42 ± 0.916
0.349GluCys: 0.349 ± 0.192
4.071GluAsp: 4.071 ± 0.699
6.746GluGlu: 6.746 ± 1.611
2.094GluPhe: 2.094 ± 0.477
3.257GluGly: 3.257 ± 0.737
0.814GluHis: 0.814 ± 0.304
4.536GluIle: 4.536 ± 0.829
9.304GluLys: 9.304 ± 1.233
6.746GluLeu: 6.746 ± 1.177
1.396GluMet: 1.396 ± 0.443
3.373GluAsn: 3.373 ± 0.579
1.861GluPro: 1.861 ± 0.53
3.257GluGln: 3.257 ± 0.542
4.303GluArg: 4.303 ± 0.816
3.954GluSer: 3.954 ± 0.64
3.838GluThr: 3.838 ± 0.653
5.001GluVal: 5.001 ± 0.829
1.396GluTrp: 1.396 ± 0.406
3.257GluTyr: 3.257 ± 0.595
0.0GluXaa: 0.0 ± 0.0
Phe
1.512PheAla: 1.512 ± 0.394
0.465PheCys: 0.465 ± 0.22
2.326PheAsp: 2.326 ± 0.623
3.024PheGlu: 3.024 ± 0.521
1.279PhePhe: 1.279 ± 0.452
2.094PheGly: 2.094 ± 0.538
0.698PheHis: 0.698 ± 0.263
1.977PheIle: 1.977 ± 0.608
3.373PheLys: 3.373 ± 0.719
3.024PheLeu: 3.024 ± 0.65
1.047PheMet: 1.047 ± 0.384
2.675PheAsn: 2.675 ± 1.089
1.279PhePro: 1.279 ± 0.46
2.094PheGln: 2.094 ± 0.625
1.861PheArg: 1.861 ± 0.507
2.21PheSer: 2.21 ± 0.382
2.21PheThr: 2.21 ± 0.602
2.094PheVal: 2.094 ± 0.54
0.349PheTrp: 0.349 ± 0.159
2.21PheTyr: 2.21 ± 0.629
0.0PheXaa: 0.0 ± 0.0
Gly
3.605GlyAla: 3.605 ± 0.732
0.0GlyCys: 0.0 ± 0.0
3.954GlyAsp: 3.954 ± 0.73
6.048GlyGlu: 6.048 ± 1.007
2.442GlyPhe: 2.442 ± 0.537
3.257GlyGly: 3.257 ± 0.785
1.047GlyHis: 1.047 ± 0.399
3.605GlyIle: 3.605 ± 0.843
5.001GlyLys: 5.001 ± 0.984
6.513GlyLeu: 6.513 ± 1.171
1.745GlyMet: 1.745 ± 0.592
3.14GlyAsn: 3.14 ± 0.848
1.163GlyPro: 1.163 ± 0.385
3.257GlyGln: 3.257 ± 0.548
4.652GlyArg: 4.652 ± 0.737
4.652GlySer: 4.652 ± 0.642
6.048GlyThr: 6.048 ± 0.879
3.14GlyVal: 3.14 ± 0.684
1.628GlyTrp: 1.628 ± 0.54
1.977GlyTyr: 1.977 ± 0.415
0.0GlyXaa: 0.0 ± 0.0
His
1.512HisAla: 1.512 ± 0.486
0.116HisCys: 0.116 ± 0.117
1.047HisAsp: 1.047 ± 0.455
1.396HisGlu: 1.396 ± 0.422
1.396HisPhe: 1.396 ± 0.428
0.582HisGly: 0.582 ± 0.268
0.465HisHis: 0.465 ± 0.268
1.745HisIle: 1.745 ± 0.44
0.814HisLys: 0.814 ± 0.309
1.396HisLeu: 1.396 ± 0.32
0.698HisMet: 0.698 ± 0.289
0.582HisAsn: 0.582 ± 0.246
0.582HisPro: 0.582 ± 0.273
1.047HisGln: 1.047 ± 0.454
0.349HisArg: 0.349 ± 0.176
1.047HisSer: 1.047 ± 0.283
0.93HisThr: 0.93 ± 0.33
1.047HisVal: 1.047 ± 0.341
0.465HisTrp: 0.465 ± 0.252
0.814HisTyr: 0.814 ± 0.298
0.0HisXaa: 0.0 ± 0.0
Ile
4.187IleAla: 4.187 ± 0.609
0.698IleCys: 0.698 ± 0.262
4.42IleAsp: 4.42 ± 0.642
6.048IleGlu: 6.048 ± 1.307
0.814IlePhe: 0.814 ± 0.246
4.652IleGly: 4.652 ± 0.946
1.396IleHis: 1.396 ± 0.415
3.373IleIle: 3.373 ± 0.775
4.769IleLys: 4.769 ± 0.796
4.303IleLeu: 4.303 ± 0.761
1.279IleMet: 1.279 ± 0.428
3.024IleAsn: 3.024 ± 0.587
3.489IlePro: 3.489 ± 0.586
3.838IleGln: 3.838 ± 0.567
3.605IleArg: 3.605 ± 0.568
3.373IleSer: 3.373 ± 0.465
3.954IleThr: 3.954 ± 0.854
3.024IleVal: 3.024 ± 0.588
1.628IleTrp: 1.628 ± 0.418
2.326IleTyr: 2.326 ± 0.63
0.0IleXaa: 0.0 ± 0.0
Lys
6.629LysAla: 6.629 ± 0.701
0.116LysCys: 0.116 ± 0.133
3.14LysAsp: 3.14 ± 0.792
6.746LysGlu: 6.746 ± 0.953
1.861LysPhe: 1.861 ± 0.479
6.978LysGly: 6.978 ± 1.182
1.163LysHis: 1.163 ± 0.465
4.42LysIle: 4.42 ± 0.673
7.327LysLys: 7.327 ± 0.976
5.815LysLeu: 5.815 ± 0.879
2.442LysMet: 2.442 ± 0.56
4.769LysAsn: 4.769 ± 0.75
1.861LysPro: 1.861 ± 0.46
6.164LysGln: 6.164 ± 0.848
4.652LysArg: 4.652 ± 0.953
3.605LysSer: 3.605 ± 0.676
5.117LysThr: 5.117 ± 0.804
3.838LysVal: 3.838 ± 0.51
2.559LysTrp: 2.559 ± 0.935
1.745LysTyr: 1.745 ± 0.413
0.0LysXaa: 0.0 ± 0.0
Leu
4.303LeuAla: 4.303 ± 0.726
0.116LeuCys: 0.116 ± 0.135
6.281LeuAsp: 6.281 ± 0.995
6.164LeuGlu: 6.164 ± 1.185
3.605LeuPhe: 3.605 ± 0.543
3.722LeuGly: 3.722 ± 1.144
1.163LeuHis: 1.163 ± 0.463
4.303LeuIle: 4.303 ± 0.554
7.211LeuLys: 7.211 ± 1.122
6.746LeuLeu: 6.746 ± 1.036
2.326LeuMet: 2.326 ± 0.558
5.117LeuAsn: 5.117 ± 0.778
3.373LeuPro: 3.373 ± 0.576
5.117LeuGln: 5.117 ± 0.998
3.605LeuArg: 3.605 ± 0.594
6.164LeuSer: 6.164 ± 0.727
6.164LeuThr: 6.164 ± 0.907
3.605LeuVal: 3.605 ± 0.789
0.465LeuTrp: 0.465 ± 0.239
3.024LeuTyr: 3.024 ± 0.652
0.0LeuXaa: 0.0 ± 0.0
Met
2.21MetAla: 2.21 ± 0.59
0.116MetCys: 0.116 ± 0.126
1.396MetAsp: 1.396 ± 0.332
1.628MetGlu: 1.628 ± 0.486
0.698MetPhe: 0.698 ± 0.336
1.279MetGly: 1.279 ± 0.374
0.698MetHis: 0.698 ± 0.314
1.977MetIle: 1.977 ± 0.439
2.326MetLys: 2.326 ± 0.524
2.094MetLeu: 2.094 ± 0.484
1.279MetMet: 1.279 ± 0.367
1.628MetAsn: 1.628 ± 0.338
1.047MetPro: 1.047 ± 0.403
1.279MetGln: 1.279 ± 0.442
1.745MetArg: 1.745 ± 0.558
1.977MetSer: 1.977 ± 0.501
2.559MetThr: 2.559 ± 0.51
1.512MetVal: 1.512 ± 0.345
0.0MetTrp: 0.0 ± 0.0
0.93MetTyr: 0.93 ± 0.387
0.0MetXaa: 0.0 ± 0.0
Asn
3.838AsnAla: 3.838 ± 0.601
0.233AsnCys: 0.233 ± 0.164
3.257AsnAsp: 3.257 ± 0.573
3.14AsnGlu: 3.14 ± 0.742
1.512AsnPhe: 1.512 ± 0.705
5.234AsnGly: 5.234 ± 0.709
0.465AsnHis: 0.465 ± 0.203
3.14AsnIle: 3.14 ± 0.538
3.257AsnLys: 3.257 ± 0.855
2.442AsnLeu: 2.442 ± 0.494
1.628AsnMet: 1.628 ± 0.43
1.977AsnAsn: 1.977 ± 0.479
1.512AsnPro: 1.512 ± 0.493
2.21AsnGln: 2.21 ± 0.531
3.14AsnArg: 3.14 ± 0.658
2.559AsnSer: 2.559 ± 0.639
3.024AsnThr: 3.024 ± 0.682
2.442AsnVal: 2.442 ± 0.502
0.814AsnTrp: 0.814 ± 0.296
1.977AsnTyr: 1.977 ± 0.43
0.0AsnXaa: 0.0 ± 0.0
Pro
2.21ProAla: 2.21 ± 0.581
0.116ProCys: 0.116 ± 0.092
2.094ProAsp: 2.094 ± 0.482
3.024ProGlu: 3.024 ± 0.587
1.279ProPhe: 1.279 ± 0.38
1.861ProGly: 1.861 ± 0.46
0.465ProHis: 0.465 ± 0.303
2.791ProIle: 2.791 ± 0.622
2.326ProLys: 2.326 ± 0.584
2.908ProLeu: 2.908 ± 0.768
0.465ProMet: 0.465 ± 0.217
1.396ProAsn: 1.396 ± 0.325
1.279ProPro: 1.279 ± 0.494
0.814ProGln: 0.814 ± 0.268
1.628ProArg: 1.628 ± 0.405
3.489ProSer: 3.489 ± 0.61
2.326ProThr: 2.326 ± 0.756
2.675ProVal: 2.675 ± 0.479
0.116ProTrp: 0.116 ± 0.092
0.93ProTyr: 0.93 ± 0.288
0.0ProXaa: 0.0 ± 0.0
Gln
4.187GlnAla: 4.187 ± 0.817
0.698GlnCys: 0.698 ± 0.293
3.489GlnAsp: 3.489 ± 0.798
3.024GlnGlu: 3.024 ± 0.511
1.861GlnPhe: 1.861 ± 0.472
3.024GlnGly: 3.024 ± 0.519
1.047GlnHis: 1.047 ± 0.314
2.094GlnIle: 2.094 ± 0.649
3.489GlnLys: 3.489 ± 0.784
4.42GlnLeu: 4.42 ± 0.625
2.675GlnMet: 2.675 ± 0.501
2.326GlnAsn: 2.326 ± 0.525
1.977GlnPro: 1.977 ± 0.394
3.257GlnGln: 3.257 ± 0.709
2.21GlnArg: 2.21 ± 0.495
2.675GlnSer: 2.675 ± 0.655
3.838GlnThr: 3.838 ± 0.723
2.442GlnVal: 2.442 ± 0.49
0.698GlnTrp: 0.698 ± 0.251
1.396GlnTyr: 1.396 ± 0.407
0.0GlnXaa: 0.0 ± 0.0
Arg
2.675ArgAla: 2.675 ± 0.725
0.116ArgCys: 0.116 ± 0.121
1.745ArgAsp: 1.745 ± 0.543
3.257ArgGlu: 3.257 ± 0.614
2.559ArgPhe: 2.559 ± 0.658
3.14ArgGly: 3.14 ± 0.573
1.396ArgHis: 1.396 ± 0.386
3.373ArgIle: 3.373 ± 0.69
3.14ArgLys: 3.14 ± 0.504
5.35ArgLeu: 5.35 ± 0.884
1.861ArgMet: 1.861 ± 0.673
1.512ArgAsn: 1.512 ± 0.344
1.745ArgPro: 1.745 ± 0.4
1.977ArgGln: 1.977 ± 0.458
2.442ArgArg: 2.442 ± 0.607
3.257ArgSer: 3.257 ± 0.638
2.442ArgThr: 2.442 ± 0.51
3.024ArgVal: 3.024 ± 0.497
0.582ArgTrp: 0.582 ± 0.312
1.745ArgTyr: 1.745 ± 0.491
0.0ArgXaa: 0.0 ± 0.0
Ser
3.954SerAla: 3.954 ± 0.8
0.349SerCys: 0.349 ± 0.201
3.605SerAsp: 3.605 ± 0.647
4.303SerGlu: 4.303 ± 0.625
2.326SerPhe: 2.326 ± 0.535
4.885SerGly: 4.885 ± 1.313
1.047SerHis: 1.047 ± 0.413
5.234SerIle: 5.234 ± 1.007
4.071SerLys: 4.071 ± 0.639
5.699SerLeu: 5.699 ± 0.701
2.559SerMet: 2.559 ± 0.463
2.559SerAsn: 2.559 ± 0.712
2.791SerPro: 2.791 ± 0.672
3.14SerGln: 3.14 ± 0.52
2.442SerArg: 2.442 ± 0.487
3.722SerSer: 3.722 ± 0.855
3.14SerThr: 3.14 ± 0.676
4.652SerVal: 4.652 ± 0.807
1.396SerTrp: 1.396 ± 0.405
2.442SerTyr: 2.442 ± 0.684
0.0SerXaa: 0.0 ± 0.0
Thr
5.234ThrAla: 5.234 ± 0.693
0.349ThrCys: 0.349 ± 0.182
3.024ThrAsp: 3.024 ± 0.791
2.908ThrGlu: 2.908 ± 0.717
3.954ThrPhe: 3.954 ± 0.912
6.513ThrGly: 6.513 ± 1.135
0.698ThrHis: 0.698 ± 0.3
3.954ThrIle: 3.954 ± 0.482
4.769ThrLys: 4.769 ± 0.69
4.885ThrLeu: 4.885 ± 0.546
1.279ThrMet: 1.279 ± 0.38
2.094ThrAsn: 2.094 ± 0.426
2.675ThrPro: 2.675 ± 0.502
2.326ThrGln: 2.326 ± 0.546
1.512ThrArg: 1.512 ± 0.396
4.769ThrSer: 4.769 ± 0.908
3.489ThrThr: 3.489 ± 0.536
4.652ThrVal: 4.652 ± 0.689
0.582ThrTrp: 0.582 ± 0.229
2.675ThrTyr: 2.675 ± 0.588
0.0ThrXaa: 0.0 ± 0.0
Val
3.14ValAla: 3.14 ± 0.82
0.0ValCys: 0.0 ± 0.0
3.722ValAsp: 3.722 ± 0.519
4.769ValGlu: 4.769 ± 0.636
2.791ValPhe: 2.791 ± 0.63
3.14ValGly: 3.14 ± 0.54
0.814ValHis: 0.814 ± 0.402
4.769ValIle: 4.769 ± 0.662
6.862ValLys: 6.862 ± 0.841
5.234ValLeu: 5.234 ± 1.095
1.745ValMet: 1.745 ± 0.359
2.559ValAsn: 2.559 ± 0.605
2.791ValPro: 2.791 ± 0.482
3.024ValGln: 3.024 ± 0.616
1.745ValArg: 1.745 ± 0.53
4.769ValSer: 4.769 ± 0.557
3.257ValThr: 3.257 ± 0.712
3.373ValVal: 3.373 ± 0.648
0.349ValTrp: 0.349 ± 0.214
1.512ValTyr: 1.512 ± 0.363
0.0ValXaa: 0.0 ± 0.0
Trp
0.349TrpAla: 0.349 ± 0.165
0.116TrpCys: 0.116 ± 0.117
1.861TrpAsp: 1.861 ± 0.676
0.698TrpGlu: 0.698 ± 0.268
0.465TrpPhe: 0.465 ± 0.262
1.047TrpGly: 1.047 ± 0.527
0.582TrpHis: 0.582 ± 0.317
1.512TrpIle: 1.512 ± 0.464
0.814TrpLys: 0.814 ± 0.283
1.163TrpLeu: 1.163 ± 0.368
0.116TrpMet: 0.116 ± 0.126
1.628TrpAsn: 1.628 ± 0.433
0.349TrpPro: 0.349 ± 0.18
0.349TrpGln: 0.349 ± 0.184
0.116TrpArg: 0.116 ± 0.118
1.047TrpSer: 1.047 ± 0.348
1.047TrpThr: 1.047 ± 0.372
1.047TrpVal: 1.047 ± 0.307
0.93TrpTrp: 0.93 ± 0.552
0.582TrpTyr: 0.582 ± 0.267
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.21TyrAla: 2.21 ± 0.451
0.0TyrCys: 0.0 ± 0.0
2.442TyrAsp: 2.442 ± 0.561
2.094TyrGlu: 2.094 ± 0.628
1.861TyrPhe: 1.861 ± 0.565
2.21TyrGly: 2.21 ± 0.481
0.814TyrHis: 0.814 ± 0.317
2.559TyrIle: 2.559 ± 0.537
2.094TyrLys: 2.094 ± 0.379
3.14TyrLeu: 3.14 ± 0.609
1.396TyrMet: 1.396 ± 0.382
1.279TyrAsn: 1.279 ± 0.375
0.698TyrPro: 0.698 ± 0.329
1.861TyrGln: 1.861 ± 0.487
1.279TyrArg: 1.279 ± 0.341
1.977TyrSer: 1.977 ± 0.496
1.628TyrThr: 1.628 ± 0.423
2.791TyrVal: 2.791 ± 0.501
0.698TyrTrp: 0.698 ± 0.309
1.047TyrTyr: 1.047 ± 0.365
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (8599 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski