Amino acid dipepetide frequency for Arthrobacter phage Dewayne

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.709AlaAla: 19.709 ± 2.846
1.195AlaCys: 1.195 ± 0.444
6.769AlaAsp: 6.769 ± 1.015
4.38AlaGlu: 4.38 ± 1.029
4.181AlaPhe: 4.181 ± 0.794
15.13AlaGly: 15.13 ± 1.86
1.991AlaHis: 1.991 ± 0.693
6.968AlaIle: 6.968 ± 1.276
4.778AlaLys: 4.778 ± 1.252
10.153AlaLeu: 10.153 ± 1.214
3.185AlaMet: 3.185 ± 0.884
3.783AlaAsn: 3.783 ± 1.065
6.172AlaPro: 6.172 ± 0.898
6.57AlaGln: 6.57 ± 1.149
9.158AlaArg: 9.158 ± 1.464
5.773AlaSer: 5.773 ± 1.244
9.556AlaThr: 9.556 ± 1.387
13.14AlaVal: 13.14 ± 3.143
2.588AlaTrp: 2.588 ± 1.097
3.783AlaTyr: 3.783 ± 0.719
0.0AlaXaa: 0.0 ± 0.0
Cys
0.796CysAla: 0.796 ± 0.43
0.0CysCys: 0.0 ± 0.0
0.199CysAsp: 0.199 ± 0.197
0.597CysGlu: 0.597 ± 0.349
0.0CysPhe: 0.0 ± 0.0
0.995CysGly: 0.995 ± 0.649
0.199CysHis: 0.199 ± 0.23
0.199CysIle: 0.199 ± 0.197
0.597CysLys: 0.597 ± 0.399
0.199CysLeu: 0.199 ± 0.208
0.199CysMet: 0.199 ± 0.23
0.398CysAsn: 0.398 ± 0.302
0.796CysPro: 0.796 ± 0.46
0.398CysGln: 0.398 ± 0.317
0.995CysArg: 0.995 ± 0.393
0.597CysSer: 0.597 ± 0.342
0.199CysThr: 0.199 ± 0.227
0.398CysVal: 0.398 ± 0.256
0.597CysTrp: 0.597 ± 0.323
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
7.764AspAla: 7.764 ± 1.317
0.199AspCys: 0.199 ± 0.197
3.185AspAsp: 3.185 ± 0.994
1.792AspGlu: 1.792 ± 0.575
2.19AspPhe: 2.19 ± 0.488
5.773AspGly: 5.773 ± 1.105
1.195AspHis: 1.195 ± 0.506
3.185AspIle: 3.185 ± 0.882
3.783AspLys: 3.783 ± 0.767
4.579AspLeu: 4.579 ± 0.716
1.991AspMet: 1.991 ± 0.837
2.19AspAsn: 2.19 ± 0.663
4.181AspPro: 4.181 ± 0.858
1.593AspGln: 1.593 ± 0.628
1.792AspArg: 1.792 ± 0.714
3.185AspSer: 3.185 ± 0.82
2.19AspThr: 2.19 ± 0.539
5.176AspVal: 5.176 ± 1.133
0.398AspTrp: 0.398 ± 0.286
1.195AspTyr: 1.195 ± 0.419
0.0AspXaa: 0.0 ± 0.0
Glu
7.963GluAla: 7.963 ± 1.101
0.796GluCys: 0.796 ± 0.459
2.986GluAsp: 2.986 ± 0.713
1.195GluGlu: 1.195 ± 0.454
1.792GluPhe: 1.792 ± 0.495
1.991GluGly: 1.991 ± 0.529
0.995GluHis: 0.995 ± 0.455
1.792GluIle: 1.792 ± 0.531
0.398GluLys: 0.398 ± 0.196
6.172GluLeu: 6.172 ± 0.943
0.597GluMet: 0.597 ± 0.259
0.995GluAsn: 0.995 ± 0.388
1.593GluPro: 1.593 ± 0.59
2.986GluGln: 2.986 ± 0.74
3.783GluArg: 3.783 ± 0.788
2.19GluSer: 2.19 ± 0.768
1.792GluThr: 1.792 ± 0.663
3.384GluVal: 3.384 ± 1.032
1.593GluTrp: 1.593 ± 0.75
2.389GluTyr: 2.389 ± 0.669
0.0GluXaa: 0.0 ± 0.0
Phe
3.384PheAla: 3.384 ± 0.804
0.0PheCys: 0.0 ± 0.0
1.593PheAsp: 1.593 ± 0.562
1.991PheGlu: 1.991 ± 0.651
0.796PhePhe: 0.796 ± 0.54
2.787PheGly: 2.787 ± 0.814
0.597PheHis: 0.597 ± 0.32
1.394PheIle: 1.394 ± 0.531
1.593PheLys: 1.593 ± 0.453
0.995PheLeu: 0.995 ± 0.367
0.398PheMet: 0.398 ± 0.299
2.389PheAsn: 2.389 ± 0.691
0.597PhePro: 0.597 ± 0.31
0.995PheGln: 0.995 ± 0.426
0.796PheArg: 0.796 ± 0.29
1.792PheSer: 1.792 ± 0.418
2.19PheThr: 2.19 ± 0.706
1.991PheVal: 1.991 ± 0.691
0.0PheTrp: 0.0 ± 0.0
0.995PheTyr: 0.995 ± 0.421
0.0PheXaa: 0.0 ± 0.0
Gly
11.547GlyAla: 11.547 ± 2.232
0.597GlyCys: 0.597 ± 0.373
4.977GlyAsp: 4.977 ± 0.907
5.375GlyGlu: 5.375 ± 1.314
1.792GlyPhe: 1.792 ± 0.427
6.57GlyGly: 6.57 ± 1.257
1.195GlyHis: 1.195 ± 0.559
4.38GlyIle: 4.38 ± 1.089
3.185GlyLys: 3.185 ± 0.635
5.973GlyLeu: 5.973 ± 1.153
2.19GlyMet: 2.19 ± 0.559
3.384GlyAsn: 3.384 ± 0.765
2.787GlyPro: 2.787 ± 0.986
3.185GlyGln: 3.185 ± 0.507
5.176GlyArg: 5.176 ± 1.078
5.176GlySer: 5.176 ± 0.824
7.167GlyThr: 7.167 ± 1.003
6.371GlyVal: 6.371 ± 1.934
1.792GlyTrp: 1.792 ± 0.609
1.593GlyTyr: 1.593 ± 0.667
0.0GlyXaa: 0.0 ± 0.0
His
2.389HisAla: 2.389 ± 0.879
0.199HisCys: 0.199 ± 0.219
2.19HisAsp: 2.19 ± 0.8
0.796HisGlu: 0.796 ± 0.364
0.199HisPhe: 0.199 ± 0.18
1.195HisGly: 1.195 ± 0.562
0.796HisHis: 0.796 ± 0.568
0.398HisIle: 0.398 ± 0.309
0.199HisLys: 0.199 ± 0.245
1.593HisLeu: 1.593 ± 0.614
0.0HisMet: 0.0 ± 0.0
0.597HisAsn: 0.597 ± 0.511
0.995HisPro: 0.995 ± 0.434
0.199HisGln: 0.199 ± 0.18
1.593HisArg: 1.593 ± 0.517
0.995HisSer: 0.995 ± 0.391
0.398HisThr: 0.398 ± 0.223
0.995HisVal: 0.995 ± 0.422
0.398HisTrp: 0.398 ± 0.416
0.398HisTyr: 0.398 ± 0.344
0.0HisXaa: 0.0 ± 0.0
Ile
4.977IleAla: 4.977 ± 1.035
0.0IleCys: 0.0 ± 0.0
3.783IleAsp: 3.783 ± 1.457
3.783IleGlu: 3.783 ± 0.516
0.796IlePhe: 0.796 ± 0.252
4.38IleGly: 4.38 ± 0.816
0.597IleHis: 0.597 ± 0.512
2.389IleIle: 2.389 ± 1.245
1.394IleLys: 1.394 ± 0.584
2.787IleLeu: 2.787 ± 0.564
0.199IleMet: 0.199 ± 0.19
2.588IleAsn: 2.588 ± 0.945
1.593IlePro: 1.593 ± 0.602
2.787IleGln: 2.787 ± 0.833
2.787IleArg: 2.787 ± 0.651
1.991IleSer: 1.991 ± 0.435
1.991IleThr: 1.991 ± 0.485
4.778IleVal: 4.778 ± 1.013
0.398IleTrp: 0.398 ± 0.319
1.792IleTyr: 1.792 ± 0.657
0.0IleXaa: 0.0 ± 0.0
Lys
5.773LysAla: 5.773 ± 0.829
0.796LysCys: 0.796 ± 0.389
2.588LysAsp: 2.588 ± 0.759
1.394LysGlu: 1.394 ± 0.502
0.199LysPhe: 0.199 ± 0.168
1.394LysGly: 1.394 ± 0.6
0.199LysHis: 0.199 ± 0.159
1.593LysIle: 1.593 ± 0.433
1.195LysLys: 1.195 ± 0.494
5.773LysLeu: 5.773 ± 1.068
0.796LysMet: 0.796 ± 0.485
0.796LysAsn: 0.796 ± 0.352
2.389LysPro: 2.389 ± 0.632
0.995LysGln: 0.995 ± 0.35
1.991LysArg: 1.991 ± 0.717
1.394LysSer: 1.394 ± 0.58
2.787LysThr: 2.787 ± 0.725
2.588LysVal: 2.588 ± 0.759
0.796LysTrp: 0.796 ± 0.375
1.195LysTyr: 1.195 ± 0.433
0.0LysXaa: 0.0 ± 0.0
Leu
10.95LeuAla: 10.95 ± 1.084
1.195LeuCys: 1.195 ± 0.728
4.579LeuAsp: 4.579 ± 0.721
2.389LeuGlu: 2.389 ± 0.573
0.796LeuPhe: 0.796 ± 0.376
7.366LeuGly: 7.366 ± 1.25
1.792LeuHis: 1.792 ± 0.614
2.787LeuIle: 2.787 ± 0.766
3.384LeuLys: 3.384 ± 0.771
5.176LeuLeu: 5.176 ± 1.139
1.792LeuMet: 1.792 ± 0.679
1.991LeuAsn: 1.991 ± 0.632
6.769LeuPro: 6.769 ± 1.447
1.394LeuGln: 1.394 ± 0.467
4.579LeuArg: 4.579 ± 0.865
5.973LeuSer: 5.973 ± 1.327
6.172LeuThr: 6.172 ± 0.816
7.764LeuVal: 7.764 ± 0.869
1.195LeuTrp: 1.195 ± 0.377
1.593LeuTyr: 1.593 ± 0.626
0.0LeuXaa: 0.0 ± 0.0
Met
3.584MetAla: 3.584 ± 0.902
0.0MetCys: 0.0 ± 0.0
1.792MetAsp: 1.792 ± 0.528
0.995MetGlu: 0.995 ± 0.482
0.995MetPhe: 0.995 ± 0.317
2.19MetGly: 2.19 ± 0.705
0.199MetHis: 0.199 ± 0.214
0.597MetIle: 0.597 ± 0.405
0.995MetLys: 0.995 ± 0.454
3.185MetLeu: 3.185 ± 0.807
0.597MetMet: 0.597 ± 0.344
0.796MetAsn: 0.796 ± 0.479
2.19MetPro: 2.19 ± 0.763
0.995MetGln: 0.995 ± 0.55
0.796MetArg: 0.796 ± 0.313
1.593MetSer: 1.593 ± 0.593
2.389MetThr: 2.389 ± 0.847
1.195MetVal: 1.195 ± 0.55
0.398MetTrp: 0.398 ± 0.267
0.199MetTyr: 0.199 ± 0.233
0.0MetXaa: 0.0 ± 0.0
Asn
4.778AsnAla: 4.778 ± 2.325
0.199AsnCys: 0.199 ± 0.197
2.19AsnAsp: 2.19 ± 0.631
1.593AsnGlu: 1.593 ± 0.487
0.597AsnPhe: 0.597 ± 0.281
4.38AsnGly: 4.38 ± 0.804
0.995AsnHis: 0.995 ± 0.594
1.593AsnIle: 1.593 ± 0.519
0.995AsnLys: 0.995 ± 0.289
2.787AsnLeu: 2.787 ± 0.742
1.195AsnMet: 1.195 ± 0.375
1.792AsnAsn: 1.792 ± 0.8
2.389AsnPro: 2.389 ± 0.766
0.995AsnGln: 0.995 ± 0.43
2.787AsnArg: 2.787 ± 0.815
1.991AsnSer: 1.991 ± 0.667
1.792AsnThr: 1.792 ± 0.518
1.991AsnVal: 1.991 ± 0.717
0.398AsnTrp: 0.398 ± 0.26
0.995AsnTyr: 0.995 ± 0.398
0.0AsnXaa: 0.0 ± 0.0
Pro
9.357ProAla: 9.357 ± 1.611
0.597ProCys: 0.597 ± 0.48
2.389ProAsp: 2.389 ± 0.624
2.787ProGlu: 2.787 ± 0.779
1.394ProPhe: 1.394 ± 0.552
4.38ProGly: 4.38 ± 0.931
0.597ProHis: 0.597 ± 0.333
2.389ProIle: 2.389 ± 0.727
2.588ProLys: 2.588 ± 0.668
3.384ProLeu: 3.384 ± 0.891
1.792ProMet: 1.792 ± 0.518
1.991ProAsn: 1.991 ± 0.61
1.394ProPro: 1.394 ± 0.726
0.995ProGln: 0.995 ± 0.511
3.584ProArg: 3.584 ± 0.793
2.986ProSer: 2.986 ± 0.905
4.579ProThr: 4.579 ± 0.957
3.982ProVal: 3.982 ± 1.134
1.792ProTrp: 1.792 ± 0.418
1.195ProTyr: 1.195 ± 0.458
0.0ProXaa: 0.0 ± 0.0
Gln
4.181GlnAla: 4.181 ± 0.822
0.0GlnCys: 0.0 ± 0.0
1.991GlnAsp: 1.991 ± 0.718
1.593GlnGlu: 1.593 ± 0.448
1.195GlnPhe: 1.195 ± 0.466
2.389GlnGly: 2.389 ± 0.688
0.796GlnHis: 0.796 ± 0.274
2.19GlnIle: 2.19 ± 0.497
1.195GlnLys: 1.195 ± 0.699
3.584GlnLeu: 3.584 ± 0.824
0.995GlnMet: 0.995 ± 0.423
1.394GlnAsn: 1.394 ± 0.357
2.787GlnPro: 2.787 ± 0.846
0.995GlnGln: 0.995 ± 0.346
2.588GlnArg: 2.588 ± 0.973
2.986GlnSer: 2.986 ± 0.923
3.185GlnThr: 3.185 ± 0.594
2.389GlnVal: 2.389 ± 0.606
0.796GlnTrp: 0.796 ± 0.362
0.398GlnTyr: 0.398 ± 0.339
0.0GlnXaa: 0.0 ± 0.0
Arg
5.973ArgAla: 5.973 ± 0.973
0.796ArgCys: 0.796 ± 0.429
2.787ArgAsp: 2.787 ± 0.655
1.593ArgGlu: 1.593 ± 0.622
1.394ArgPhe: 1.394 ± 0.651
3.982ArgGly: 3.982 ± 0.766
0.796ArgHis: 0.796 ± 0.374
2.787ArgIle: 2.787 ± 0.58
3.185ArgLys: 3.185 ± 0.867
4.778ArgLeu: 4.778 ± 1.095
1.792ArgMet: 1.792 ± 0.636
2.389ArgAsn: 2.389 ± 0.727
4.38ArgPro: 4.38 ± 1.09
2.19ArgGln: 2.19 ± 0.717
3.384ArgArg: 3.384 ± 0.784
2.986ArgSer: 2.986 ± 0.774
2.986ArgThr: 2.986 ± 0.581
3.384ArgVal: 3.384 ± 1.04
1.394ArgTrp: 1.394 ± 0.584
2.588ArgTyr: 2.588 ± 0.665
0.0ArgXaa: 0.0 ± 0.0
Ser
7.167SerAla: 7.167 ± 1.433
0.995SerCys: 0.995 ± 0.484
2.588SerAsp: 2.588 ± 0.833
2.19SerGlu: 2.19 ± 0.584
1.991SerPhe: 1.991 ± 0.521
5.176SerGly: 5.176 ± 1.08
0.199SerHis: 0.199 ± 0.219
3.982SerIle: 3.982 ± 1.021
1.991SerLys: 1.991 ± 0.548
3.982SerLeu: 3.982 ± 0.949
1.394SerMet: 1.394 ± 0.681
2.389SerAsn: 2.389 ± 0.74
2.389SerPro: 2.389 ± 0.778
2.986SerGln: 2.986 ± 0.78
2.588SerArg: 2.588 ± 0.621
2.19SerSer: 2.19 ± 0.814
4.38SerThr: 4.38 ± 0.849
4.778SerVal: 4.778 ± 1.136
1.792SerTrp: 1.792 ± 0.415
0.796SerTyr: 0.796 ± 0.298
0.0SerXaa: 0.0 ± 0.0
Thr
10.95ThrAla: 10.95 ± 1.451
0.199ThrCys: 0.199 ± 0.23
3.185ThrAsp: 3.185 ± 0.825
5.375ThrGlu: 5.375 ± 1.274
2.389ThrPhe: 2.389 ± 0.631
4.579ThrGly: 4.579 ± 0.782
0.398ThrHis: 0.398 ± 0.437
2.787ThrIle: 2.787 ± 0.918
2.389ThrLys: 2.389 ± 0.615
4.181ThrLeu: 4.181 ± 0.963
1.991ThrMet: 1.991 ± 0.549
0.995ThrAsn: 0.995 ± 0.415
4.38ThrPro: 4.38 ± 0.797
2.19ThrGln: 2.19 ± 0.699
1.991ThrArg: 1.991 ± 0.578
3.982ThrSer: 3.982 ± 1.117
4.579ThrThr: 4.579 ± 1.083
7.366ThrVal: 7.366 ± 1.167
1.195ThrTrp: 1.195 ± 0.501
1.991ThrTyr: 1.991 ± 0.723
0.0ThrXaa: 0.0 ± 0.0
Val
12.343ValAla: 12.343 ± 1.857
0.199ValCys: 0.199 ± 0.197
5.574ValAsp: 5.574 ± 0.838
5.973ValGlu: 5.973 ± 1.366
2.787ValPhe: 2.787 ± 1.068
6.371ValGly: 6.371 ± 1.078
1.593ValHis: 1.593 ± 0.667
2.588ValIle: 2.588 ± 0.698
1.792ValLys: 1.792 ± 0.489
6.769ValLeu: 6.769 ± 0.956
3.185ValMet: 3.185 ± 0.888
3.584ValAsn: 3.584 ± 1.041
3.982ValPro: 3.982 ± 0.801
2.986ValGln: 2.986 ± 1.047
2.986ValArg: 2.986 ± 0.715
5.176ValSer: 5.176 ± 0.799
6.172ValThr: 6.172 ± 0.975
4.977ValVal: 4.977 ± 0.823
0.597ValTrp: 0.597 ± 0.38
1.394ValTyr: 1.394 ± 0.545
0.0ValXaa: 0.0 ± 0.0
Trp
1.593TrpAla: 1.593 ± 0.598
0.199TrpCys: 0.199 ± 0.18
1.195TrpAsp: 1.195 ± 0.429
0.597TrpGlu: 0.597 ± 0.284
1.394TrpPhe: 1.394 ± 0.45
0.398TrpGly: 0.398 ± 0.275
0.597TrpHis: 0.597 ± 0.397
0.398TrpIle: 0.398 ± 0.207
0.796TrpLys: 0.796 ± 0.325
2.19TrpLeu: 2.19 ± 0.879
0.398TrpMet: 0.398 ± 0.247
1.195TrpAsn: 1.195 ± 0.495
0.995TrpPro: 0.995 ± 0.417
1.394TrpGln: 1.394 ± 0.372
0.398TrpArg: 0.398 ± 0.267
0.995TrpSer: 0.995 ± 0.506
1.195TrpThr: 1.195 ± 0.489
1.792TrpVal: 1.792 ± 0.496
0.398TrpTrp: 0.398 ± 0.245
0.597TrpTyr: 0.597 ± 0.342
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.783TyrAla: 3.783 ± 0.777
0.199TyrCys: 0.199 ± 0.197
0.995TyrAsp: 0.995 ± 0.36
0.995TyrGlu: 0.995 ± 0.354
0.398TyrPhe: 0.398 ± 0.196
2.986TyrGly: 2.986 ± 0.834
0.796TyrHis: 0.796 ± 0.366
1.394TyrIle: 1.394 ± 0.623
0.398TyrLys: 0.398 ± 0.258
1.394TyrLeu: 1.394 ± 0.395
0.796TyrMet: 0.796 ± 0.533
0.796TyrAsn: 0.796 ± 0.279
1.394TyrPro: 1.394 ± 0.477
0.796TyrGln: 0.796 ± 0.387
1.792TyrArg: 1.792 ± 0.554
1.991TyrSer: 1.991 ± 0.639
1.394TyrThr: 1.394 ± 0.5
2.588TyrVal: 2.588 ± 0.932
0.199TyrTrp: 0.199 ± 0.245
0.199TyrTyr: 0.199 ± 0.159
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 26 proteins (5024 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski