Amino acid dipeptide frequency for Arthrobacter phage Copper

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides were distributed uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
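The uniform expectation and the over/under-representation call described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the Proteome-pI implementation: the names `dipeptide_permille` and `classify` and the toy sequences are hypothetical, dipeptides are counted as overlapping pairs within each protein, and frequencies are normalised by the total number of dipeptides (the database may normalise differently).

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids (one-letter codes)

def dipeptide_permille(proteins):
    """Return overlapping dipeptide frequencies in per mille.

    Assumptions: dipeptides are counted within each protein and never
    across protein boundaries; any non-standard letter is mapped to X (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Uniform expectation: 1/400 of all dipeptides, i.e. 0.25% or 2.5 per mille.
EXPECTED_PERMILLE = 1000.0 / len(STANDARD) ** 2

def classify(freq_permille, expected=EXPECTED_PERMILLE):
    """Label a frequency as over- or underrepresented relative to uniform."""
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"

toy_proteins = ["MAAGAV", "MKAAL"]  # hypothetical sequences, not from phage Copper
freqs = dipeptide_permille(toy_proteins)
```

Applied to values from the table, `classify(19.709)` (AlaAla) gives "over" and `classify(1.195)` (AlaCys) gives "under", since the uniform expectation is 2.5‰.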

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
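As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and a percentage. The helper names are hypothetical and chosen only for illustration.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_percent(value_permille):
    """Convert a per mille value to a percentage (divide by 10)."""
    return value_permille / 10.0

ala_ala = 19.709  # AlaAla value from the table, in per mille
print(permille_to_fraction(ala_ala))  # roughly 0.0197, i.e. about 2% of all dipeptides
print(permille_to_percent(ala_ala))
```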

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by the first residue; each gives the dipeptide, its frequency and the estimated error (‰).
Ala
AlaAla: 19.709 ± 2.649
AlaCys: 1.195 ± 0.467
AlaAsp: 6.769 ± 1.036
AlaGlu: 4.38 ± 1.094
AlaPhe: 4.181 ± 0.847
AlaGly: 15.13 ± 1.643
AlaHis: 1.991 ± 0.728
AlaIle: 6.968 ± 1.28
AlaLys: 4.778 ± 1.257
AlaLeu: 10.153 ± 1.181
AlaMet: 3.185 ± 0.683
AlaAsn: 3.783 ± 1.037
AlaPro: 6.172 ± 0.814
AlaGln: 6.57 ± 1.258
AlaArg: 9.158 ± 1.571
AlaSer: 5.773 ± 1.227
AlaThr: 9.556 ± 1.599
AlaVal: 13.14 ± 3.27
AlaTrp: 2.588 ± 0.964
AlaTyr: 3.783 ± 0.752
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.796 ± 0.38
CysCys: 0.0 ± 0.0
CysAsp: 0.199 ± 0.19
CysGlu: 0.597 ± 0.348
CysPhe: 0.0 ± 0.0
CysGly: 0.995 ± 0.613
CysHis: 0.199 ± 0.194
CysIle: 0.199 ± 0.19
CysLys: 0.597 ± 0.388
CysLeu: 0.199 ± 0.21
CysMet: 0.199 ± 0.194
CysAsn: 0.398 ± 0.299
CysPro: 0.796 ± 0.382
CysGln: 0.398 ± 0.286
CysArg: 0.995 ± 0.398
CysSer: 0.597 ± 0.384
CysThr: 0.199 ± 0.224
CysVal: 0.398 ± 0.259
CysTrp: 0.597 ± 0.325
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.764 ± 1.328
AspCys: 0.199 ± 0.19
AspAsp: 3.185 ± 0.921
AspGlu: 1.792 ± 0.592
AspPhe: 2.19 ± 0.526
AspGly: 5.773 ± 0.954
AspHis: 1.195 ± 0.49
AspIle: 3.185 ± 0.868
AspLys: 3.783 ± 0.873
AspLeu: 4.778 ± 0.759
AspMet: 1.991 ± 0.809
AspAsn: 2.19 ± 0.663
AspPro: 4.181 ± 0.916
AspGln: 1.593 ± 0.626
AspArg: 1.792 ± 0.665
AspSer: 3.185 ± 0.782
AspThr: 2.19 ± 0.535
AspVal: 5.176 ± 1.13
AspTrp: 0.398 ± 0.274
AspTyr: 1.195 ± 0.443
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.963 ± 1.299
GluCys: 0.796 ± 0.465
GluAsp: 2.986 ± 0.68
GluGlu: 1.195 ± 0.49
GluPhe: 1.792 ± 0.487
GluGly: 1.991 ± 0.585
GluHis: 0.995 ± 0.404
GluIle: 1.792 ± 0.581
GluLys: 0.398 ± 0.238
GluLeu: 6.172 ± 1.026
GluMet: 0.597 ± 0.265
GluAsn: 0.995 ± 0.38
GluPro: 1.593 ± 0.561
GluGln: 2.986 ± 0.683
GluArg: 3.783 ± 0.727
GluSer: 2.19 ± 0.757
GluThr: 1.792 ± 0.608
GluVal: 3.384 ± 0.954
GluTrp: 1.593 ± 0.741
GluTyr: 2.389 ± 0.745
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.384 ± 0.767
PheCys: 0.0 ± 0.0
PheAsp: 1.593 ± 0.621
PheGlu: 1.991 ± 0.694
PhePhe: 0.796 ± 0.543
PheGly: 2.787 ± 0.823
PheHis: 0.597 ± 0.295
PheIle: 1.394 ± 0.521
PheLys: 1.593 ± 0.513
PheLeu: 0.995 ± 0.354
PheMet: 0.398 ± 0.323
PheAsn: 2.389 ± 0.677
PhePro: 0.597 ± 0.29
PheGln: 0.995 ± 0.456
PheArg: 0.796 ± 0.328
PheSer: 1.792 ± 0.388
PheThr: 2.19 ± 0.704
PheVal: 1.991 ± 0.731
PheTrp: 0.0 ± 0.0
PheTyr: 0.995 ± 0.388
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 11.547 ± 2.377
GlyCys: 0.597 ± 0.316
GlyAsp: 4.977 ± 0.916
GlyGlu: 5.375 ± 1.189
GlyPhe: 1.792 ± 0.459
GlyGly: 6.57 ± 1.472
GlyHis: 1.195 ± 0.595
GlyIle: 4.38 ± 1.093
GlyLys: 3.185 ± 0.621
GlyLeu: 5.773 ± 1.127
GlyMet: 2.19 ± 0.662
GlyAsn: 3.384 ± 0.827
GlyPro: 2.787 ± 0.897
GlyGln: 3.185 ± 0.519
GlyArg: 5.176 ± 1.156
GlySer: 5.176 ± 0.977
GlyThr: 7.167 ± 1.088
GlyVal: 6.371 ± 1.752
GlyTrp: 1.792 ± 0.651
GlyTyr: 1.593 ± 0.592
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.389 ± 0.901
HisCys: 0.199 ± 0.197
HisAsp: 2.19 ± 0.799
HisGlu: 0.796 ± 0.352
HisPhe: 0.199 ± 0.197
HisGly: 1.195 ± 0.675
HisHis: 0.796 ± 0.499
HisIle: 0.398 ± 0.265
HisLys: 0.199 ± 0.19
HisLeu: 1.593 ± 0.645
HisMet: 0.0 ± 0.0
HisAsn: 0.597 ± 0.489
HisPro: 0.995 ± 0.375
HisGln: 0.199 ± 0.197
HisArg: 1.593 ± 0.481
HisSer: 0.995 ± 0.397
HisThr: 0.398 ± 0.213
HisVal: 0.995 ± 0.378
HisTrp: 0.398 ± 0.42
HisTyr: 0.398 ± 0.27
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.977 ± 1.118
IleCys: 0.0 ± 0.0
IleAsp: 3.783 ± 1.54
IleGlu: 3.783 ± 0.576
IlePhe: 0.796 ± 0.237
IleGly: 4.38 ± 0.94
IleHis: 0.597 ± 0.466
IleIle: 2.389 ± 1.3
IleLys: 1.394 ± 0.58
IleLeu: 2.787 ± 0.603
IleMet: 0.199 ± 0.2
IleAsn: 2.588 ± 1.059
IlePro: 1.593 ± 0.548
IleGln: 2.787 ± 0.802
IleArg: 2.787 ± 0.656
IleSer: 1.991 ± 0.456
IleThr: 1.991 ± 0.493
IleVal: 4.778 ± 1.085
IleTrp: 0.398 ± 0.326
IleTyr: 1.792 ± 0.697
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.773 ± 0.9
LysCys: 0.796 ± 0.407
LysAsp: 2.588 ± 0.86
LysGlu: 1.394 ± 0.504
LysPhe: 0.199 ± 0.173
LysGly: 1.394 ± 0.512
LysHis: 0.199 ± 0.163
LysIle: 1.593 ± 0.417
LysLys: 1.195 ± 0.516
LysLeu: 5.773 ± 1.022
LysMet: 0.796 ± 0.462
LysAsn: 0.796 ± 0.35
LysPro: 2.389 ± 0.717
LysGln: 0.995 ± 0.376
LysArg: 1.991 ± 0.614
LysSer: 1.394 ± 0.567
LysThr: 2.787 ± 0.739
LysVal: 2.588 ± 0.733
LysTrp: 0.796 ± 0.369
LysTyr: 1.195 ± 0.396
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.95 ± 1.061
LeuCys: 1.195 ± 0.698
LeuAsp: 4.579 ± 0.727
LeuGlu: 2.389 ± 0.592
LeuPhe: 0.796 ± 0.356
LeuGly: 7.366 ± 1.253
LeuHis: 1.792 ± 0.571
LeuIle: 2.787 ± 0.728
LeuLys: 3.384 ± 0.684
LeuLeu: 5.176 ± 0.972
LeuMet: 1.792 ± 0.714
LeuAsn: 1.991 ± 0.584
LeuPro: 6.769 ± 1.409
LeuGln: 1.394 ± 0.55
LeuArg: 4.579 ± 0.957
LeuSer: 5.973 ± 1.323
LeuThr: 6.172 ± 0.792
LeuVal: 7.764 ± 0.927
LeuTrp: 1.195 ± 0.346
LeuTyr: 1.593 ± 0.618
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.584 ± 0.795
MetCys: 0.0 ± 0.0
MetAsp: 1.792 ± 0.511
MetGlu: 0.995 ± 0.682
MetPhe: 0.995 ± 0.327
MetGly: 2.19 ± 0.76
MetHis: 0.199 ± 0.2
MetIle: 0.597 ± 0.395
MetLys: 0.995 ± 0.458
MetLeu: 3.185 ± 0.923
MetMet: 0.597 ± 0.389
MetAsn: 0.796 ± 0.489
MetPro: 2.19 ± 0.601
MetGln: 0.995 ± 0.587
MetArg: 0.796 ± 0.281
MetSer: 1.593 ± 0.579
MetThr: 2.389 ± 0.914
MetVal: 0.995 ± 0.549
MetTrp: 0.398 ± 0.244
MetTyr: 0.199 ± 0.265
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.778 ± 2.47
AsnCys: 0.199 ± 0.19
AsnAsp: 2.19 ± 0.688
AsnGlu: 1.593 ± 0.393
AsnPhe: 0.597 ± 0.284
AsnGly: 4.38 ± 0.857
AsnHis: 0.995 ± 0.594
AsnIle: 1.593 ± 0.503
AsnLys: 0.995 ± 0.336
AsnLeu: 2.787 ± 0.719
AsnMet: 1.195 ± 0.364
AsnAsn: 1.792 ± 0.895
AsnPro: 2.389 ± 0.662
AsnGln: 0.995 ± 0.371
AsnArg: 2.787 ± 0.823
AsnSer: 1.991 ± 0.721
AsnThr: 1.792 ± 0.537
AsnVal: 1.991 ± 0.691
AsnTrp: 0.398 ± 0.268
AsnTyr: 0.995 ± 0.424
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 9.357 ± 1.427
ProCys: 0.597 ± 0.42
ProAsp: 2.389 ± 0.576
ProGlu: 2.787 ± 0.729
ProPhe: 1.394 ± 0.525
ProGly: 4.38 ± 0.938
ProHis: 0.597 ± 0.345
ProIle: 2.389 ± 0.829
ProLys: 2.588 ± 0.761
ProLeu: 3.384 ± 1.064
ProMet: 1.792 ± 0.456
ProAsn: 1.991 ± 0.58
ProPro: 1.394 ± 0.762
ProGln: 0.995 ± 0.45
ProArg: 3.584 ± 0.738
ProSer: 2.986 ± 0.934
ProThr: 4.579 ± 0.852
ProVal: 3.982 ± 1.074
ProTrp: 1.792 ± 0.477
ProTyr: 1.195 ± 0.388
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.181 ± 0.785
GlnCys: 0.0 ± 0.0
GlnAsp: 1.991 ± 0.671
GlnGlu: 1.593 ± 0.436
GlnPhe: 1.195 ± 0.446
GlnGly: 2.389 ± 0.694
GlnHis: 0.796 ± 0.341
GlnIle: 2.19 ± 0.458
GlnLys: 1.195 ± 0.559
GlnLeu: 3.584 ± 0.788
GlnMet: 0.995 ± 0.464
GlnAsn: 1.394 ± 0.501
GlnPro: 2.787 ± 0.882
GlnGln: 0.995 ± 0.394
GlnArg: 2.588 ± 0.943
GlnSer: 2.986 ± 1.1
GlnThr: 3.185 ± 0.702
GlnVal: 2.389 ± 0.661
GlnTrp: 0.796 ± 0.424
GlnTyr: 0.398 ± 0.315
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.973 ± 1.077
ArgCys: 0.796 ± 0.519
ArgAsp: 2.787 ± 0.625
ArgGlu: 1.593 ± 0.617
ArgPhe: 1.394 ± 0.689
ArgGly: 3.982 ± 0.672
ArgHis: 0.796 ± 0.35
ArgIle: 2.787 ± 0.605
ArgLys: 3.185 ± 0.814
ArgLeu: 4.778 ± 1.022
ArgMet: 1.792 ± 0.597
ArgAsn: 2.389 ± 0.807
ArgPro: 4.38 ± 1.123
ArgGln: 2.19 ± 0.718
ArgArg: 3.384 ± 0.773
ArgSer: 2.986 ± 0.728
ArgThr: 2.986 ± 0.526
ArgVal: 3.584 ± 0.984
ArgTrp: 1.394 ± 0.532
ArgTyr: 2.588 ± 0.714
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.167 ± 1.426
SerCys: 0.995 ± 0.492
SerAsp: 2.588 ± 0.84
SerGlu: 2.19 ± 0.553
SerPhe: 1.991 ± 0.547
SerGly: 5.176 ± 1.09
SerHis: 0.199 ± 0.197
SerIle: 3.982 ± 1.025
SerLys: 1.991 ± 0.593
SerLeu: 3.982 ± 0.961
SerMet: 1.394 ± 0.598
SerAsn: 2.389 ± 0.675
SerPro: 2.389 ± 0.757
SerGln: 2.986 ± 0.714
SerArg: 2.588 ± 0.532
SerSer: 2.19 ± 0.871
SerThr: 4.38 ± 0.811
SerVal: 4.778 ± 1.062
SerTrp: 1.792 ± 0.459
SerTyr: 0.796 ± 0.357
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 10.95 ± 1.473
ThrCys: 0.199 ± 0.194
ThrAsp: 3.185 ± 0.762
ThrGlu: 5.375 ± 1.405
ThrPhe: 2.389 ± 0.562
ThrGly: 4.579 ± 0.787
ThrHis: 0.398 ± 0.394
ThrIle: 2.787 ± 0.819
ThrLys: 2.389 ± 0.624
ThrLeu: 4.181 ± 0.938
ThrMet: 1.991 ± 0.61
ThrAsn: 0.995 ± 0.416
ThrPro: 4.38 ± 0.753
ThrGln: 2.19 ± 0.683
ThrArg: 1.991 ± 0.559
ThrSer: 3.982 ± 1.144
ThrThr: 4.579 ± 1.147
ThrVal: 7.366 ± 1.096
ThrTrp: 1.195 ± 0.473
ThrTyr: 1.991 ± 0.723
ThrXaa: 0.0 ± 0.0
Val
ValAla: 12.343 ± 2.38
ValCys: 0.199 ± 0.19
ValAsp: 5.773 ± 0.819
ValGlu: 5.973 ± 1.25
ValPhe: 2.787 ± 1.182
ValGly: 6.172 ± 1.09
ValHis: 1.593 ± 0.688
ValIle: 2.588 ± 0.709
ValLys: 1.792 ± 0.573
ValLeu: 6.769 ± 0.881
ValMet: 2.986 ± 1.23
ValAsn: 3.584 ± 1.053
ValPro: 3.982 ± 0.794
ValGln: 2.986 ± 0.97
ValArg: 3.185 ± 0.937
ValSer: 5.176 ± 0.824
ValThr: 6.172 ± 0.805
ValVal: 4.977 ± 0.879
ValTrp: 0.597 ± 0.358
ValTyr: 1.394 ± 0.529
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.593 ± 0.572
TrpCys: 0.199 ± 0.197
TrpAsp: 1.195 ± 0.426
TrpGlu: 0.597 ± 0.336
TrpPhe: 1.394 ± 0.453
TrpGly: 0.398 ± 0.252
TrpHis: 0.597 ± 0.34
TrpIle: 0.398 ± 0.223
TrpLys: 0.796 ± 0.304
TrpLeu: 2.19 ± 0.923
TrpMet: 0.398 ± 0.257
TrpAsn: 1.195 ± 0.474
TrpPro: 0.995 ± 0.483
TrpGln: 1.394 ± 0.376
TrpArg: 0.398 ± 0.271
TrpSer: 0.995 ± 0.451
TrpThr: 1.195 ± 0.511
TrpVal: 1.792 ± 0.418
TrpTrp: 0.398 ± 0.246
TrpTyr: 0.597 ± 0.303
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.783 ± 0.975
TyrCys: 0.199 ± 0.19
TyrAsp: 0.995 ± 0.404
TyrGlu: 0.995 ± 0.359
TyrPhe: 0.398 ± 0.238
TyrGly: 2.986 ± 0.758
TyrHis: 0.796 ± 0.416
TyrIle: 1.394 ± 0.63
TyrLys: 0.398 ± 0.246
TyrLeu: 1.394 ± 0.355
TyrMet: 0.796 ± 0.586
TyrAsn: 0.796 ± 0.354
TyrPro: 1.394 ± 0.453
TyrGln: 0.796 ± 0.391
TyrArg: 1.792 ± 0.511
TyrSer: 1.991 ± 0.718
TyrThr: 1.394 ± 0.515
TyrVal: 2.588 ± 0.907
TyrTrp: 0.199 ± 0.19
TyrTyr: 0.199 ± 0.163
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
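
For readers who want to work with the listed values programmatically, the entries can be pulled into a dictionary with a regular expression. This is a minimal sketch: the function name `parse_entries` is hypothetical, and the pattern only assumes that each entry contains text of the form "AlaAla: 19.709 ± 2.649".

```python
import re

# Matches entries such as "AlaAla: 19.709 ± 2.649". The pattern is searched
# rather than anchored, so any text in front of the dipeptide name is ignored.
ENTRY = re.compile(r"([A-Z][a-z]{2})([A-Z][a-z]{2}): ([0-9.]+) ± ([0-9.]+)")

def parse_entries(text):
    """Return {(first, second): (mean_permille, error_permille)}."""
    table = {}
    for first, second, mean, err in ENTRY.findall(text):
        table[(first, second)] = (float(mean), float(err))
    return table

sample = """AlaAla: 19.709 ± 2.649
AlaCys: 1.195 ± 0.467
CysCys: 0.0 ± 0.0"""
parsed = parse_entries(sample)
```

Keying the dictionary by a (first, second) tuple keeps the row and column residues separate, which makes it straightforward to rebuild a 21×21 matrix if one is needed.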
Statistics based on 26 proteins (5024 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
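
Protein-level bootstrapping can be sketched as follows. This is an illustration under assumptions, not the database's actual procedure: the page states only that the error comes from bootstrapping (x100) at the protein level, so the choice of resampling whole proteins with replacement and reporting the standard deviation of the resampled frequencies is a guess. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter

def dipeptide_permille(proteins, dipeptide):
    """Frequency (per mille) of one dipeptide over a list of protein sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Assumption: each resample draws len(proteins) whole proteins with
    replacement, and the spread of the resulting frequencies is the error.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.pstdev(estimates)

toy_proteins = ["MAAGAV", "MKAAL", "MGGSAA", "MTTVLK"]  # hypothetical sequences
mean = dipeptide_permille(toy_proteins, "AA")
err = bootstrap_error(toy_proteins, "AA")
print(f"AlaAla: {mean:.3f} ± {err:.3f}")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the error reflects variation between proteins in the proteome.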

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski