Amino acid dipepetide frequency for Streptococcus phage APCM01

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.983AlaAla: 2.983 ± 0.669
0.32AlaCys: 0.32 ± 0.263
5.007AlaAsp: 5.007 ± 0.692
6.606AlaGlu: 6.606 ± 0.983
3.729AlaPhe: 3.729 ± 0.858
4.368AlaGly: 4.368 ± 1.048
0.746AlaHis: 0.746 ± 0.256
7.138AlaIle: 7.138 ± 1.019
9.589AlaLys: 9.589 ± 1.092
7.564AlaLeu: 7.564 ± 1.061
1.811AlaMet: 1.811 ± 0.513
4.155AlaAsn: 4.155 ± 0.778
2.024AlaPro: 2.024 ± 0.509
4.581AlaGln: 4.581 ± 0.85
3.303AlaArg: 3.303 ± 0.529
5.966AlaSer: 5.966 ± 0.742
4.581AlaThr: 4.581 ± 0.579
4.688AlaVal: 4.688 ± 1.078
0.426AlaTrp: 0.426 ± 0.28
3.729AlaTyr: 3.729 ± 0.863
0.0AlaXaa: 0.0 ± 0.0
Cys
0.107CysAla: 0.107 ± 0.089
0.0CysCys: 0.0 ± 0.0
0.639CysAsp: 0.639 ± 0.291
0.533CysGlu: 0.533 ± 0.335
0.213CysPhe: 0.213 ± 0.148
0.426CysGly: 0.426 ± 0.269
0.213CysHis: 0.213 ± 0.194
0.533CysIle: 0.533 ± 0.259
0.639CysLys: 0.639 ± 0.29
0.107CysLeu: 0.107 ± 0.118
0.107CysMet: 0.107 ± 0.105
0.213CysAsn: 0.213 ± 0.144
0.213CysPro: 0.213 ± 0.17
0.213CysGln: 0.213 ± 0.14
0.107CysArg: 0.107 ± 0.142
0.213CysSer: 0.213 ± 0.168
0.107CysThr: 0.107 ± 0.103
0.533CysVal: 0.533 ± 0.198
0.213CysTrp: 0.213 ± 0.152
0.426CysTyr: 0.426 ± 0.297
0.0CysXaa: 0.0 ± 0.0
Asp
3.835AspAla: 3.835 ± 0.661
0.746AspCys: 0.746 ± 0.31
7.032AspAsp: 7.032 ± 1.489
4.794AspGlu: 4.794 ± 0.878
4.049AspPhe: 4.049 ± 0.693
4.262AspGly: 4.262 ± 0.626
0.213AspHis: 0.213 ± 0.158
3.942AspIle: 3.942 ± 0.733
5.86AspLys: 5.86 ± 0.79
5.114AspLeu: 5.114 ± 1.218
0.746AspMet: 0.746 ± 0.245
4.688AspAsn: 4.688 ± 0.882
2.237AspPro: 2.237 ± 0.602
1.172AspGln: 1.172 ± 0.349
1.918AspArg: 1.918 ± 0.499
4.794AspSer: 4.794 ± 0.632
3.09AspThr: 3.09 ± 0.708
3.729AspVal: 3.729 ± 0.527
0.746AspTrp: 0.746 ± 0.279
4.155AspTyr: 4.155 ± 0.841
0.0AspXaa: 0.0 ± 0.0
Glu
6.925GluAla: 6.925 ± 0.932
0.852GluCys: 0.852 ± 0.376
2.77GluAsp: 2.77 ± 0.514
4.581GluGlu: 4.581 ± 1.035
2.344GluPhe: 2.344 ± 0.443
2.344GluGly: 2.344 ± 0.584
0.639GluHis: 0.639 ± 0.233
4.794GluIle: 4.794 ± 0.883
5.221GluLys: 5.221 ± 0.831
7.351GluLeu: 7.351 ± 1.306
1.918GluMet: 1.918 ± 0.39
5.007GluAsn: 5.007 ± 0.899
1.172GluPro: 1.172 ± 0.397
2.983GluGln: 2.983 ± 0.53
3.09GluArg: 3.09 ± 0.746
4.049GluSer: 4.049 ± 0.464
3.729GluThr: 3.729 ± 0.532
3.942GluVal: 3.942 ± 0.775
0.746GluTrp: 0.746 ± 0.243
2.344GluTyr: 2.344 ± 0.427
0.0GluXaa: 0.0 ± 0.0
Phe
3.729PheAla: 3.729 ± 0.707
0.107PheCys: 0.107 ± 0.125
3.729PheAsp: 3.729 ± 0.638
3.303PheGlu: 3.303 ± 0.755
1.811PhePhe: 1.811 ± 0.5
2.877PheGly: 2.877 ± 0.652
0.32PheHis: 0.32 ± 0.218
1.598PheIle: 1.598 ± 0.363
4.901PheLys: 4.901 ± 0.843
4.049PheLeu: 4.049 ± 0.831
1.385PheMet: 1.385 ± 0.355
2.557PheAsn: 2.557 ± 0.616
0.533PhePro: 0.533 ± 0.244
0.959PheGln: 0.959 ± 0.351
1.811PheArg: 1.811 ± 0.475
2.664PheSer: 2.664 ± 0.608
2.557PheThr: 2.557 ± 0.527
2.664PheVal: 2.664 ± 0.579
0.426PheTrp: 0.426 ± 0.225
1.811PheTyr: 1.811 ± 0.507
0.0PheXaa: 0.0 ± 0.0
Gly
3.516GlyAla: 3.516 ± 0.758
0.32GlyCys: 0.32 ± 0.21
2.77GlyAsp: 2.77 ± 0.542
1.811GlyGlu: 1.811 ± 0.408
2.77GlyPhe: 2.77 ± 0.488
3.622GlyGly: 3.622 ± 0.808
0.852GlyHis: 0.852 ± 0.261
2.877GlyIle: 2.877 ± 0.713
5.221GlyLys: 5.221 ± 1.094
5.007GlyLeu: 5.007 ± 1.045
1.172GlyMet: 1.172 ± 0.33
3.09GlyAsn: 3.09 ± 0.489
0.213GlyPro: 0.213 ± 0.16
2.664GlyGln: 2.664 ± 0.552
2.131GlyArg: 2.131 ± 0.5
4.901GlySer: 4.901 ± 0.905
4.901GlyThr: 4.901 ± 0.631
3.622GlyVal: 3.622 ± 0.733
0.639GlyTrp: 0.639 ± 0.355
2.664GlyTyr: 2.664 ± 0.645
0.0GlyXaa: 0.0 ± 0.0
His
0.746HisAla: 0.746 ± 0.323
0.0HisCys: 0.0 ± 0.0
0.746HisAsp: 0.746 ± 0.464
0.852HisGlu: 0.852 ± 0.205
0.426HisPhe: 0.426 ± 0.186
1.065HisGly: 1.065 ± 0.346
0.213HisHis: 0.213 ± 0.134
0.639HisIle: 0.639 ± 0.278
0.426HisLys: 0.426 ± 0.184
1.065HisLeu: 1.065 ± 0.371
0.32HisMet: 0.32 ± 0.206
0.639HisAsn: 0.639 ± 0.284
0.213HisPro: 0.213 ± 0.117
0.639HisGln: 0.639 ± 0.22
0.533HisArg: 0.533 ± 0.214
0.746HisSer: 0.746 ± 0.295
0.639HisThr: 0.639 ± 0.297
0.533HisVal: 0.533 ± 0.255
0.213HisTrp: 0.213 ± 0.151
0.533HisTyr: 0.533 ± 0.241
0.0HisXaa: 0.0 ± 0.0
Ile
5.966IleAla: 5.966 ± 0.98
0.213IleCys: 0.213 ± 0.169
4.581IleAsp: 4.581 ± 1.013
5.434IleGlu: 5.434 ± 0.95
2.344IlePhe: 2.344 ± 0.505
3.303IleGly: 3.303 ± 0.557
0.533IleHis: 0.533 ± 0.238
2.983IleIle: 2.983 ± 0.547
7.138IleLys: 7.138 ± 0.986
4.475IleLeu: 4.475 ± 0.714
0.959IleMet: 0.959 ± 0.29
3.942IleAsn: 3.942 ± 0.735
0.959IlePro: 0.959 ± 0.347
2.45IleGln: 2.45 ± 0.572
2.237IleArg: 2.237 ± 0.368
6.073IleSer: 6.073 ± 0.766
3.622IleThr: 3.622 ± 0.721
3.196IleVal: 3.196 ± 0.604
0.746IleTrp: 0.746 ± 0.344
1.918IleTyr: 1.918 ± 0.469
0.0IleXaa: 0.0 ± 0.0
Lys
9.908LysAla: 9.908 ± 1.912
0.213LysCys: 0.213 ± 0.199
5.647LysAsp: 5.647 ± 0.761
6.499LysGlu: 6.499 ± 0.793
2.983LysPhe: 2.983 ± 0.52
4.475LysGly: 4.475 ± 1.197
1.598LysHis: 1.598 ± 0.397
6.392LysIle: 6.392 ± 0.981
9.163LysLys: 9.163 ± 1.004
7.671LysLeu: 7.671 ± 0.8
2.45LysMet: 2.45 ± 0.5
6.712LysAsn: 6.712 ± 0.646
2.45LysPro: 2.45 ± 0.525
4.901LysGln: 4.901 ± 0.9
3.942LysArg: 3.942 ± 0.929
5.966LysSer: 5.966 ± 0.841
6.819LysThr: 6.819 ± 0.837
3.835LysVal: 3.835 ± 0.505
0.746LysTrp: 0.746 ± 0.342
3.409LysTyr: 3.409 ± 0.705
0.0LysXaa: 0.0 ± 0.0
Leu
6.499LeuAla: 6.499 ± 0.851
0.107LeuCys: 0.107 ± 0.125
6.499LeuAsp: 6.499 ± 0.753
7.138LeuGlu: 7.138 ± 1.346
1.811LeuPhe: 1.811 ± 0.428
4.368LeuGly: 4.368 ± 0.972
0.746LeuHis: 0.746 ± 0.284
5.327LeuIle: 5.327 ± 0.777
9.482LeuLys: 9.482 ± 1.171
6.606LeuLeu: 6.606 ± 0.964
1.278LeuMet: 1.278 ± 0.39
4.688LeuAsn: 4.688 ± 0.805
1.492LeuPro: 1.492 ± 0.326
3.196LeuGln: 3.196 ± 0.469
3.303LeuArg: 3.303 ± 0.59
5.966LeuSer: 5.966 ± 0.758
7.245LeuThr: 7.245 ± 1.034
5.221LeuVal: 5.221 ± 0.759
0.426LeuTrp: 0.426 ± 0.215
2.557LeuTyr: 2.557 ± 0.584
0.0LeuXaa: 0.0 ± 0.0
Met
2.024MetAla: 2.024 ± 0.564
0.213MetCys: 0.213 ± 0.191
1.065MetAsp: 1.065 ± 0.405
0.852MetGlu: 0.852 ± 0.294
0.852MetPhe: 0.852 ± 0.307
0.959MetGly: 0.959 ± 0.283
0.107MetHis: 0.107 ± 0.089
1.385MetIle: 1.385 ± 0.42
1.811MetLys: 1.811 ± 0.379
2.237MetLeu: 2.237 ± 0.536
0.107MetMet: 0.107 ± 0.103
0.959MetAsn: 0.959 ± 0.364
0.746MetPro: 0.746 ± 0.29
1.278MetGln: 1.278 ± 0.421
0.639MetArg: 0.639 ± 0.412
1.065MetSer: 1.065 ± 0.318
1.278MetThr: 1.278 ± 0.327
0.852MetVal: 0.852 ± 0.258
0.32MetTrp: 0.32 ± 0.17
0.959MetTyr: 0.959 ± 0.38
0.0MetXaa: 0.0 ± 0.0
Asn
5.114AsnAla: 5.114 ± 0.721
0.107AsnCys: 0.107 ± 0.103
2.77AsnAsp: 2.77 ± 0.675
2.877AsnGlu: 2.877 ± 0.676
4.368AsnPhe: 4.368 ± 0.704
3.622AsnGly: 3.622 ± 0.734
0.852AsnHis: 0.852 ± 0.36
3.303AsnIle: 3.303 ± 0.744
4.581AsnLys: 4.581 ± 0.688
4.901AsnLeu: 4.901 ± 0.678
0.959AsnMet: 0.959 ± 0.353
2.877AsnAsn: 2.877 ± 0.605
1.811AsnPro: 1.811 ± 0.462
2.77AsnGln: 2.77 ± 0.444
2.557AsnArg: 2.557 ± 0.584
4.262AsnSer: 4.262 ± 0.569
3.942AsnThr: 3.942 ± 0.685
3.835AsnVal: 3.835 ± 0.757
1.065AsnTrp: 1.065 ± 0.32
2.344AsnTyr: 2.344 ± 0.408
0.0AsnXaa: 0.0 ± 0.0
Pro
2.237ProAla: 2.237 ± 0.53
0.0ProCys: 0.0 ± 0.0
1.278ProAsp: 1.278 ± 0.339
1.705ProGlu: 1.705 ± 0.423
1.598ProPhe: 1.598 ± 0.413
0.746ProGly: 0.746 ± 0.284
0.107ProHis: 0.107 ± 0.08
1.065ProIle: 1.065 ± 0.303
2.344ProLys: 2.344 ± 0.541
2.024ProLeu: 2.024 ± 0.488
0.107ProMet: 0.107 ± 0.125
0.852ProAsn: 0.852 ± 0.348
0.32ProPro: 0.32 ± 0.216
1.278ProGln: 1.278 ± 0.384
0.959ProArg: 0.959 ± 0.312
1.385ProSer: 1.385 ± 0.437
1.492ProThr: 1.492 ± 0.423
1.278ProVal: 1.278 ± 0.3
0.32ProTrp: 0.32 ± 0.19
0.852ProTyr: 0.852 ± 0.257
0.0ProXaa: 0.0 ± 0.0
Gln
3.835GlnAla: 3.835 ± 0.791
0.213GlnCys: 0.213 ± 0.153
1.918GlnAsp: 1.918 ± 0.413
3.303GlnGlu: 3.303 ± 0.658
0.959GlnPhe: 0.959 ± 0.323
1.278GlnGly: 1.278 ± 0.364
0.533GlnHis: 0.533 ± 0.243
2.664GlnIle: 2.664 ± 0.618
5.221GlnLys: 5.221 ± 0.95
2.77GlnLeu: 2.77 ± 0.556
0.852GlnMet: 0.852 ± 0.263
3.729GlnAsn: 3.729 ± 0.636
0.746GlnPro: 0.746 ± 0.28
1.918GlnGln: 1.918 ± 0.672
2.344GlnArg: 2.344 ± 0.562
3.942GlnSer: 3.942 ± 0.623
3.196GlnThr: 3.196 ± 0.679
2.237GlnVal: 2.237 ± 0.459
0.32GlnTrp: 0.32 ± 0.182
0.852GlnTyr: 0.852 ± 0.318
0.0GlnXaa: 0.0 ± 0.0
Arg
4.262ArgAla: 4.262 ± 0.727
0.32ArgCys: 0.32 ± 0.203
2.77ArgAsp: 2.77 ± 0.571
3.196ArgGlu: 3.196 ± 0.553
2.237ArgPhe: 2.237 ± 0.497
1.918ArgGly: 1.918 ± 0.384
0.639ArgHis: 0.639 ± 0.298
2.344ArgIle: 2.344 ± 0.557
2.557ArgLys: 2.557 ± 0.633
3.942ArgLeu: 3.942 ± 0.871
1.065ArgMet: 1.065 ± 0.292
2.344ArgAsn: 2.344 ± 0.489
1.385ArgPro: 1.385 ± 0.428
1.278ArgGln: 1.278 ± 0.349
1.811ArgArg: 1.811 ± 0.42
2.344ArgSer: 2.344 ± 0.742
1.705ArgThr: 1.705 ± 0.398
2.557ArgVal: 2.557 ± 0.493
0.426ArgTrp: 0.426 ± 0.181
1.278ArgTyr: 1.278 ± 0.386
0.0ArgXaa: 0.0 ± 0.0
Ser
8.097SerAla: 8.097 ± 1.464
0.426SerCys: 0.426 ± 0.192
6.286SerAsp: 6.286 ± 1.203
4.794SerGlu: 4.794 ± 1.048
3.729SerPhe: 3.729 ± 0.664
5.221SerGly: 5.221 ± 0.96
1.172SerHis: 1.172 ± 0.323
3.303SerIle: 3.303 ± 0.462
6.925SerLys: 6.925 ± 1.191
4.581SerLeu: 4.581 ± 0.644
1.385SerMet: 1.385 ± 0.371
4.049SerAsn: 4.049 ± 0.737
0.852SerPro: 0.852 ± 0.3
2.983SerGln: 2.983 ± 0.527
2.77SerArg: 2.77 ± 0.451
4.475SerSer: 4.475 ± 0.918
4.368SerThr: 4.368 ± 0.662
2.877SerVal: 2.877 ± 0.421
0.32SerTrp: 0.32 ± 0.199
2.344SerTyr: 2.344 ± 0.467
0.0SerXaa: 0.0 ± 0.0
Thr
6.392ThrAla: 6.392 ± 0.743
0.426ThrCys: 0.426 ± 0.227
4.901ThrAsp: 4.901 ± 0.764
3.942ThrGlu: 3.942 ± 0.601
2.983ThrPhe: 2.983 ± 0.625
3.622ThrGly: 3.622 ± 0.677
0.533ThrHis: 0.533 ± 0.239
5.434ThrIle: 5.434 ± 0.956
4.688ThrLys: 4.688 ± 0.628
5.966ThrLeu: 5.966 ± 0.963
1.065ThrMet: 1.065 ± 0.328
3.09ThrAsn: 3.09 ± 0.782
1.811ThrPro: 1.811 ± 0.444
3.303ThrGln: 3.303 ± 0.577
2.557ThrArg: 2.557 ± 0.458
3.729ThrSer: 3.729 ± 0.73
3.622ThrThr: 3.622 ± 0.685
4.794ThrVal: 4.794 ± 0.672
0.426ThrTrp: 0.426 ± 0.185
1.705ThrTyr: 1.705 ± 0.392
0.0ThrXaa: 0.0 ± 0.0
Val
3.835ValAla: 3.835 ± 0.642
0.639ValCys: 0.639 ± 0.304
4.262ValAsp: 4.262 ± 0.965
2.877ValGlu: 2.877 ± 0.634
2.131ValPhe: 2.131 ± 0.696
3.09ValGly: 3.09 ± 0.516
0.426ValHis: 0.426 ± 0.176
3.942ValIle: 3.942 ± 0.654
5.753ValLys: 5.753 ± 0.872
4.794ValLeu: 4.794 ± 0.669
0.959ValMet: 0.959 ± 0.343
3.303ValAsn: 3.303 ± 0.654
1.918ValPro: 1.918 ± 0.381
2.344ValGln: 2.344 ± 0.535
1.492ValArg: 1.492 ± 0.386
4.581ValSer: 4.581 ± 0.597
4.688ValThr: 4.688 ± 0.783
4.262ValVal: 4.262 ± 0.784
0.32ValTrp: 0.32 ± 0.223
2.237ValTyr: 2.237 ± 0.518
0.0ValXaa: 0.0 ± 0.0
Trp
0.533TrpAla: 0.533 ± 0.284
0.107TrpCys: 0.107 ± 0.1
0.107TrpAsp: 0.107 ± 0.08
0.426TrpGlu: 0.426 ± 0.235
0.426TrpPhe: 0.426 ± 0.225
0.639TrpGly: 0.639 ± 0.23
0.0TrpHis: 0.0 ± 0.0
0.959TrpIle: 0.959 ± 0.319
1.172TrpLys: 1.172 ± 0.404
0.746TrpLeu: 0.746 ± 0.259
0.32TrpMet: 0.32 ± 0.153
0.426TrpAsn: 0.426 ± 0.162
0.107TrpPro: 0.107 ± 0.12
0.213TrpGln: 0.213 ± 0.147
0.746TrpArg: 0.746 ± 0.304
0.746TrpSer: 0.746 ± 0.209
1.065TrpThr: 1.065 ± 0.335
0.213TrpVal: 0.213 ± 0.16
0.213TrpTrp: 0.213 ± 0.144
0.32TrpTyr: 0.32 ± 0.146
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.77TyrAla: 2.77 ± 0.455
0.426TyrCys: 0.426 ± 0.231
2.344TyrAsp: 2.344 ± 0.485
1.278TyrGlu: 1.278 ± 0.386
2.024TyrPhe: 2.024 ± 0.456
2.557TyrGly: 2.557 ± 0.571
0.746TyrHis: 0.746 ± 0.242
2.45TyrIle: 2.45 ± 0.577
3.09TyrLys: 3.09 ± 0.431
2.983TyrLeu: 2.983 ± 0.663
0.639TyrMet: 0.639 ± 0.222
1.598TyrAsn: 1.598 ± 0.398
0.746TyrPro: 0.746 ± 0.332
1.705TyrGln: 1.705 ± 0.359
2.237TyrArg: 2.237 ± 0.533
3.09TyrSer: 3.09 ± 0.505
2.344TyrThr: 2.344 ± 0.351
2.983TyrVal: 2.983 ± 0.655
0.426TyrTrp: 0.426 ± 0.263
0.959TyrTyr: 0.959 ± 0.342
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 37 proteins (9387 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski