Amino acid dipepetide frequency for Streptococcus phage Javan226

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.485AlaAla: 4.485 ± 1.374
0.176AlaCys: 0.176 ± 0.134
4.398AlaAsp: 4.398 ± 0.587
4.925AlaGlu: 4.925 ± 0.795
3.606AlaPhe: 3.606 ± 0.962
4.837AlaGly: 4.837 ± 0.927
0.88AlaHis: 0.88 ± 0.277
5.805AlaIle: 5.805 ± 1.252
7.212AlaLys: 7.212 ± 0.707
7.74AlaLeu: 7.74 ± 0.996
2.463AlaMet: 2.463 ± 0.666
4.485AlaAsn: 4.485 ± 1.029
1.495AlaPro: 1.495 ± 0.42
4.398AlaGln: 4.398 ± 0.964
2.99AlaArg: 2.99 ± 0.461
5.541AlaSer: 5.541 ± 0.885
5.189AlaThr: 5.189 ± 0.814
5.629AlaVal: 5.629 ± 1.02
0.616AlaTrp: 0.616 ± 0.233
2.902AlaTyr: 2.902 ± 0.421
0.0AlaXaa: 0.0 ± 0.0
Cys
0.088CysAla: 0.088 ± 0.094
0.088CysCys: 0.088 ± 0.102
0.352CysAsp: 0.352 ± 0.175
0.088CysGlu: 0.088 ± 0.076
0.088CysPhe: 0.088 ± 0.081
0.616CysGly: 0.616 ± 0.35
0.352CysHis: 0.352 ± 0.146
0.264CysIle: 0.264 ± 0.133
0.352CysLys: 0.352 ± 0.163
0.176CysLeu: 0.176 ± 0.117
0.0CysMet: 0.0 ± 0.0
0.352CysAsn: 0.352 ± 0.232
0.088CysPro: 0.088 ± 0.081
0.088CysGln: 0.088 ± 0.102
0.0CysArg: 0.0 ± 0.0
0.176CysSer: 0.176 ± 0.128
0.176CysThr: 0.176 ± 0.133
0.264CysVal: 0.264 ± 0.131
0.088CysTrp: 0.088 ± 0.093
0.44CysTyr: 0.44 ± 0.191
0.0CysXaa: 0.0 ± 0.0
Asp
4.134AspAla: 4.134 ± 0.55
0.44AspCys: 0.44 ± 0.207
4.046AspAsp: 4.046 ± 0.627
4.837AspGlu: 4.837 ± 0.852
3.342AspPhe: 3.342 ± 0.462
4.573AspGly: 4.573 ± 0.634
0.264AspHis: 0.264 ± 0.159
3.87AspIle: 3.87 ± 0.65
4.134AspLys: 4.134 ± 0.795
5.453AspLeu: 5.453 ± 0.694
2.023AspMet: 2.023 ± 0.387
4.573AspAsn: 4.573 ± 0.698
0.88AspPro: 0.88 ± 0.255
0.704AspGln: 0.704 ± 0.207
1.319AspArg: 1.319 ± 0.328
3.87AspSer: 3.87 ± 0.721
3.166AspThr: 3.166 ± 0.528
3.694AspVal: 3.694 ± 0.623
1.143AspTrp: 1.143 ± 0.384
3.342AspTyr: 3.342 ± 0.627
0.0AspXaa: 0.0 ± 0.0
Glu
5.893GluAla: 5.893 ± 0.933
0.176GluCys: 0.176 ± 0.142
4.925GluAsp: 4.925 ± 0.848
5.541GluGlu: 5.541 ± 1.241
3.078GluPhe: 3.078 ± 0.618
2.814GluGly: 2.814 ± 0.431
1.495GluHis: 1.495 ± 0.365
6.42GluIle: 6.42 ± 1.215
7.3GluLys: 7.3 ± 1.432
7.564GluLeu: 7.564 ± 1.15
2.199GluMet: 2.199 ± 0.455
3.694GluAsn: 3.694 ± 0.732
1.847GluPro: 1.847 ± 0.366
3.87GluGln: 3.87 ± 0.746
3.078GluArg: 3.078 ± 0.516
1.583GluSer: 1.583 ± 0.423
3.254GluThr: 3.254 ± 0.698
4.749GluVal: 4.749 ± 0.858
0.967GluTrp: 0.967 ± 0.304
2.726GluTyr: 2.726 ± 0.508
0.0GluXaa: 0.0 ± 0.0
Phe
2.99PheAla: 2.99 ± 0.672
0.352PheCys: 0.352 ± 0.161
2.726PheAsp: 2.726 ± 0.321
2.99PheGlu: 2.99 ± 0.42
1.583PhePhe: 1.583 ± 0.338
3.254PheGly: 3.254 ± 0.659
0.352PheHis: 0.352 ± 0.198
2.199PheIle: 2.199 ± 0.427
4.222PheLys: 4.222 ± 0.63
2.551PheLeu: 2.551 ± 0.467
0.792PheMet: 0.792 ± 0.315
3.342PheAsn: 3.342 ± 0.633
0.792PhePro: 0.792 ± 0.293
1.671PheGln: 1.671 ± 0.45
0.967PheArg: 0.967 ± 0.263
2.375PheSer: 2.375 ± 0.36
2.551PheThr: 2.551 ± 0.399
2.287PheVal: 2.287 ± 0.479
0.176PheTrp: 0.176 ± 0.117
1.143PheTyr: 1.143 ± 0.331
0.0PheXaa: 0.0 ± 0.0
Gly
4.661GlyAla: 4.661 ± 0.931
0.176GlyCys: 0.176 ± 0.161
3.254GlyAsp: 3.254 ± 0.512
3.078GlyGlu: 3.078 ± 0.478
2.726GlyPhe: 2.726 ± 0.465
3.518GlyGly: 3.518 ± 0.653
0.44GlyHis: 0.44 ± 0.199
5.101GlyIle: 5.101 ± 1.041
5.013GlyLys: 5.013 ± 0.543
6.245GlyLeu: 6.245 ± 0.939
2.199GlyMet: 2.199 ± 0.603
4.134GlyAsn: 4.134 ± 0.642
0.088GlyPro: 0.088 ± 0.092
2.375GlyGln: 2.375 ± 0.492
1.759GlyArg: 1.759 ± 0.361
3.694GlySer: 3.694 ± 0.662
4.31GlyThr: 4.31 ± 0.878
4.837GlyVal: 4.837 ± 0.872
0.44GlyTrp: 0.44 ± 0.177
3.87GlyTyr: 3.87 ± 0.835
0.0GlyXaa: 0.0 ± 0.0
His
0.528HisAla: 0.528 ± 0.148
0.176HisCys: 0.176 ± 0.203
0.704HisAsp: 0.704 ± 0.276
1.231HisGlu: 1.231 ± 0.41
0.704HisPhe: 0.704 ± 0.272
0.88HisGly: 0.88 ± 0.337
0.176HisHis: 0.176 ± 0.136
0.176HisIle: 0.176 ± 0.113
1.143HisLys: 1.143 ± 0.361
0.792HisLeu: 0.792 ± 0.236
0.352HisMet: 0.352 ± 0.172
0.528HisAsn: 0.528 ± 0.204
0.264HisPro: 0.264 ± 0.141
0.44HisGln: 0.44 ± 0.222
0.528HisArg: 0.528 ± 0.282
0.88HisSer: 0.88 ± 0.302
0.616HisThr: 0.616 ± 0.193
0.704HisVal: 0.704 ± 0.302
0.176HisTrp: 0.176 ± 0.141
0.616HisTyr: 0.616 ± 0.267
0.0HisXaa: 0.0 ± 0.0
Ile
5.541IleAla: 5.541 ± 0.809
0.44IleCys: 0.44 ± 0.162
5.101IleAsp: 5.101 ± 1.002
6.245IleGlu: 6.245 ± 1.124
1.935IlePhe: 1.935 ± 0.496
3.958IleGly: 3.958 ± 0.934
0.352IleHis: 0.352 ± 0.181
3.958IleIle: 3.958 ± 0.818
6.332IleLys: 6.332 ± 0.789
4.573IleLeu: 4.573 ± 0.609
1.055IleMet: 1.055 ± 0.239
4.925IleAsn: 4.925 ± 0.613
1.847IlePro: 1.847 ± 0.35
2.287IleGln: 2.287 ± 0.416
1.759IleArg: 1.759 ± 0.28
5.717IleSer: 5.717 ± 0.964
5.101IleThr: 5.101 ± 1.045
5.013IleVal: 5.013 ± 0.616
0.704IleTrp: 0.704 ± 0.217
2.639IleTyr: 2.639 ± 0.448
0.0IleXaa: 0.0 ± 0.0
Lys
6.684LysAla: 6.684 ± 0.762
0.616LysCys: 0.616 ± 0.367
4.837LysAsp: 4.837 ± 0.683
7.564LysGlu: 7.564 ± 1.335
2.375LysPhe: 2.375 ± 0.433
4.046LysGly: 4.046 ± 0.538
1.231LysHis: 1.231 ± 0.443
5.453LysIle: 5.453 ± 0.89
6.245LysLys: 6.245 ± 1.113
6.596LysLeu: 6.596 ± 0.966
2.287LysMet: 2.287 ± 0.575
4.573LysAsn: 4.573 ± 0.682
2.726LysPro: 2.726 ± 0.557
4.31LysGln: 4.31 ± 0.673
4.222LysArg: 4.222 ± 0.801
4.661LysSer: 4.661 ± 0.637
5.277LysThr: 5.277 ± 0.88
5.013LysVal: 5.013 ± 0.744
0.704LysTrp: 0.704 ± 0.262
3.606LysTyr: 3.606 ± 0.635
0.0LysXaa: 0.0 ± 0.0
Leu
6.157LeuAla: 6.157 ± 0.634
0.264LeuCys: 0.264 ± 0.163
4.398LeuAsp: 4.398 ± 0.555
5.981LeuGlu: 5.981 ± 1.141
3.078LeuPhe: 3.078 ± 0.52
4.573LeuGly: 4.573 ± 0.823
1.231LeuHis: 1.231 ± 0.338
4.222LeuIle: 4.222 ± 0.682
7.212LeuLys: 7.212 ± 1.038
4.749LeuLeu: 4.749 ± 0.785
2.199LeuMet: 2.199 ± 0.528
5.365LeuAsn: 5.365 ± 0.889
1.847LeuPro: 1.847 ± 0.348
2.463LeuGln: 2.463 ± 0.406
3.166LeuArg: 3.166 ± 0.788
6.948LeuSer: 6.948 ± 0.646
5.541LeuThr: 5.541 ± 0.818
4.661LeuVal: 4.661 ± 0.553
0.792LeuTrp: 0.792 ± 0.247
2.99LeuTyr: 2.99 ± 0.584
0.0LeuXaa: 0.0 ± 0.0
Met
2.814MetAla: 2.814 ± 0.638
0.0MetCys: 0.0 ± 0.0
1.319MetAsp: 1.319 ± 0.311
1.319MetGlu: 1.319 ± 0.459
1.055MetPhe: 1.055 ± 0.386
1.055MetGly: 1.055 ± 0.338
0.44MetHis: 0.44 ± 0.254
1.319MetIle: 1.319 ± 0.366
2.287MetLys: 2.287 ± 0.501
2.023MetLeu: 2.023 ± 0.382
0.44MetMet: 0.44 ± 0.228
1.319MetAsn: 1.319 ± 0.316
0.616MetPro: 0.616 ± 0.233
1.319MetGln: 1.319 ± 0.347
0.792MetArg: 0.792 ± 0.246
2.111MetSer: 2.111 ± 0.558
2.463MetThr: 2.463 ± 0.428
1.143MetVal: 1.143 ± 0.303
0.352MetTrp: 0.352 ± 0.157
0.88MetTyr: 0.88 ± 0.295
0.0MetXaa: 0.0 ± 0.0
Asn
5.013AsnAla: 5.013 ± 0.953
0.176AsnCys: 0.176 ± 0.11
3.606AsnAsp: 3.606 ± 0.597
4.134AsnGlu: 4.134 ± 0.675
2.023AsnPhe: 2.023 ± 0.65
5.629AsnGly: 5.629 ± 1.17
0.528AsnHis: 0.528 ± 0.219
2.902AsnIle: 2.902 ± 0.719
4.925AsnLys: 4.925 ± 0.835
4.222AsnLeu: 4.222 ± 0.787
0.792AsnMet: 0.792 ± 0.249
3.87AsnAsn: 3.87 ± 0.892
2.375AsnPro: 2.375 ± 0.484
2.726AsnGln: 2.726 ± 0.744
2.726AsnArg: 2.726 ± 0.497
3.87AsnSer: 3.87 ± 0.676
2.99AsnThr: 2.99 ± 0.768
4.222AsnVal: 4.222 ± 0.441
0.616AsnTrp: 0.616 ± 0.265
2.639AsnTyr: 2.639 ± 0.603
0.0AsnXaa: 0.0 ± 0.0
Pro
1.847ProAla: 1.847 ± 0.554
0.176ProCys: 0.176 ± 0.146
2.199ProAsp: 2.199 ± 0.417
2.375ProGlu: 2.375 ± 0.582
1.495ProPhe: 1.495 ± 0.334
0.616ProGly: 0.616 ± 0.212
0.264ProHis: 0.264 ± 0.231
1.583ProIle: 1.583 ± 0.374
2.814ProLys: 2.814 ± 0.64
1.495ProLeu: 1.495 ± 0.347
0.792ProMet: 0.792 ± 0.216
2.287ProAsn: 2.287 ± 0.7
0.528ProPro: 0.528 ± 0.215
0.88ProGln: 0.88 ± 0.248
0.528ProArg: 0.528 ± 0.225
1.319ProSer: 1.319 ± 0.24
1.583ProThr: 1.583 ± 0.444
1.231ProVal: 1.231 ± 0.365
0.264ProTrp: 0.264 ± 0.161
1.231ProTyr: 1.231 ± 0.338
0.0ProXaa: 0.0 ± 0.0
Gln
4.046GlnAla: 4.046 ± 0.78
0.088GlnCys: 0.088 ± 0.102
2.023GlnAsp: 2.023 ± 0.405
2.814GlnGlu: 2.814 ± 0.659
1.495GlnPhe: 1.495 ± 0.431
2.551GlnGly: 2.551 ± 0.644
0.176GlnHis: 0.176 ± 0.121
4.398GlnIle: 4.398 ± 0.725
3.078GlnLys: 3.078 ± 0.695
3.43GlnLeu: 3.43 ± 0.451
1.143GlnMet: 1.143 ± 0.311
2.023GlnAsn: 2.023 ± 0.835
1.319GlnPro: 1.319 ± 0.428
1.935GlnGln: 1.935 ± 0.494
1.231GlnArg: 1.231 ± 0.299
2.902GlnSer: 2.902 ± 0.46
3.166GlnThr: 3.166 ± 0.682
3.606GlnVal: 3.606 ± 0.718
0.264GlnTrp: 0.264 ± 0.139
1.583GlnTyr: 1.583 ± 0.368
0.0GlnXaa: 0.0 ± 0.0
Arg
2.902ArgAla: 2.902 ± 0.53
0.088ArgCys: 0.088 ± 0.094
1.407ArgAsp: 1.407 ± 0.382
2.199ArgGlu: 2.199 ± 0.408
1.583ArgPhe: 1.583 ± 0.354
1.143ArgGly: 1.143 ± 0.36
0.44ArgHis: 0.44 ± 0.22
2.287ArgIle: 2.287 ± 0.378
3.958ArgLys: 3.958 ± 0.683
3.43ArgLeu: 3.43 ± 0.647
1.143ArgMet: 1.143 ± 0.454
2.375ArgAsn: 2.375 ± 0.415
0.792ArgPro: 0.792 ± 0.262
1.935ArgGln: 1.935 ± 0.529
1.231ArgArg: 1.231 ± 0.478
2.111ArgSer: 2.111 ± 0.481
0.792ArgThr: 0.792 ± 0.278
2.287ArgVal: 2.287 ± 0.436
0.088ArgTrp: 0.088 ± 0.085
1.759ArgTyr: 1.759 ± 0.468
0.0ArgXaa: 0.0 ± 0.0
Ser
5.541SerAla: 5.541 ± 1.436
0.0SerCys: 0.0 ± 0.0
4.134SerAsp: 4.134 ± 0.675
5.541SerGlu: 5.541 ± 0.705
2.639SerPhe: 2.639 ± 0.545
5.717SerGly: 5.717 ± 0.815
0.967SerHis: 0.967 ± 0.27
4.925SerIle: 4.925 ± 0.752
2.99SerLys: 2.99 ± 0.336
4.398SerLeu: 4.398 ± 0.608
2.375SerMet: 2.375 ± 0.637
3.694SerAsn: 3.694 ± 0.588
1.495SerPro: 1.495 ± 0.315
3.254SerGln: 3.254 ± 0.53
2.463SerArg: 2.463 ± 0.521
6.069SerSer: 6.069 ± 1.594
5.277SerThr: 5.277 ± 1.15
4.222SerVal: 4.222 ± 0.82
0.704SerTrp: 0.704 ± 0.262
2.023SerTyr: 2.023 ± 0.403
0.0SerXaa: 0.0 ± 0.0
Thr
6.508ThrAla: 6.508 ± 1.701
0.264ThrCys: 0.264 ± 0.172
3.254ThrAsp: 3.254 ± 0.643
4.046ThrGlu: 4.046 ± 0.66
2.814ThrPhe: 2.814 ± 0.487
4.222ThrGly: 4.222 ± 0.658
1.055ThrHis: 1.055 ± 0.406
5.101ThrIle: 5.101 ± 0.757
4.661ThrLys: 4.661 ± 0.909
4.485ThrLeu: 4.485 ± 0.719
0.704ThrMet: 0.704 ± 0.245
3.518ThrAsn: 3.518 ± 0.853
2.902ThrPro: 2.902 ± 0.669
3.694ThrGln: 3.694 ± 0.585
1.495ThrArg: 1.495 ± 0.4
5.365ThrSer: 5.365 ± 0.814
5.365ThrThr: 5.365 ± 0.922
4.749ThrVal: 4.749 ± 0.716
0.352ThrTrp: 0.352 ± 0.182
2.023ThrTyr: 2.023 ± 0.427
0.0ThrXaa: 0.0 ± 0.0
Val
6.508ValAla: 6.508 ± 1.224
0.088ValCys: 0.088 ± 0.098
3.694ValAsp: 3.694 ± 0.73
4.661ValGlu: 4.661 ± 1.209
2.023ValPhe: 2.023 ± 0.524
4.31ValGly: 4.31 ± 0.987
0.264ValHis: 0.264 ± 0.158
5.013ValIle: 5.013 ± 0.666
5.189ValLys: 5.189 ± 0.669
4.749ValLeu: 4.749 ± 0.793
0.88ValMet: 0.88 ± 0.363
2.726ValAsn: 2.726 ± 0.512
1.759ValPro: 1.759 ± 0.427
2.551ValGln: 2.551 ± 0.455
2.287ValArg: 2.287 ± 0.42
5.453ValSer: 5.453 ± 0.663
5.629ValThr: 5.629 ± 0.703
4.485ValVal: 4.485 ± 0.687
0.704ValTrp: 0.704 ± 0.277
2.639ValTyr: 2.639 ± 0.592
0.0ValXaa: 0.0 ± 0.0
Trp
0.792TrpAla: 0.792 ± 0.228
0.088TrpCys: 0.088 ± 0.075
0.264TrpAsp: 0.264 ± 0.154
0.616TrpGlu: 0.616 ± 0.285
0.352TrpPhe: 0.352 ± 0.149
0.616TrpGly: 0.616 ± 0.222
0.088TrpHis: 0.088 ± 0.102
0.792TrpIle: 0.792 ± 0.234
1.231TrpLys: 1.231 ± 0.3
0.792TrpLeu: 0.792 ± 0.243
0.264TrpMet: 0.264 ± 0.139
0.264TrpAsn: 0.264 ± 0.169
0.088TrpPro: 0.088 ± 0.088
0.616TrpGln: 0.616 ± 0.226
0.088TrpArg: 0.088 ± 0.081
0.967TrpSer: 0.967 ± 0.305
0.792TrpThr: 0.792 ± 0.263
0.264TrpVal: 0.264 ± 0.144
0.088TrpTrp: 0.088 ± 0.094
0.616TrpTyr: 0.616 ± 0.28
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.639TyrAla: 2.639 ± 0.529
0.264TyrCys: 0.264 ± 0.223
3.166TyrAsp: 3.166 ± 0.575
2.99TyrGlu: 2.99 ± 0.735
1.319TyrPhe: 1.319 ± 0.427
2.99TyrGly: 2.99 ± 0.696
0.616TyrHis: 0.616 ± 0.236
3.694TyrIle: 3.694 ± 0.557
2.814TyrLys: 2.814 ± 0.5
2.375TyrLeu: 2.375 ± 0.538
0.88TyrMet: 0.88 ± 0.335
1.935TyrAsn: 1.935 ± 0.37
1.759TyrPro: 1.759 ± 0.548
1.759TyrGln: 1.759 ± 0.551
1.319TyrArg: 1.319 ± 0.405
2.814TyrSer: 2.814 ± 0.744
3.342TyrThr: 3.342 ± 0.611
2.551TyrVal: 2.551 ± 0.564
0.44TyrTrp: 0.44 ± 0.189
2.199TyrTyr: 2.199 ± 0.517
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 53 proteins (11371 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski