Amino acid dipepetide frequency for Lactococcus phage AV09

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.152AlaAla: 1.152 ± 0.495
0.23AlaCys: 0.23 ± 0.171
3.572AlaAsp: 3.572 ± 0.717
5.415AlaGlu: 5.415 ± 0.973
3.802AlaPhe: 3.802 ± 0.991
4.378AlaGly: 4.378 ± 0.752
0.691AlaHis: 0.691 ± 0.316
4.954AlaIle: 4.954 ± 1.189
5.876AlaLys: 5.876 ± 0.925
5.761AlaLeu: 5.761 ± 0.954
2.996AlaMet: 2.996 ± 0.625
4.609AlaAsn: 4.609 ± 0.945
0.576AlaPro: 0.576 ± 0.238
2.535AlaGln: 2.535 ± 0.566
2.42AlaArg: 2.42 ± 0.535
3.226AlaSer: 3.226 ± 0.96
4.033AlaThr: 4.033 ± 0.948
3.918AlaVal: 3.918 ± 0.951
1.728AlaTrp: 1.728 ± 0.599
1.844AlaTyr: 1.844 ± 0.439
0.0AlaXaa: 0.0 ± 0.0
Cys
0.346CysAla: 0.346 ± 0.197
0.115CysCys: 0.115 ± 0.121
0.23CysAsp: 0.23 ± 0.17
0.23CysGlu: 0.23 ± 0.138
0.23CysPhe: 0.23 ± 0.242
0.922CysGly: 0.922 ± 0.383
0.115CysHis: 0.115 ± 0.135
0.461CysIle: 0.461 ± 0.223
0.461CysLys: 0.461 ± 0.253
0.23CysLeu: 0.23 ± 0.164
0.23CysMet: 0.23 ± 0.17
0.576CysAsn: 0.576 ± 0.298
0.115CysPro: 0.115 ± 0.128
0.461CysGln: 0.461 ± 0.302
0.461CysArg: 0.461 ± 0.203
0.23CysSer: 0.23 ± 0.148
0.23CysThr: 0.23 ± 0.165
0.461CysVal: 0.461 ± 0.227
0.0CysTrp: 0.0 ± 0.0
0.23CysTyr: 0.23 ± 0.184
0.0CysXaa: 0.0 ± 0.0
Asp
2.074AspAla: 2.074 ± 0.599
0.23AspCys: 0.23 ± 0.157
3.226AspAsp: 3.226 ± 0.729
4.148AspGlu: 4.148 ± 0.779
3.226AspPhe: 3.226 ± 0.684
4.954AspGly: 4.954 ± 0.852
0.576AspHis: 0.576 ± 0.339
3.802AspIle: 3.802 ± 0.63
4.609AspLys: 4.609 ± 0.609
6.107AspLeu: 6.107 ± 0.897
0.922AspMet: 0.922 ± 0.269
4.148AspAsn: 4.148 ± 0.765
1.728AspPro: 1.728 ± 0.406
0.691AspGln: 0.691 ± 0.382
1.613AspArg: 1.613 ± 0.468
2.65AspSer: 2.65 ± 0.636
3.341AspThr: 3.341 ± 0.695
3.226AspVal: 3.226 ± 0.627
0.691AspTrp: 0.691 ± 0.269
3.687AspTyr: 3.687 ± 0.651
0.0AspXaa: 0.0 ± 0.0
Glu
5.07GluAla: 5.07 ± 0.881
0.346GluCys: 0.346 ± 0.193
2.42GluAsp: 2.42 ± 0.587
5.07GluGlu: 5.07 ± 1.004
3.457GluPhe: 3.457 ± 0.466
2.189GluGly: 2.189 ± 0.455
1.383GluHis: 1.383 ± 0.457
6.107GluIle: 6.107 ± 0.893
5.991GluLys: 5.991 ± 1.236
10.37GluLeu: 10.37 ± 1.378
2.42GluMet: 2.42 ± 0.415
4.494GluAsn: 4.494 ± 0.825
1.152GluPro: 1.152 ± 0.409
4.263GluGln: 4.263 ± 0.814
2.996GluArg: 2.996 ± 0.615
4.033GluSer: 4.033 ± 0.617
5.3GluThr: 5.3 ± 0.939
3.802GluVal: 3.802 ± 0.613
0.922GluTrp: 0.922 ± 0.296
2.996GluTyr: 2.996 ± 0.717
0.0GluXaa: 0.0 ± 0.0
Phe
3.111PheAla: 3.111 ± 0.815
0.346PheCys: 0.346 ± 0.222
2.996PheAsp: 2.996 ± 0.489
3.111PheGlu: 3.111 ± 0.662
1.844PhePhe: 1.844 ± 0.614
1.959PheGly: 1.959 ± 0.554
0.0PheHis: 0.0 ± 0.0
3.341PheIle: 3.341 ± 0.71
3.918PheLys: 3.918 ± 0.839
2.304PheLeu: 2.304 ± 0.46
0.691PheMet: 0.691 ± 0.282
2.996PheAsn: 2.996 ± 0.791
1.152PhePro: 1.152 ± 0.476
1.498PheGln: 1.498 ± 0.418
1.498PheArg: 1.498 ± 0.343
3.341PheSer: 3.341 ± 0.822
2.881PheThr: 2.881 ± 0.527
2.535PheVal: 2.535 ± 0.619
0.23PheTrp: 0.23 ± 0.147
1.613PheTyr: 1.613 ± 0.38
0.0PheXaa: 0.0 ± 0.0
Gly
3.802GlyAla: 3.802 ± 1.24
0.461GlyCys: 0.461 ± 0.212
2.881GlyAsp: 2.881 ± 0.639
4.263GlyGlu: 4.263 ± 0.727
2.304GlyPhe: 2.304 ± 0.444
4.378GlyGly: 4.378 ± 1.166
0.461GlyHis: 0.461 ± 0.211
4.148GlyIle: 4.148 ± 1.04
6.452GlyLys: 6.452 ± 0.766
5.761GlyLeu: 5.761 ± 1.097
1.152GlyMet: 1.152 ± 0.446
3.226GlyAsn: 3.226 ± 0.74
0.115GlyPro: 0.115 ± 0.141
1.959GlyGln: 1.959 ± 0.444
1.613GlyArg: 1.613 ± 0.315
4.378GlySer: 4.378 ± 0.856
3.802GlyThr: 3.802 ± 0.855
5.3GlyVal: 5.3 ± 1.182
1.383GlyTrp: 1.383 ± 0.378
2.996GlyTyr: 2.996 ± 0.63
0.0GlyXaa: 0.0 ± 0.0
His
0.922HisAla: 0.922 ± 0.353
0.691HisCys: 0.691 ± 0.384
0.576HisAsp: 0.576 ± 0.331
0.691HisGlu: 0.691 ± 0.256
0.346HisPhe: 0.346 ± 0.204
1.728HisGly: 1.728 ± 0.484
0.0HisHis: 0.0 ± 0.0
0.691HisIle: 0.691 ± 0.295
0.922HisLys: 0.922 ± 0.44
0.807HisLeu: 0.807 ± 0.337
0.0HisMet: 0.0 ± 0.0
1.613HisAsn: 1.613 ± 0.433
0.115HisPro: 0.115 ± 0.096
0.23HisGln: 0.23 ± 0.183
0.346HisArg: 0.346 ± 0.223
0.576HisSer: 0.576 ± 0.322
0.807HisThr: 0.807 ± 0.326
0.922HisVal: 0.922 ± 0.317
0.115HisTrp: 0.115 ± 0.121
0.691HisTyr: 0.691 ± 0.281
0.0HisXaa: 0.0 ± 0.0
Ile
4.839IleAla: 4.839 ± 0.791
0.115IleCys: 0.115 ± 0.114
5.07IleAsp: 5.07 ± 0.639
6.107IleGlu: 6.107 ± 0.891
2.765IlePhe: 2.765 ± 0.559
3.687IleGly: 3.687 ± 1.07
1.152IleHis: 1.152 ± 0.348
4.954IleIle: 4.954 ± 0.598
6.107IleLys: 6.107 ± 0.777
5.531IleLeu: 5.531 ± 0.903
1.267IleMet: 1.267 ± 0.387
5.415IleAsn: 5.415 ± 0.699
1.959IlePro: 1.959 ± 0.437
2.65IleGln: 2.65 ± 0.537
1.844IleArg: 1.844 ± 0.534
4.033IleSer: 4.033 ± 0.795
4.724IleThr: 4.724 ± 0.698
4.839IleVal: 4.839 ± 0.707
0.807IleTrp: 0.807 ± 0.258
2.765IleTyr: 2.765 ± 0.593
0.0IleXaa: 0.0 ± 0.0
Lys
6.683LysAla: 6.683 ± 1.04
0.576LysCys: 0.576 ± 0.338
5.185LysAsp: 5.185 ± 0.717
7.835LysGlu: 7.835 ± 1.51
1.498LysPhe: 1.498 ± 0.342
5.3LysGly: 5.3 ± 0.78
1.728LysHis: 1.728 ± 0.546
6.107LysIle: 6.107 ± 0.965
8.181LysLys: 8.181 ± 1.041
7.835LysLeu: 7.835 ± 0.924
3.111LysMet: 3.111 ± 0.569
5.531LysAsn: 5.531 ± 0.727
1.959LysPro: 1.959 ± 0.476
4.033LysGln: 4.033 ± 0.689
3.687LysArg: 3.687 ± 0.753
5.646LysSer: 5.646 ± 0.857
4.378LysThr: 4.378 ± 0.727
6.568LysVal: 6.568 ± 0.8
1.383LysTrp: 1.383 ± 0.352
3.572LysTyr: 3.572 ± 0.664
0.0LysXaa: 0.0 ± 0.0
Leu
5.415LeuAla: 5.415 ± 0.664
0.23LeuCys: 0.23 ± 0.164
4.724LeuAsp: 4.724 ± 0.641
5.415LeuGlu: 5.415 ± 0.9
3.572LeuPhe: 3.572 ± 0.682
4.724LeuGly: 4.724 ± 0.966
1.267LeuHis: 1.267 ± 0.412
6.452LeuIle: 6.452 ± 0.932
8.872LeuLys: 8.872 ± 0.94
6.452LeuLeu: 6.452 ± 1.029
1.613LeuMet: 1.613 ± 0.42
5.185LeuAsn: 5.185 ± 0.915
3.457LeuPro: 3.457 ± 0.624
3.111LeuGln: 3.111 ± 0.646
3.226LeuArg: 3.226 ± 0.58
4.839LeuSer: 4.839 ± 0.796
6.222LeuThr: 6.222 ± 0.713
5.3LeuVal: 5.3 ± 0.688
1.383LeuTrp: 1.383 ± 0.346
4.609LeuTyr: 4.609 ± 0.776
0.0LeuXaa: 0.0 ± 0.0
Met
1.844MetAla: 1.844 ± 0.484
0.0MetCys: 0.0 ± 0.0
1.383MetAsp: 1.383 ± 0.369
1.728MetGlu: 1.728 ± 0.583
0.461MetPhe: 0.461 ± 0.224
0.691MetGly: 0.691 ± 0.235
0.346MetHis: 0.346 ± 0.226
2.65MetIle: 2.65 ± 0.654
2.535MetLys: 2.535 ± 0.592
1.152MetLeu: 1.152 ± 0.405
0.346MetMet: 0.346 ± 0.224
2.189MetAsn: 2.189 ± 0.505
0.576MetPro: 0.576 ± 0.261
1.844MetGln: 1.844 ± 0.391
0.461MetArg: 0.461 ± 0.282
1.844MetSer: 1.844 ± 0.408
1.959MetThr: 1.959 ± 0.459
0.807MetVal: 0.807 ± 0.257
0.115MetTrp: 0.115 ± 0.111
1.037MetTyr: 1.037 ± 0.384
0.0MetXaa: 0.0 ± 0.0
Asn
5.3AsnAla: 5.3 ± 1.142
0.115AsnCys: 0.115 ± 0.105
4.263AsnAsp: 4.263 ± 0.778
6.107AsnGlu: 6.107 ± 1.094
2.189AsnPhe: 2.189 ± 0.592
6.107AsnGly: 6.107 ± 1.001
0.922AsnHis: 0.922 ± 0.397
4.148AsnIle: 4.148 ± 0.554
6.452AsnLys: 6.452 ± 1.135
5.991AsnLeu: 5.991 ± 0.961
1.383AsnMet: 1.383 ± 0.357
3.687AsnAsn: 3.687 ± 0.79
1.959AsnPro: 1.959 ± 0.506
1.959AsnGln: 1.959 ± 0.517
1.959AsnArg: 1.959 ± 0.394
4.724AsnSer: 4.724 ± 0.574
4.148AsnThr: 4.148 ± 0.875
3.918AsnVal: 3.918 ± 0.636
0.922AsnTrp: 0.922 ± 0.356
2.42AsnTyr: 2.42 ± 0.65
0.0AsnXaa: 0.0 ± 0.0
Pro
1.959ProAla: 1.959 ± 0.472
0.23ProCys: 0.23 ± 0.162
1.613ProAsp: 1.613 ± 0.484
1.613ProGlu: 1.613 ± 0.437
0.922ProPhe: 0.922 ± 0.242
0.461ProGly: 0.461 ± 0.21
0.115ProHis: 0.115 ± 0.122
1.844ProIle: 1.844 ± 0.427
2.304ProLys: 2.304 ± 0.543
1.844ProLeu: 1.844 ± 0.412
0.691ProMet: 0.691 ± 0.246
2.535ProAsn: 2.535 ± 0.743
0.922ProPro: 0.922 ± 0.406
0.691ProGln: 0.691 ± 0.27
0.691ProArg: 0.691 ± 0.231
0.691ProSer: 0.691 ± 0.252
2.304ProThr: 2.304 ± 0.532
1.498ProVal: 1.498 ± 0.396
0.346ProTrp: 0.346 ± 0.169
0.461ProTyr: 0.461 ± 0.234
0.0ProXaa: 0.0 ± 0.0
Gln
2.996GlnAla: 2.996 ± 0.693
0.23GlnCys: 0.23 ± 0.144
2.074GlnAsp: 2.074 ± 0.603
2.535GlnGlu: 2.535 ± 0.543
1.267GlnPhe: 1.267 ± 0.427
2.65GlnGly: 2.65 ± 0.555
0.461GlnHis: 0.461 ± 0.217
1.613GlnIle: 1.613 ± 0.393
3.802GlnLys: 3.802 ± 0.631
2.996GlnLeu: 2.996 ± 0.67
0.807GlnMet: 0.807 ± 0.253
2.535GlnAsn: 2.535 ± 0.445
1.152GlnPro: 1.152 ± 0.294
1.383GlnGln: 1.383 ± 0.427
1.613GlnArg: 1.613 ± 0.508
3.226GlnSer: 3.226 ± 0.467
1.613GlnThr: 1.613 ± 0.383
2.535GlnVal: 2.535 ± 0.511
0.576GlnTrp: 0.576 ± 0.235
1.613GlnTyr: 1.613 ± 0.402
0.0GlnXaa: 0.0 ± 0.0
Arg
2.189ArgAla: 2.189 ± 0.59
0.346ArgCys: 0.346 ± 0.198
1.728ArgAsp: 1.728 ± 0.441
1.844ArgGlu: 1.844 ± 0.428
0.807ArgPhe: 0.807 ± 0.285
1.844ArgGly: 1.844 ± 0.48
1.037ArgHis: 1.037 ± 0.316
2.42ArgIle: 2.42 ± 0.585
4.494ArgLys: 4.494 ± 0.91
3.572ArgLeu: 3.572 ± 0.755
0.461ArgMet: 0.461 ± 0.244
2.304ArgAsn: 2.304 ± 0.589
0.691ArgPro: 0.691 ± 0.28
1.498ArgGln: 1.498 ± 0.408
2.074ArgArg: 2.074 ± 0.576
2.189ArgSer: 2.189 ± 0.495
2.189ArgThr: 2.189 ± 0.499
2.189ArgVal: 2.189 ± 0.48
0.461ArgTrp: 0.461 ± 0.224
1.498ArgTyr: 1.498 ± 0.437
0.0ArgXaa: 0.0 ± 0.0
Ser
4.839SerAla: 4.839 ± 1.25
0.576SerCys: 0.576 ± 0.323
3.341SerAsp: 3.341 ± 0.54
4.033SerGlu: 4.033 ± 0.596
3.111SerPhe: 3.111 ± 0.553
4.494SerGly: 4.494 ± 1.483
0.691SerHis: 0.691 ± 0.32
5.07SerIle: 5.07 ± 0.885
4.609SerLys: 4.609 ± 0.817
6.222SerLeu: 6.222 ± 0.884
1.613SerMet: 1.613 ± 0.351
4.378SerAsn: 4.378 ± 0.845
1.498SerPro: 1.498 ± 0.447
1.844SerGln: 1.844 ± 0.482
2.42SerArg: 2.42 ± 0.555
4.839SerSer: 4.839 ± 0.962
3.226SerThr: 3.226 ± 0.943
4.033SerVal: 4.033 ± 0.74
0.691SerTrp: 0.691 ± 0.326
1.728SerTyr: 1.728 ± 0.479
0.0SerXaa: 0.0 ± 0.0
Thr
4.954ThrAla: 4.954 ± 0.735
0.461ThrCys: 0.461 ± 0.231
3.802ThrAsp: 3.802 ± 0.739
5.415ThrGlu: 5.415 ± 0.655
2.65ThrPhe: 2.65 ± 0.618
4.033ThrGly: 4.033 ± 0.558
0.115ThrHis: 0.115 ± 0.129
4.378ThrIle: 4.378 ± 0.822
5.185ThrLys: 5.185 ± 0.796
4.954ThrLeu: 4.954 ± 0.904
1.152ThrMet: 1.152 ± 0.347
4.609ThrAsn: 4.609 ± 0.641
1.844ThrPro: 1.844 ± 0.367
2.765ThrGln: 2.765 ± 0.617
1.728ThrArg: 1.728 ± 0.449
5.07ThrSer: 5.07 ± 0.649
4.494ThrThr: 4.494 ± 0.556
4.954ThrVal: 4.954 ± 0.908
0.807ThrTrp: 0.807 ± 0.302
2.189ThrTyr: 2.189 ± 0.535
0.0ThrXaa: 0.0 ± 0.0
Val
4.033ValAla: 4.033 ± 0.572
0.23ValCys: 0.23 ± 0.15
3.802ValAsp: 3.802 ± 0.696
5.07ValGlu: 5.07 ± 0.681
3.226ValPhe: 3.226 ± 0.637
2.765ValGly: 2.765 ± 0.419
0.346ValHis: 0.346 ± 0.149
4.378ValIle: 4.378 ± 0.483
5.876ValLys: 5.876 ± 0.852
3.687ValLeu: 3.687 ± 0.665
1.728ValMet: 1.728 ± 0.447
3.802ValAsn: 3.802 ± 0.796
1.267ValPro: 1.267 ± 0.546
1.844ValGln: 1.844 ± 0.439
3.111ValArg: 3.111 ± 0.684
4.954ValSer: 4.954 ± 0.984
6.452ValThr: 6.452 ± 0.953
3.226ValVal: 3.226 ± 0.728
0.461ValTrp: 0.461 ± 0.244
2.881ValTyr: 2.881 ± 0.547
0.0ValXaa: 0.0 ± 0.0
Trp
0.461TrpAla: 0.461 ± 0.201
0.23TrpCys: 0.23 ± 0.163
0.922TrpAsp: 0.922 ± 0.417
0.576TrpGlu: 0.576 ± 0.247
0.922TrpPhe: 0.922 ± 0.395
0.691TrpGly: 0.691 ± 0.329
0.23TrpHis: 0.23 ± 0.158
0.461TrpIle: 0.461 ± 0.229
1.152TrpLys: 1.152 ± 0.402
1.383TrpLeu: 1.383 ± 0.433
0.346TrpMet: 0.346 ± 0.181
1.267TrpAsn: 1.267 ± 0.372
0.0TrpPro: 0.0 ± 0.0
0.922TrpGln: 0.922 ± 0.311
0.461TrpArg: 0.461 ± 0.278
1.152TrpSer: 1.152 ± 0.323
0.576TrpThr: 0.576 ± 0.275
0.691TrpVal: 0.691 ± 0.306
0.115TrpTrp: 0.115 ± 0.102
0.922TrpTyr: 0.922 ± 0.301
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.728TyrAla: 1.728 ± 0.47
0.576TyrCys: 0.576 ± 0.301
1.844TyrAsp: 1.844 ± 0.544
3.572TyrGlu: 3.572 ± 0.786
2.765TyrPhe: 2.765 ± 0.512
2.881TyrGly: 2.881 ± 0.691
1.152TyrHis: 1.152 ± 0.369
2.765TyrIle: 2.765 ± 0.713
2.996TyrLys: 2.996 ± 0.593
3.111TyrLeu: 3.111 ± 0.722
0.922TyrMet: 0.922 ± 0.346
3.572TyrAsn: 3.572 ± 0.629
1.498TyrPro: 1.498 ± 0.45
1.498TyrGln: 1.498 ± 0.538
1.613TyrArg: 1.613 ± 0.418
1.613TyrSer: 1.613 ± 0.468
2.881TyrThr: 2.881 ± 0.687
2.42TyrVal: 2.42 ± 0.624
0.346TyrTrp: 0.346 ± 0.187
1.959TyrTyr: 1.959 ± 0.461
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 51 proteins (8680 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski