Amino acid dipeptide frequency for Lactococcus phage AM8

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
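As a rough illustration of how such frequencies can be computed, the sketch below counts overlapping residue pairs in a list of protein sequences. The function name, the one-letter alphabet, the mapping of any non-standard letter to X (Xaa), and the normalisation by the total number of counted dipeptides are assumptions made for this example; the database may normalise differently.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes
ALPHABET = STANDARD + "X"          # X stands for Xaa


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping windows of length 2, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


freqs = dipeptide_frequencies(["MKELLK", "MKUA"])  # toy sequences, not phage data
print(len(freqs))   # 441 ordered pairs
print(freqs["MK"])  # 2 of 8 dipeptides -> 250.0
```

The result always contains all 441 ordered pairs, so dipeptides that never occur show up with a frequency of 0.0 rather than being absent.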

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one would be present at a frequency of 0.25% (1/400, i.e. 2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue, in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
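The unit conversion and the comparison against the uniform expectation can be sketched as follows. The helper names are hypothetical, and the simple above/below test against 1/400 is an assumption; the exact rule the page uses for its red and blue marking is not stated here.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 0.25% = 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def representation(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille frequency relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Three values taken from the table (per mille).
for name, value in [("LysGlu", 10.134), ("AlaAsp", 2.603), ("AlaAla", 0.028)]:
    print(name, per_mille_to_fraction(value), representation(value))
```

Under this rule LysGlu and AlaAsp come out as overrepresented and AlaAla as underrepresented; a stricter rule could also take the reported error into account.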

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 0.028 ± 0.025
AlaCys: 0.277 ± 0.083
AlaAsp: 2.603 ± 0.281
AlaGlu: 3.129 ± 0.276
AlaPhe: 1.744 ± 0.242
AlaGly: 3.655 ± 0.667
AlaHis: 0.886 ± 0.159
AlaIle: 3.821 ± 0.383
AlaLys: 5.261 ± 0.64
AlaLeu: 4.486 ± 0.564
AlaMet: 1.467 ± 0.23
AlaAsn: 2.824 ± 0.294
AlaPro: 1.024 ± 0.203
AlaGln: 2.547 ± 0.448
AlaArg: 1.689 ± 0.229
AlaSer: 3.212 ± 0.335
AlaThr: 3.24 ± 0.416
AlaVal: 3.267 ± 0.38
AlaTrp: 0.471 ± 0.093
AlaTyr: 2.63 ± 0.233
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.194 ± 0.076
CysCys: 0.111 ± 0.057
CysAsp: 0.443 ± 0.147
CysGlu: 0.554 ± 0.166
CysPhe: 0.277 ± 0.084
CysGly: 0.554 ± 0.18
CysHis: 0.083 ± 0.055
CysIle: 0.554 ± 0.149
CysLys: 0.581 ± 0.132
CysLeu: 0.498 ± 0.132
CysMet: 0.055 ± 0.046
CysAsn: 0.36 ± 0.088
CysPro: 0.388 ± 0.165
CysGln: 0.332 ± 0.109
CysArg: 0.415 ± 0.098
CysSer: 0.775 ± 0.18
CysThr: 0.138 ± 0.063
CysVal: 0.36 ± 0.104
CysTrp: 0.111 ± 0.051
CysTyr: 0.305 ± 0.113
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.492 ± 0.339
AspCys: 0.415 ± 0.13
AspAsp: 4.541 ± 0.388
AspGlu: 6.202 ± 0.474
AspPhe: 4.098 ± 0.343
AspGly: 4.652 ± 0.421
AspHis: 0.498 ± 0.135
AspIle: 6.147 ± 0.422
AspLys: 7.061 ± 0.456
AspLeu: 5.621 ± 0.52
AspMet: 1.8 ± 0.2
AspAsn: 4.79 ± 0.318
AspPro: 0.969 ± 0.14
AspGln: 0.997 ± 0.23
AspArg: 2.104 ± 0.281
AspSer: 4.375 ± 0.412
AspThr: 3.71 ± 0.343
AspVal: 4.458 ± 0.392
AspTrp: 0.72 ± 0.163
AspTyr: 3.073 ± 0.293
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.015 ± 0.427
GluCys: 0.554 ± 0.134
GluAsp: 5.925 ± 0.627
GluGlu: 6.894 ± 0.532
GluPhe: 3.572 ± 0.354
GluGly: 3.129 ± 0.335
GluHis: 1.301 ± 0.252
GluIle: 6.756 ± 0.421
GluLys: 7.448 ± 0.603
GluLeu: 7.78 ± 0.519
GluMet: 2.187 ± 0.258
GluAsn: 5.289 ± 0.375
GluPro: 1.08 ± 0.198
GluGln: 2.852 ± 0.241
GluArg: 2.354 ± 0.243
GluSer: 5.427 ± 0.361
GluThr: 4.209 ± 0.364
GluVal: 4.984 ± 0.357
GluTrp: 0.775 ± 0.14
GluTyr: 4.624 ± 0.462
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.994 ± 0.219
PheCys: 0.388 ± 0.136
PheAsp: 3.849 ± 0.309
PheGlu: 4.181 ± 0.395
PhePhe: 1.301 ± 0.22
PheGly: 2.63 ± 0.261
PheHis: 0.471 ± 0.118
PheIle: 2.99 ± 0.331
PheLys: 4.181 ± 0.285
PheLeu: 3.489 ± 0.404
PheMet: 1.44 ± 0.219
PheAsn: 3.267 ± 0.3
PhePro: 0.941 ± 0.134
PheGln: 1.357 ± 0.172
PheArg: 1.578 ± 0.268
PheSer: 2.88 ± 0.312
PheThr: 3.073 ± 0.302
PheVal: 2.713 ± 0.269
PheTrp: 0.443 ± 0.111
PheTyr: 2.187 ± 0.276
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.464 ± 0.408
GlyCys: 0.305 ± 0.083
GlyAsp: 3.572 ± 0.281
GlyGlu: 4.126 ± 0.326
GlyPhe: 3.295 ± 0.274
GlyGly: 3.572 ± 0.506
GlyHis: 0.941 ± 0.145
GlyIle: 4.818 ± 0.387
GlyLys: 4.818 ± 0.4
GlyLeu: 4.624 ± 0.343
GlyMet: 1.966 ± 0.227
GlyAsn: 4.43 ± 0.411
GlyPro: 0.0 ± 0.0
GlyGln: 1.523 ± 0.196
GlyArg: 2.16 ± 0.283
GlySer: 3.378 ± 0.324
GlyThr: 3.627 ± 0.345
GlyVal: 3.71 ± 0.308
GlyTrp: 0.803 ± 0.203
GlyTyr: 3.046 ± 0.284
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.554 ± 0.111
HisCys: 0.138 ± 0.063
HisAsp: 1.191 ± 0.166
HisGlu: 0.997 ± 0.175
HisPhe: 0.831 ± 0.178
HisGly: 1.052 ± 0.206
HisHis: 0.36 ± 0.109
HisIle: 1.301 ± 0.228
HisLys: 1.384 ± 0.222
HisLeu: 0.997 ± 0.188
HisMet: 0.249 ± 0.091
HisAsn: 1.135 ± 0.238
HisPro: 0.415 ± 0.094
HisGln: 0.498 ± 0.113
HisArg: 0.526 ± 0.108
HisSer: 1.024 ± 0.167
HisThr: 0.886 ± 0.123
HisVal: 0.581 ± 0.146
HisTrp: 0.194 ± 0.067
HisTyr: 0.969 ± 0.231
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.959 ± 0.374
IleCys: 0.581 ± 0.142
IleAsp: 5.815 ± 0.431
IleGlu: 6.479 ± 0.517
IlePhe: 3.184 ± 0.405
IleGly: 3.516 ± 0.304
IleHis: 1.163 ± 0.189
IleIle: 5.372 ± 0.588
IleLys: 8.307 ± 0.419
IleLeu: 5.898 ± 0.493
IleMet: 1.301 ± 0.159
IleAsn: 5.427 ± 0.435
IlePro: 2.409 ± 0.251
IleGln: 2.492 ± 0.348
IleArg: 2.464 ± 0.321
IleSer: 5.261 ± 0.413
IleThr: 4.402 ± 0.455
IleVal: 5.122 ± 0.463
IleTrp: 0.692 ± 0.129
IleTyr: 2.852 ± 0.341
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.901 ± 0.504
LysCys: 0.581 ± 0.152
LysAsp: 6.673 ± 0.442
LysGlu: 10.134 ± 0.615
LysPhe: 3.433 ± 0.394
LysGly: 4.901 ± 0.381
LysHis: 1.827 ± 0.264
LysIle: 6.673 ± 0.501
LysLys: 7.808 ± 0.521
LysLeu: 8.085 ± 0.476
LysMet: 3.073 ± 0.25
LysAsn: 5.981 ± 0.352
LysPro: 1.966 ± 0.23
LysGln: 3.073 ± 0.37
LysArg: 3.406 ± 0.31
LysSer: 4.79 ± 0.425
LysThr: 5.538 ± 0.453
LysVal: 5.178 ± 0.327
LysTrp: 1.024 ± 0.16
LysTyr: 3.932 ± 0.354
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.679 ± 0.398
LeuCys: 0.471 ± 0.119
LeuAsp: 6.341 ± 0.426
LeuGlu: 6.147 ± 0.469
LeuPhe: 3.766 ± 0.459
LeuGly: 4.762 ± 0.486
LeuHis: 1.135 ± 0.227
LeuIle: 5.704 ± 0.457
LeuLys: 7.476 ± 0.545
LeuLeu: 6.202 ± 0.491
LeuMet: 1.966 ± 0.238
LeuAsn: 5.067 ± 0.376
LeuPro: 2.132 ± 0.233
LeuGln: 2.686 ± 0.383
LeuArg: 2.769 ± 0.311
LeuSer: 6.451 ± 0.532
LeuThr: 5.067 ± 0.386
LeuVal: 4.652 ± 0.383
LeuTrp: 0.581 ± 0.126
LeuTyr: 3.156 ± 0.357
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.8 ± 0.224
MetCys: 0.083 ± 0.05
MetAsp: 1.551 ± 0.221
MetGlu: 1.938 ± 0.248
MetPhe: 1.191 ± 0.177
MetGly: 1.357 ± 0.2
MetHis: 0.222 ± 0.071
MetIle: 1.8 ± 0.211
MetLys: 2.547 ± 0.281
MetLeu: 1.911 ± 0.241
MetMet: 0.775 ± 0.124
MetAsn: 1.966 ± 0.213
MetPro: 0.637 ± 0.132
MetGln: 1.218 ± 0.315
MetArg: 0.748 ± 0.159
MetSer: 1.938 ± 0.259
MetThr: 2.021 ± 0.204
MetVal: 1.551 ± 0.2
MetTrp: 0.277 ± 0.085
MetTyr: 0.997 ± 0.179
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.156 ± 0.386
AsnCys: 0.388 ± 0.118
AsnAsp: 3.267 ± 0.265
AsnGlu: 5.067 ± 0.423
AsnPhe: 3.101 ± 0.325
AsnGly: 4.319 ± 0.361
AsnHis: 0.969 ± 0.215
AsnIle: 5.704 ± 0.365
AsnLys: 6.119 ± 0.483
AsnLeu: 5.621 ± 0.411
AsnMet: 1.606 ± 0.202
AsnAsn: 4.126 ± 0.302
AsnPro: 2.187 ± 0.277
AsnGln: 2.824 ± 0.316
AsnArg: 2.132 ± 0.207
AsnSer: 4.209 ± 0.328
AsnThr: 3.295 ± 0.254
AsnVal: 3.876 ± 0.268
AsnTrp: 0.581 ± 0.114
AsnTyr: 2.686 ± 0.258
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.191 ± 0.201
ProCys: 0.138 ± 0.064
ProAsp: 1.717 ± 0.232
ProGlu: 1.578 ± 0.203
ProPhe: 1.135 ± 0.176
ProGly: 0.083 ± 0.055
ProHis: 0.415 ± 0.102
ProIle: 1.606 ± 0.191
ProLys: 2.077 ± 0.266
ProLeu: 1.855 ± 0.229
ProMet: 0.775 ± 0.142
ProAsn: 1.689 ± 0.196
ProPro: 0.332 ± 0.092
ProGln: 0.72 ± 0.148
ProArg: 0.692 ± 0.159
ProSer: 1.689 ± 0.242
ProThr: 1.384 ± 0.253
ProVal: 1.855 ± 0.248
ProTrp: 0.083 ± 0.055
ProTyr: 1.191 ± 0.218
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.187 ± 0.441
GlnCys: 0.194 ± 0.077
GlnAsp: 1.883 ± 0.215
GlnGlu: 2.492 ± 0.236
GlnPhe: 1.274 ± 0.184
GlnGly: 2.104 ± 0.236
GlnHis: 0.72 ± 0.152
GlnIle: 2.298 ± 0.272
GlnLys: 3.129 ± 0.367
GlnLeu: 3.156 ± 0.397
GlnMet: 1.191 ± 0.245
GlnAsn: 2.243 ± 0.267
GlnPro: 0.748 ± 0.133
GlnGln: 1.467 ± 0.497
GlnArg: 1.606 ± 0.199
GlnSer: 2.021 ± 0.292
GlnThr: 2.077 ± 0.38
GlnVal: 1.384 ± 0.189
GlnTrp: 0.388 ± 0.103
GlnTyr: 1.44 ± 0.223
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.301 ± 0.187
ArgCys: 0.581 ± 0.253
ArgAsp: 2.409 ± 0.231
ArgGlu: 2.824 ± 0.278
ArgPhe: 1.606 ± 0.228
ArgGly: 2.187 ± 0.261
ArgHis: 0.665 ± 0.146
ArgIle: 2.741 ± 0.354
ArgLys: 2.935 ± 0.368
ArgLeu: 2.52 ± 0.271
ArgMet: 0.803 ± 0.142
ArgAsn: 2.437 ± 0.272
ArgPro: 0.72 ± 0.147
ArgGln: 1.191 ± 0.213
ArgArg: 1.246 ± 0.205
ArgSer: 1.855 ± 0.263
ArgThr: 1.994 ± 0.196
ArgVal: 2.464 ± 0.284
ArgTrp: 0.581 ± 0.131
ArgTyr: 2.021 ± 0.235
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.655 ± 0.462
SerCys: 0.388 ± 0.098
SerAsp: 4.264 ± 0.406
SerGlu: 5.095 ± 0.331
SerPhe: 3.627 ± 0.342
SerGly: 4.264 ± 0.417
SerHis: 0.803 ± 0.15
SerIle: 5.095 ± 0.414
SerLys: 5.87 ± 0.363
SerLeu: 4.845 ± 0.49
SerMet: 1.467 ± 0.192
SerAsn: 3.738 ± 0.439
SerPro: 1.523 ± 0.223
SerGln: 2.077 ± 0.217
SerArg: 2.99 ± 0.312
SerSer: 4.596 ± 0.716
SerThr: 3.516 ± 0.387
SerVal: 3.932 ± 0.346
SerTrp: 0.692 ± 0.161
SerTyr: 2.658 ± 0.294
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 3.516 ± 0.589
ThrCys: 0.498 ± 0.14
ThrAsp: 3.987 ± 0.327
ThrGlu: 3.959 ± 0.412
ThrPhe: 2.88 ± 0.326
ThrGly: 3.267 ± 0.361
ThrHis: 1.052 ± 0.132
ThrIle: 4.458 ± 0.436
ThrLys: 5.676 ± 0.331
ThrLeu: 5.012 ± 0.324
ThrMet: 1.44 ± 0.21
ThrAsn: 3.35 ± 0.284
ThrPro: 1.883 ± 0.232
ThrGln: 1.578 ± 0.21
ThrArg: 2.077 ± 0.221
ThrSer: 3.932 ± 0.402
ThrThr: 3.489 ± 0.419
ThrVal: 4.735 ± 0.429
ThrTrp: 0.692 ± 0.167
ThrTyr: 2.409 ± 0.266
ThrXaa: 0.0 ± 0.0
Val
ValAla: 3.35 ± 0.367
ValCys: 0.526 ± 0.126
ValAsp: 4.79 ± 0.332
ValGlu: 4.956 ± 0.431
ValPhe: 2.88 ± 0.271
ValGly: 3.544 ± 0.316
ValHis: 0.941 ± 0.163
ValIle: 4.43 ± 0.498
ValLys: 5.87 ± 0.332
ValLeu: 4.153 ± 0.38
ValMet: 1.551 ± 0.233
ValAsn: 3.323 ± 0.337
ValPro: 1.467 ± 0.253
ValGln: 2.243 ± 0.233
ValArg: 1.938 ± 0.234
ValSer: 4.098 ± 0.357
ValThr: 4.569 ± 0.385
ValVal: 4.347 ± 0.393
ValTrp: 0.803 ± 0.171
ValTyr: 2.797 ± 0.37
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.388 ± 0.095
TrpCys: 0.083 ± 0.052
TrpAsp: 0.692 ± 0.122
TrpGlu: 0.831 ± 0.165
TrpPhe: 0.443 ± 0.136
TrpGly: 0.72 ± 0.159
TrpHis: 0.166 ± 0.066
TrpIle: 0.637 ± 0.149
TrpLys: 0.748 ± 0.146
TrpLeu: 0.886 ± 0.176
TrpMet: 0.36 ± 0.093
TrpAsn: 1.108 ± 0.18
TrpPro: 0.0 ± 0.0
TrpGln: 0.305 ± 0.083
TrpArg: 0.415 ± 0.1
TrpSer: 0.526 ± 0.118
TrpThr: 0.72 ± 0.152
TrpVal: 0.831 ± 0.193
TrpTrp: 0.083 ± 0.05
TrpTyr: 0.36 ± 0.093
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.437 ± 0.232
TyrCys: 0.388 ± 0.107
TyrAsp: 3.212 ± 0.313
TyrGlu: 3.572 ± 0.382
TyrPhe: 1.717 ± 0.255
TyrGly: 2.907 ± 0.282
TyrHis: 0.665 ± 0.128
TyrIle: 3.793 ± 0.32
TyrLys: 3.683 ± 0.384
TyrLeu: 3.572 ± 0.376
TyrMet: 1.024 ± 0.18
TyrAsn: 2.686 ± 0.265
TyrPro: 1.384 ± 0.23
TyrGln: 2.021 ± 0.223
TyrArg: 1.8 ± 0.23
TyrSer: 2.658 ± 0.268
TyrThr: 2.935 ± 0.371
TyrVal: 2.575 ± 0.268
TyrTrp: 0.305 ± 0.09
TyrTyr: 1.827 ± 0.235
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 178 proteins (36117 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
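A protein-level bootstrap of this kind can be sketched as below. This is an illustration under stated assumptions, not the database's actual procedure: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the sample standard deviation of the replicates is taken as the error. The function names, the fixed seed, and the choice of standard deviation as the error measure are all hypothetical.

```python
import random
import statistics


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (per mille) over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Draw whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)


toy = ["MKELLK", "MKKA", "AAKE", "LLKEK"]  # toy sequences, not phage data
print(bootstrap_error(toy, "KE"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.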

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski