Amino acid dipeptide frequency for Bordetella phage CN2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
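As a minimal sketch of how such frequencies can be computed, the Python function below counts overlapping adjacent residue pairs within each protein and reports them in per mille. The function name, the one-letter input format, and the choice to count overlapping pairs without crossing protein boundaries are assumptions made for illustration; the source does not describe the exact counting procedure used for the table.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids (one-letter codes)

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping adjacent pairs are counted within each protein,
    and any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        for i in range(len(cleaned) - 1):
            counts[cleaned[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not taken from the Bordetella phage CN2 proteome.
freqs = dipeptide_per_mille(["AAAC", "ACB"])
```

By construction the returned values sum to 1000‰, which is a convenient sanity check on any such table.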

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red and all underrepresented ones in blue in the table.
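The uniform expectation and the over/under-representation labels can be illustrated with a short Python sketch. It assumes the baseline is the uniform value (1/400, i.e. 2.5‰, or 1/441 when Xaa is included) and uses a simple above/below comparison; the exact rule used to colour the original table is not stated in the source, and the function names are hypothetical. The two example values are taken from the table.

```python
def uniform_expectation_per_mille(n_amino_acids=20):
    """Frequency each dipeptide would have if all were equally likely."""
    return 1000.0 / (n_amino_acids ** 2)

def representation(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expected frequency (assumed rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"    # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"   # would be marked blue
    return "as expected"

expected = uniform_expectation_per_mille()   # 2.5 per mille for 400 dipeptides
ala_ala_label = representation(13.453, expected)  # AlaAla value from the table
cys_cys_label = representation(0.414, expected)   # CysCys value from the table
print(ala_ala_label, cys_cys_label)
```

A uniform baseline ignores the fact that single amino acids are themselves unevenly frequent, so a composition-aware baseline (the product of the two single-residue frequencies) would be a reasonable alternative.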

All values are given in per mille (‰), so they must be multiplied by 10^-3 to obtain fractions.
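For illustration, the unit conversion can be written as two small Python helpers. The helper names are hypothetical; the example value is the AlaAla entry from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

def per_mille_to_percent(value_per_mille):
    """Convert a per mille value to percent (divide by 10)."""
    return value_per_mille / 10.0

ala_ala = 13.453  # AlaAla value from the table, in per mille
fraction = per_mille_to_fraction(ala_ala)  # about 0.013453
percent = per_mille_to_percent(ala_ala)    # about 1.3453 %
```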

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.453 ± 1.167
AlaCys: 0.828 ± 0.189
AlaAsp: 5.64 ± 0.581
AlaGlu: 8.175 ± 0.667
AlaPhe: 3.932 ± 0.47
AlaGly: 8.123 ± 0.708
AlaHis: 1.759 ± 0.315
AlaIle: 4.501 ± 0.439
AlaLys: 4.708 ± 0.559
AlaLeu: 9.727 ± 0.831
AlaMet: 1.759 ± 0.298
AlaAsn: 3.001 ± 0.423
AlaPro: 4.864 ± 0.47
AlaGln: 4.605 ± 0.761
AlaArg: 9.003 ± 0.686
AlaSer: 4.346 ± 0.531
AlaThr: 5.226 ± 0.55
AlaVal: 8.693 ± 0.79
AlaTrp: 2.277 ± 0.339
AlaTyr: 2.898 ± 0.389
AlaXaa: 0.052 ± 0.065
Cys
CysAla: 0.776 ± 0.219
CysCys: 0.414 ± 0.156
CysAsp: 0.724 ± 0.196
CysGlu: 0.88 ± 0.248
CysPhe: 0.31 ± 0.126
CysGly: 0.621 ± 0.18
CysHis: 0.414 ± 0.122
CysIle: 0.103 ± 0.06
CysLys: 0.207 ± 0.094
CysLeu: 1.035 ± 0.225
CysMet: 0.052 ± 0.043
CysAsn: 0.362 ± 0.132
CysPro: 0.466 ± 0.141
CysGln: 0.103 ± 0.072
CysArg: 1.087 ± 0.216
CysSer: 0.517 ± 0.16
CysThr: 0.414 ± 0.129
CysVal: 0.621 ± 0.184
CysTrp: 0.259 ± 0.095
CysTyr: 0.414 ± 0.157
CysXaa: 0.0 ± 0.0
Asp
AspAla: 5.485 ± 0.555
AspCys: 0.569 ± 0.159
AspAsp: 3.053 ± 0.41
AspGlu: 3.622 ± 0.392
AspPhe: 2.898 ± 0.433
AspGly: 5.898 ± 0.577
AspHis: 1.397 ± 0.3
AspIle: 2.949 ± 0.377
AspLys: 2.121 ± 0.429
AspLeu: 6.933 ± 0.55
AspMet: 1.397 ± 0.281
AspAsn: 1.863 ± 0.286
AspPro: 4.346 ± 0.452
AspGln: 2.846 ± 0.414
AspArg: 3.932 ± 0.508
AspSer: 2.277 ± 0.334
AspThr: 2.535 ± 0.42
AspVal: 3.725 ± 0.435
AspTrp: 1.035 ± 0.237
AspTyr: 2.121 ± 0.336
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.227 ± 0.776
GluCys: 0.828 ± 0.188
GluAsp: 3.829 ± 0.457
GluGlu: 5.743 ± 0.564
GluPhe: 2.535 ± 0.373
GluGly: 5.485 ± 0.588
GluHis: 1.242 ± 0.284
GluIle: 2.328 ± 0.347
GluLys: 3.208 ± 0.441
GluLeu: 7.554 ± 0.709
GluMet: 1.397 ± 0.291
GluAsn: 2.38 ± 0.345
GluPro: 3.415 ± 0.425
GluGln: 3.053 ± 0.408
GluArg: 5.898 ± 0.694
GluSer: 2.484 ± 0.413
GluThr: 4.036 ± 0.438
GluVal: 5.536 ± 0.486
GluTrp: 1.707 ± 0.325
GluTyr: 2.121 ± 0.281
GluXaa: 0.052 ± 0.043
Phe
PheAla: 3.674 ± 0.516
PheCys: 0.207 ± 0.095
PheAsp: 3.104 ± 0.435
PheGlu: 3.208 ± 0.336
PhePhe: 1.5 ± 0.261
PheGly: 3.415 ± 0.52
PheHis: 0.673 ± 0.209
PheIle: 1.19 ± 0.293
PheLys: 1.5 ± 0.32
PheLeu: 2.742 ± 0.408
PheMet: 1.087 ± 0.277
PheAsn: 1.5 ± 0.255
PhePro: 1.242 ± 0.269
PheGln: 1.294 ± 0.249
PheArg: 2.794 ± 0.351
PheSer: 1.5 ± 0.234
PheThr: 2.277 ± 0.346
PheVal: 2.225 ± 0.342
PheTrp: 0.673 ± 0.175
PheTyr: 0.776 ± 0.205
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.295 ± 0.71
GlyCys: 0.621 ± 0.167
GlyAsp: 4.708 ± 0.379
GlyGlu: 6.157 ± 0.542
GlyPhe: 3.725 ± 0.383
GlyGly: 7.089 ± 0.604
GlyHis: 0.983 ± 0.252
GlyIle: 2.846 ± 0.403
GlyLys: 4.243 ± 0.464
GlyLeu: 6.571 ± 0.667
GlyMet: 1.863 ± 0.359
GlyAsn: 2.018 ± 0.287
GlyPro: 4.036 ± 0.527
GlyGln: 2.794 ± 0.352
GlyArg: 5.433 ± 0.684
GlySer: 4.036 ± 0.426
GlyThr: 4.812 ± 0.464
GlyVal: 6.209 ± 0.599
GlyTrp: 1.656 ± 0.308
GlyTyr: 2.225 ± 0.332
GlyXaa: 0.155 ± 0.083
His
HisAla: 1.449 ± 0.253
HisCys: 0.414 ± 0.135
HisAsp: 0.828 ± 0.218
HisGlu: 1.656 ± 0.239
HisPhe: 0.569 ± 0.149
HisGly: 1.242 ± 0.27
HisHis: 0.362 ± 0.148
HisIle: 1.087 ± 0.227
HisLys: 0.621 ± 0.2
HisLeu: 1.138 ± 0.272
HisMet: 0.466 ± 0.154
HisAsn: 0.31 ± 0.155
HisPro: 1.397 ± 0.253
HisGln: 0.983 ± 0.221
HisArg: 1.345 ± 0.268
HisSer: 0.828 ± 0.198
HisThr: 0.569 ± 0.19
HisVal: 1.294 ± 0.244
HisTrp: 0.414 ± 0.135
HisTyr: 0.621 ± 0.149
HisXaa: 0.052 ± 0.053
Ile
IleAla: 4.657 ± 0.46
IleCys: 0.362 ± 0.166
IleAsp: 3.311 ± 0.474
IleGlu: 3.467 ± 0.338
IlePhe: 1.087 ± 0.242
IleGly: 3.208 ± 0.453
IleHis: 0.983 ± 0.243
IleIle: 1.345 ± 0.292
IleLys: 2.484 ± 0.328
IleLeu: 3.26 ± 0.493
IleMet: 1.138 ± 0.197
IleAsn: 2.328 ± 0.337
IlePro: 1.811 ± 0.332
IleGln: 2.38 ± 0.284
IleArg: 2.639 ± 0.337
IleSer: 1.811 ± 0.286
IleThr: 3.26 ± 0.417
IleVal: 2.794 ± 0.364
IleTrp: 0.569 ± 0.152
IleTyr: 1.294 ± 0.254
IleXaa: 0.0 ± 0.0
Lys
LysAla: 6.002 ± 0.58
LysCys: 0.362 ± 0.136
LysAsp: 2.846 ± 0.428
LysGlu: 2.639 ± 0.35
LysPhe: 1.449 ± 0.26
LysGly: 2.846 ± 0.451
LysHis: 1.035 ± 0.278
LysIle: 1.19 ± 0.206
LysLys: 1.656 ± 0.322
LysLeu: 3.415 ± 0.389
LysMet: 0.931 ± 0.23
LysAsn: 1.242 ± 0.232
LysPro: 2.949 ± 0.474
LysGln: 1.811 ± 0.29
LysArg: 2.846 ± 0.448
LysSer: 2.328 ± 0.356
LysThr: 2.07 ± 0.425
LysVal: 3.001 ± 0.483
LysTrp: 0.621 ± 0.171
LysTyr: 1.19 ± 0.265
LysXaa: 0.155 ± 0.069
Leu
LeuAla: 8.796 ± 0.69
LeuCys: 1.035 ± 0.24
LeuAsp: 6.261 ± 0.539
LeuGlu: 5.743 ± 0.485
LeuPhe: 2.794 ± 0.375
LeuGly: 6.623 ± 0.659
LeuHis: 1.707 ± 0.329
LeuIle: 4.501 ± 0.493
LeuLys: 3.932 ± 0.385
LeuLeu: 6.261 ± 0.783
LeuMet: 1.449 ± 0.33
LeuAsn: 3.053 ± 0.41
LeuPro: 5.795 ± 0.698
LeuGln: 3.518 ± 0.464
LeuArg: 6.623 ± 0.535
LeuSer: 4.243 ± 0.561
LeuThr: 6.002 ± 0.622
LeuVal: 4.864 ± 0.582
LeuTrp: 1.5 ± 0.32
LeuTyr: 2.587 ± 0.386
LeuXaa: 0.103 ± 0.074
Met
MetAla: 2.277 ± 0.365
MetCys: 0.052 ± 0.058
MetAsp: 1.449 ± 0.225
MetGlu: 0.88 ± 0.227
MetPhe: 0.517 ± 0.156
MetGly: 1.138 ± 0.267
MetHis: 0.259 ± 0.113
MetIle: 1.604 ± 0.306
MetLys: 0.983 ± 0.227
MetLeu: 1.449 ± 0.252
MetMet: 0.724 ± 0.235
MetAsn: 0.983 ± 0.192
MetPro: 0.983 ± 0.224
MetGln: 0.207 ± 0.133
MetArg: 1.5 ± 0.241
MetSer: 2.432 ± 0.362
MetThr: 1.138 ± 0.287
MetVal: 1.242 ± 0.282
MetTrp: 0.259 ± 0.146
MetTyr: 0.362 ± 0.134
MetXaa: 0.052 ± 0.052
Asn
AsnAla: 4.398 ± 0.506
AsnCys: 0.259 ± 0.118
AsnAsp: 1.707 ± 0.272
AsnGlu: 2.38 ± 0.354
AsnPhe: 0.88 ± 0.176
AsnGly: 3.674 ± 0.49
AsnHis: 0.31 ± 0.109
AsnIle: 1.552 ± 0.228
AsnLys: 1.759 ± 0.313
AsnLeu: 3.104 ± 0.372
AsnMet: 0.569 ± 0.17
AsnAsn: 1.449 ± 0.38
AsnPro: 2.225 ± 0.323
AsnGln: 1.087 ± 0.251
AsnArg: 2.432 ± 0.29
AsnSer: 1.397 ± 0.306
AsnThr: 1.604 ± 0.256
AsnVal: 2.949 ± 0.401
AsnTrp: 0.724 ± 0.188
AsnTyr: 1.035 ± 0.19
AsnXaa: 0.052 ± 0.053
Pro
ProAla: 5.381 ± 0.594
ProCys: 0.362 ± 0.128
ProAsp: 3.26 ± 0.405
ProGlu: 5.278 ± 0.532
ProPhe: 2.432 ± 0.336
ProGly: 4.501 ± 0.479
ProHis: 0.673 ± 0.194
ProIle: 2.535 ± 0.328
ProLys: 2.432 ± 0.363
ProLeu: 4.346 ± 0.543
ProMet: 0.88 ± 0.193
ProAsn: 2.225 ± 0.351
ProPro: 2.742 ± 0.382
ProGln: 1.759 ± 0.283
ProArg: 2.794 ± 0.449
ProSer: 2.484 ± 0.343
ProThr: 2.846 ± 0.422
ProVal: 3.518 ± 0.408
ProTrp: 1.138 ± 0.266
ProTyr: 1.242 ± 0.26
ProXaa: 0.155 ± 0.086
Gln
GlnAla: 4.501 ± 0.532
GlnCys: 0.259 ± 0.106
GlnAsp: 2.535 ± 0.295
GlnGlu: 2.898 ± 0.382
GlnPhe: 1.19 ± 0.232
GlnGly: 2.949 ± 0.33
GlnHis: 0.88 ± 0.174
GlnIle: 2.38 ± 0.335
GlnLys: 1.604 ± 0.258
GlnLeu: 3.311 ± 0.453
GlnMet: 1.294 ± 0.226
GlnAsn: 1.294 ± 0.216
GlnPro: 1.656 ± 0.315
GlnGln: 2.173 ± 0.397
GlnArg: 3.104 ± 0.345
GlnSer: 1.552 ± 0.238
GlnThr: 2.07 ± 0.321
GlnVal: 2.742 ± 0.42
GlnTrp: 0.828 ± 0.159
GlnTyr: 0.776 ± 0.19
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.106 ± 0.678
ArgCys: 0.569 ± 0.167
ArgAsp: 4.501 ± 0.556
ArgGlu: 6.985 ± 0.678
ArgPhe: 3.001 ± 0.44
ArgGly: 4.191 ± 0.458
ArgHis: 1.242 ± 0.284
ArgIle: 2.949 ± 0.397
ArgLys: 2.121 ± 0.343
ArgLeu: 7.295 ± 0.548
ArgMet: 0.931 ± 0.206
ArgAsn: 3.001 ± 0.398
ArgPro: 3.208 ± 0.429
ArgGln: 2.691 ± 0.34
ArgArg: 4.76 ± 0.59
ArgSer: 2.587 ± 0.383
ArgThr: 3.674 ± 0.423
ArgVal: 5.019 ± 0.579
ArgTrp: 1.345 ± 0.284
ArgTyr: 1.966 ± 0.305
ArgXaa: 0.103 ± 0.075
Ser
SerAla: 3.674 ± 0.54
SerCys: 0.517 ± 0.125
SerAsp: 2.691 ± 0.401
SerGlu: 2.328 ± 0.298
SerPhe: 2.018 ± 0.292
SerGly: 5.226 ± 0.576
SerHis: 0.724 ± 0.17
SerIle: 2.38 ± 0.376
SerLys: 2.639 ± 0.299
SerLeu: 3.415 ± 0.393
SerMet: 0.931 ± 0.212
SerAsn: 1.811 ± 0.291
SerPro: 2.121 ± 0.312
SerGln: 2.018 ± 0.263
SerArg: 2.794 ± 0.325
SerSer: 2.846 ± 0.443
SerThr: 2.535 ± 0.286
SerVal: 3.311 ± 0.416
SerTrp: 1.242 ± 0.23
SerTyr: 1.604 ± 0.251
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.157 ± 0.574
ThrCys: 0.517 ± 0.163
ThrAsp: 2.587 ± 0.342
ThrGlu: 3.467 ± 0.371
ThrPhe: 1.914 ± 0.283
ThrGly: 4.967 ± 0.479
ThrHis: 0.88 ± 0.191
ThrIle: 2.846 ± 0.37
ThrLys: 2.07 ± 0.318
ThrLeu: 4.812 ± 0.55
ThrMet: 1.035 ± 0.251
ThrAsn: 1.759 ± 0.355
ThrPro: 3.674 ± 0.447
ThrGln: 2.07 ± 0.26
ThrArg: 3.311 ± 0.4
ThrSer: 2.587 ± 0.366
ThrThr: 2.328 ± 0.355
ThrVal: 4.708 ± 0.513
ThrTrp: 1.035 ± 0.263
ThrTyr: 1.449 ± 0.266
ThrXaa: 0.103 ± 0.074
Val
ValAla: 6.882 ± 0.782
ValCys: 0.931 ± 0.218
ValAsp: 4.864 ± 0.534
ValGlu: 4.346 ± 0.583
ValPhe: 2.277 ± 0.342
ValGly: 4.812 ± 0.532
ValHis: 1.449 ± 0.295
ValIle: 3.208 ± 0.435
ValLys: 2.328 ± 0.359
ValLeu: 6.364 ± 0.588
ValMet: 1.449 ± 0.282
ValAsn: 2.949 ± 0.431
ValPro: 3.829 ± 0.372
ValGln: 3.053 ± 0.551
ValArg: 6.002 ± 0.595
ValSer: 3.984 ± 0.557
ValThr: 4.139 ± 0.508
ValVal: 4.346 ± 0.516
ValTrp: 1.552 ± 0.266
ValTyr: 1.811 ± 0.265
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 2.639 ± 0.346
TrpCys: 0.259 ± 0.121
TrpAsp: 1.294 ± 0.252
TrpGlu: 1.604 ± 0.253
TrpPhe: 0.828 ± 0.196
TrpGly: 0.776 ± 0.185
TrpHis: 0.259 ± 0.121
TrpIle: 1.035 ± 0.186
TrpLys: 1.035 ± 0.243
TrpLeu: 1.966 ± 0.335
TrpMet: 0.517 ± 0.172
TrpAsn: 0.673 ± 0.198
TrpPro: 0.673 ± 0.18
TrpGln: 0.621 ± 0.171
TrpArg: 1.087 ± 0.226
TrpSer: 1.087 ± 0.217
TrpThr: 1.138 ± 0.253
TrpVal: 1.449 ± 0.257
TrpTrp: 0.259 ± 0.098
TrpTyr: 0.569 ± 0.147
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.484 ± 0.372
TyrCys: 0.414 ± 0.117
TyrAsp: 1.707 ± 0.28
TyrGlu: 1.449 ± 0.285
TyrPhe: 0.724 ± 0.185
TyrGly: 2.535 ± 0.404
TyrHis: 0.414 ± 0.139
TyrIle: 1.656 ± 0.322
TyrLys: 0.931 ± 0.198
TyrLeu: 2.535 ± 0.397
TyrMet: 0.517 ± 0.176
TyrAsn: 1.294 ± 0.3
TyrPro: 1.5 ± 0.335
TyrGln: 0.931 ± 0.231
TyrArg: 1.863 ± 0.299
TyrSer: 1.5 ± 0.284
TyrThr: 1.604 ± 0.239
TyrVal: 2.277 ± 0.359
TyrTrp: 0.673 ± 0.158
TyrTyr: 0.88 ± 0.199
TyrXaa: 0.052 ± 0.045
Xaa
XaaAla: 0.259 ± 0.122
XaaCys: 0.0 ± 0.0
XaaAsp: 0.103 ± 0.072
XaaGlu: 0.155 ± 0.077
XaaPhe: 0.0 ± 0.0
XaaGly: 0.207 ± 0.092
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.155 ± 0.099
XaaMet: 0.052 ± 0.046
XaaAsn: 0.052 ± 0.053
XaaPro: 0.052 ± 0.043
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.052 ± 0.046
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 81 proteins (19328 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
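Protein-level bootstrapping can be sketched in Python as follows: whole proteins are resampled with replacement, the statistic is recomputed for each resample, and the spread of the results serves as the error estimate. This is only an illustration under stated assumptions: the use of the standard deviation as the reported error, the helper names, and the fixed seed are not confirmed by the source.

```python
import random
import statistics

def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (overlapping pairs), in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)

# Toy sequences, not the actual proteome.
toy = ["MAAKL", "MKAAL", "MLLKA", "MAALA"]
error = bootstrap_error(toy, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins.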

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski