Amino acid dipeptide frequency for Mycobacterium phage Filuzino

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 1/400 = 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
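As an illustration of the calculation described above, the following is a minimal sketch of how dipeptide frequencies in per mille could be computed and compared with the uniform expectation of 2.5‰. It is not the Proteome-pI implementation: the function names (`dipeptide_per_mille`, `classify`, `per_mille_to_fraction`), the choice to normalise by the total number of overlapping dipeptides, and the simple above/below threshold are all assumptions made for this example.

```python
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25 % = 2.5 per mille.
UNIFORM_EXPECTATION = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted within each protein and
    normalised by the total number of pairs found.
    """
    counts = Counter()
    for seq in proteins:
        # Map every non-standard or unknown residue to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}


def classify(freq_per_mille, expected=UNIFORM_EXPECTATION):
    """Label a frequency as over- or underrepresented (hypothetical rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


if __name__ == "__main__":
    toy_proteins = ["MAAGK", "AAXG"]  # made-up sequences, not from this proteome
    for dipeptide, freq in sorted(dipeptide_per_mille(toy_proteins).items()):
        print(dipeptide, round(freq, 3), classify(freq))
```

With the values from the table, `classify(15.112)` (AlaAla) gives "over" and `classify(0.842)` (AlaCys) gives "under" under this assumed rule.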

For more information, see the sequence space article on Wikipedia.

Residue order (first residue per block, second residue per entry): Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 15.112 ± 1.592
AlaCys: 0.842 ± 0.221
AlaAsp: 7.108 ± 0.675
AlaGlu: 7.372 ± 0.673
AlaPhe: 2.633 ± 0.388
AlaGly: 10.583 ± 1.241
AlaHis: 2.264 ± 0.406
AlaIle: 4.107 ± 0.458
AlaLys: 3.896 ± 0.405
AlaLeu: 8.477 ± 0.809
AlaMet: 2.211 ± 0.361
AlaAsn: 2.58 ± 0.37
AlaPro: 5.265 ± 0.611
AlaGln: 3.528 ± 0.361
AlaArg: 7.687 ± 0.716
AlaSer: 5.423 ± 0.653
AlaThr: 5.95 ± 0.583
AlaVal: 7.056 ± 0.569
AlaTrp: 2.475 ± 0.443
AlaTyr: 2.317 ± 0.327
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.948 ± 0.296
CysCys: 0.105 ± 0.075
CysAsp: 1.369 ± 0.325
CysGlu: 0.842 ± 0.23
CysPhe: 0.158 ± 0.086
CysGly: 1.632 ± 0.278
CysHis: 0.105 ± 0.07
CysIle: 0.369 ± 0.159
CysLys: 0.527 ± 0.177
CysLeu: 0.684 ± 0.204
CysMet: 0.211 ± 0.085
CysAsn: 0.369 ± 0.129
CysPro: 1.264 ± 0.308
CysGln: 0.263 ± 0.111
CysArg: 0.842 ± 0.283
CysSer: 0.632 ± 0.184
CysThr: 0.684 ± 0.202
CysVal: 0.684 ± 0.18
CysTrp: 0.474 ± 0.169
CysTyr: 0.158 ± 0.087
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.845 ± 0.619
AspCys: 1.264 ± 0.297
AspAsp: 4.634 ± 0.605
AspGlu: 3.528 ± 0.448
AspPhe: 2.264 ± 0.31
AspGly: 7.477 ± 0.734
AspHis: 1.422 ± 0.314
AspIle: 2.264 ± 0.324
AspLys: 1.369 ± 0.236
AspLeu: 6.266 ± 0.555
AspMet: 1.053 ± 0.218
AspAsn: 1.685 ± 0.333
AspPro: 5.16 ± 0.561
AspGln: 2.053 ± 0.319
AspArg: 4.739 ± 0.58
AspSer: 3.686 ± 0.601
AspThr: 4.265 ± 0.46
AspVal: 4.423 ± 0.521
AspTrp: 1.685 ± 0.284
AspTyr: 2.159 ± 0.298
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.16 ± 0.758
GluCys: 0.684 ± 0.179
GluAsp: 2.896 ± 0.409
GluGlu: 2.633 ± 0.485
GluPhe: 2.159 ± 0.326
GluGly: 3.159 ± 0.382
GluHis: 1.58 ± 0.35
GluIle: 2.264 ± 0.336
GluLys: 1.632 ± 0.255
GluLeu: 5.95 ± 0.728
GluMet: 1.58 ± 0.26
GluAsn: 1.79 ± 0.289
GluPro: 2.738 ± 0.427
GluGln: 2.791 ± 0.412
GluArg: 4.791 ± 0.536
GluSer: 2.896 ± 0.452
GluThr: 4.002 ± 0.58
GluVal: 3.896 ± 0.501
GluTrp: 1.211 ± 0.233
GluTyr: 1.632 ± 0.325
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.001 ± 0.384
PheCys: 0.263 ± 0.121
PheAsp: 2.58 ± 0.431
PheGlu: 1.58 ± 0.322
PhePhe: 0.895 ± 0.245
PheGly: 3.001 ± 0.64
PheHis: 0.421 ± 0.15
PheIle: 1.58 ± 0.383
PheLys: 1.053 ± 0.204
PheLeu: 1.474 ± 0.231
PheMet: 0.842 ± 0.246
PheAsn: 1.158 ± 0.304
PhePro: 1.58 ± 0.281
PheGln: 0.948 ± 0.267
PheArg: 1.369 ± 0.214
PheSer: 1.738 ± 0.303
PheThr: 2.475 ± 0.438
PheVal: 2.001 ± 0.272
PheTrp: 0.527 ± 0.124
PheTyr: 0.842 ± 0.255
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 8.951 ± 1.149
GlyCys: 1.158 ± 0.298
GlyAsp: 6.634 ± 0.659
GlyGlu: 4.002 ± 0.566
GlyPhe: 2.843 ± 0.468
GlyGly: 10.267 ± 1.872
GlyHis: 2.001 ± 0.291
GlyIle: 4.318 ± 0.497
GlyLys: 2.317 ± 0.391
GlyLeu: 5.897 ± 0.51
GlyMet: 2.264 ± 0.482
GlyAsn: 3.054 ± 0.446
GlyPro: 4.528 ± 0.474
GlyGln: 2.58 ± 0.52
GlyArg: 5.265 ± 0.579
GlySer: 6.318 ± 0.987
GlyThr: 6.687 ± 0.758
GlyVal: 5.687 ± 0.583
GlyTrp: 2.369 ± 0.344
GlyTyr: 2.58 ± 0.453
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.738 ± 0.367
HisCys: 0.369 ± 0.163
HisAsp: 1.106 ± 0.272
HisGlu: 1.58 ± 0.306
HisPhe: 0.527 ± 0.159
HisGly: 1.79 ± 0.325
HisHis: 0.948 ± 0.256
HisIle: 1.632 ± 0.338
HisLys: 0.737 ± 0.215
HisLeu: 1.158 ± 0.255
HisMet: 0.632 ± 0.169
HisAsn: 0.79 ± 0.167
HisPro: 1.685 ± 0.304
HisGln: 0.948 ± 0.231
HisArg: 2.317 ± 0.351
HisSer: 0.842 ± 0.196
HisThr: 1.316 ± 0.321
HisVal: 1.211 ± 0.253
HisTrp: 0.632 ± 0.171
HisTyr: 0.684 ± 0.192
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.423 ± 0.578
IleCys: 0.474 ± 0.15
IleAsp: 3.896 ± 0.491
IleGlu: 3.212 ± 0.354
IlePhe: 0.737 ± 0.219
IleGly: 3.633 ± 0.514
IleHis: 1.527 ± 0.345
IleIle: 1.316 ± 0.285
IleLys: 1.0 ± 0.203
IleLeu: 2.211 ± 0.347
IleMet: 0.369 ± 0.137
IleAsn: 2.106 ± 0.27
IlePro: 2.685 ± 0.336
IleGln: 1.158 ± 0.227
IleArg: 2.949 ± 0.458
IleSer: 2.001 ± 0.381
IleThr: 3.475 ± 0.429
IleVal: 2.738 ± 0.336
IleTrp: 1.0 ± 0.248
IleTyr: 0.684 ± 0.239
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.58 ± 0.398
LysCys: 0.421 ± 0.159
LysAsp: 1.58 ± 0.297
LysGlu: 1.422 ± 0.256
LysPhe: 1.053 ± 0.188
LysGly: 2.685 ± 0.338
LysHis: 0.948 ± 0.226
LysIle: 1.0 ± 0.267
LysLys: 1.211 ± 0.313
LysLeu: 2.317 ± 0.499
LysMet: 0.632 ± 0.186
LysAsn: 0.79 ± 0.188
LysPro: 2.58 ± 0.423
LysGln: 1.58 ± 0.254
LysArg: 1.738 ± 0.296
LysSer: 1.79 ± 0.294
LysThr: 1.948 ± 0.357
LysVal: 2.317 ± 0.399
LysTrp: 0.842 ± 0.209
LysTyr: 1.053 ± 0.295
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.003 ± 0.849
LeuCys: 0.79 ± 0.201
LeuAsp: 5.634 ± 0.618
LeuGlu: 3.633 ± 0.507
LeuPhe: 2.633 ± 0.288
LeuGly: 5.318 ± 0.579
LeuHis: 1.0 ± 0.243
LeuIle: 3.159 ± 0.41
LeuLys: 1.685 ± 0.294
LeuLeu: 4.949 ± 0.556
LeuMet: 1.422 ± 0.26
LeuAsn: 2.527 ± 0.395
LeuPro: 5.213 ± 0.694
LeuGln: 2.422 ± 0.392
LeuArg: 5.687 ± 0.72
LeuSer: 5.476 ± 0.491
LeuThr: 5.845 ± 0.635
LeuVal: 5.107 ± 0.507
LeuTrp: 1.264 ± 0.26
LeuTyr: 2.106 ± 0.322
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.211 ± 0.332
MetCys: 0.263 ± 0.151
MetAsp: 1.316 ± 0.278
MetGlu: 1.053 ± 0.219
MetPhe: 0.579 ± 0.177
MetGly: 1.738 ± 0.304
MetHis: 0.211 ± 0.106
MetIle: 0.948 ± 0.24
MetLys: 0.79 ± 0.229
MetLeu: 1.474 ± 0.228
MetMet: 0.527 ± 0.206
MetAsn: 1.106 ± 0.209
MetPro: 1.422 ± 0.275
MetGln: 0.527 ± 0.145
MetArg: 1.474 ± 0.322
MetSer: 2.527 ± 0.416
MetThr: 2.211 ± 0.323
MetVal: 1.369 ± 0.333
MetTrp: 0.263 ± 0.1
MetTyr: 0.316 ± 0.124
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.528 ± 0.384
AsnCys: 0.105 ± 0.074
AsnAsp: 2.159 ± 0.28
AsnGlu: 1.632 ± 0.316
AsnPhe: 0.842 ± 0.285
AsnGly: 4.212 ± 0.581
AsnHis: 0.842 ± 0.181
AsnIle: 1.527 ± 0.395
AsnLys: 1.0 ± 0.25
AsnLeu: 2.211 ± 0.35
AsnMet: 0.579 ± 0.169
AsnAsn: 1.738 ± 0.381
AsnPro: 2.58 ± 0.35
AsnGln: 1.053 ± 0.328
AsnArg: 2.106 ± 0.38
AsnSer: 1.422 ± 0.276
AsnThr: 2.369 ± 0.341
AsnVal: 1.896 ± 0.354
AsnTrp: 0.684 ± 0.148
AsnTyr: 0.632 ± 0.162
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.844 ± 0.582
ProCys: 0.684 ± 0.178
ProAsp: 5.371 ± 0.479
ProGlu: 4.054 ± 0.485
ProPhe: 1.896 ± 0.353
ProGly: 6.16 ± 0.584
ProHis: 1.58 ± 0.291
ProIle: 1.896 ± 0.259
ProLys: 2.633 ± 0.43
ProLeu: 3.896 ± 0.473
ProMet: 1.685 ± 0.335
ProAsn: 2.369 ± 0.309
ProPro: 3.528 ± 0.58
ProGln: 1.896 ± 0.321
ProArg: 3.37 ± 0.51
ProSer: 3.107 ± 0.487
ProThr: 3.844 ± 0.425
ProVal: 5.371 ± 0.52
ProTrp: 1.106 ± 0.188
ProTyr: 1.79 ± 0.275
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.16 ± 0.583
GlnCys: 0.421 ± 0.154
GlnAsp: 1.632 ± 0.253
GlnGlu: 1.527 ± 0.302
GlnPhe: 1.106 ± 0.243
GlnGly: 2.264 ± 0.455
GlnHis: 0.79 ± 0.225
GlnIle: 1.738 ± 0.269
GlnLys: 1.158 ± 0.236
GlnLeu: 3.159 ± 0.448
GlnMet: 0.684 ± 0.178
GlnAsn: 0.684 ± 0.181
GlnPro: 2.369 ± 0.383
GlnGln: 1.053 ± 0.217
GlnArg: 2.58 ± 0.365
GlnSer: 2.159 ± 0.334
GlnThr: 1.738 ± 0.319
GlnVal: 2.159 ± 0.312
GlnTrp: 0.632 ± 0.158
GlnTyr: 1.053 ± 0.257
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.95 ± 0.58
ArgCys: 1.58 ± 0.364
ArgAsp: 4.423 ± 0.572
ArgGlu: 4.054 ± 0.6
ArgPhe: 2.053 ± 0.378
ArgGly: 4.16 ± 0.412
ArgHis: 1.211 ± 0.255
ArgIle: 3.475 ± 0.503
ArgLys: 2.264 ± 0.378
ArgLeu: 5.265 ± 0.729
ArgMet: 2.159 ± 0.307
ArgAsn: 2.58 ± 0.447
ArgPro: 3.475 ± 0.375
ArgGln: 2.211 ± 0.309
ArgArg: 5.792 ± 0.775
ArgSer: 3.528 ± 0.418
ArgThr: 3.791 ± 0.521
ArgVal: 5.213 ± 0.55
ArgTrp: 2.106 ± 0.338
ArgTyr: 1.79 ± 0.28
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.634 ± 0.775
SerCys: 0.527 ± 0.154
SerAsp: 4.107 ± 0.553
SerGlu: 2.843 ± 0.348
SerPhe: 1.79 ± 0.457
SerGly: 7.003 ± 0.961
SerHis: 1.369 ± 0.259
SerIle: 3.001 ± 0.432
SerLys: 2.369 ± 0.381
SerLeu: 3.738 ± 0.453
SerMet: 1.474 ± 0.258
SerAsn: 2.053 ± 0.401
SerPro: 3.159 ± 0.379
SerGln: 1.632 ± 0.307
SerArg: 3.422 ± 0.428
SerSer: 3.896 ± 0.753
SerThr: 3.422 ± 0.483
SerVal: 4.318 ± 0.537
SerTrp: 1.369 ± 0.255
SerTyr: 1.316 ± 0.211
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.16 ± 0.554
ThrCys: 0.79 ± 0.199
ThrAsp: 4.16 ± 0.619
ThrGlu: 3.422 ± 0.38
ThrPhe: 1.58 ± 0.36
ThrGly: 5.95 ± 0.628
ThrHis: 1.685 ± 0.327
ThrIle: 3.317 ± 0.389
ThrLys: 1.948 ± 0.278
ThrLeu: 5.213 ± 0.594
ThrMet: 1.369 ± 0.252
ThrAsn: 2.369 ± 0.35
ThrPro: 5.476 ± 0.711
ThrGln: 1.843 ± 0.284
ThrArg: 3.896 ± 0.35
ThrSer: 4.581 ± 0.541
ThrThr: 5.16 ± 0.657
ThrVal: 6.055 ± 0.54
ThrTrp: 1.211 ± 0.276
ThrTyr: 2.001 ± 0.341
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.319 ± 0.544
ValCys: 1.211 ± 0.279
ValAsp: 5.581 ± 0.555
ValGlu: 4.686 ± 0.577
ValPhe: 2.211 ± 0.342
ValGly: 5.423 ± 0.627
ValHis: 1.474 ± 0.301
ValIle: 2.791 ± 0.445
ValLys: 2.211 ± 0.295
ValLeu: 5.476 ± 0.543
ValMet: 1.474 ± 0.224
ValAsn: 2.053 ± 0.302
ValPro: 3.686 ± 0.399
ValGln: 2.475 ± 0.376
ValArg: 4.37 ± 0.584
ValSer: 4.476 ± 0.511
ValThr: 5.318 ± 0.472
ValVal: 5.792 ± 0.776
ValTrp: 2.053 ± 0.33
ValTyr: 1.211 ± 0.227
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.79 ± 0.283
TrpCys: 0.105 ± 0.083
TrpAsp: 1.632 ± 0.312
TrpGlu: 1.422 ± 0.338
TrpPhe: 0.737 ± 0.2
TrpGly: 1.158 ± 0.244
TrpHis: 0.79 ± 0.187
TrpIle: 0.948 ± 0.22
TrpLys: 0.948 ± 0.218
TrpLeu: 1.738 ± 0.316
TrpMet: 0.895 ± 0.258
TrpAsn: 0.632 ± 0.178
TrpPro: 1.211 ± 0.271
TrpGln: 1.211 ± 0.249
TrpArg: 2.001 ± 0.414
TrpSer: 1.632 ± 0.38
TrpThr: 1.632 ± 0.311
TrpVal: 1.738 ± 0.442
TrpTrp: 0.842 ± 0.201
TrpTyr: 0.316 ± 0.119
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.58 ± 0.383
TyrCys: 0.263 ± 0.103
TyrAsp: 1.422 ± 0.33
TyrGlu: 1.948 ± 0.287
TyrPhe: 0.632 ± 0.18
TyrGly: 2.106 ± 0.366
TyrHis: 0.474 ± 0.14
TyrIle: 0.948 ± 0.194
TyrLys: 0.737 ± 0.202
TyrLeu: 2.317 ± 0.342
TyrMet: 0.211 ± 0.109
TyrAsn: 0.684 ± 0.186
TyrPro: 1.474 ± 0.258
TyrGln: 0.895 ± 0.206
TyrArg: 1.685 ± 0.285
TyrSer: 1.316 ± 0.27
TyrThr: 2.053 ± 0.372
TyrVal: 2.264 ± 0.328
TyrTrp: 0.632 ± 0.178
TyrTyr: 0.842 ± 0.202
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 105 proteins (18993 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
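The protein-level bootstrap mentioned in the note can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is taken as the error. This is only an illustrative sketch; the function names, the fixed random seed, and the use of the sample standard deviation as the reported error are assumptions, since the page does not specify these details.

```python
import random
from collections import Counter
from statistics import stdev


def count_dipeptides(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins are the resampling unit, and the error is the
    standard deviation of the frequency over n_boot resamples.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [count_dipeptides(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


if __name__ == "__main__":
    toy_proteins = ["MAAGKAA", "GGSAAL", "MKLV", "AAGG"]  # made-up sequences
    print(round(bootstrap_error(toy_proteins, "AA"), 3))
```

Resampling whole proteins rather than individual dipeptides keeps the dipeptides of one protein together, so the estimate reflects variation between proteins.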

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski