Amino acid dipeptide frequency for Mycobacterium phage Barriga

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent residues occurs.
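As an illustration, the counting can be sketched in Python as follows. This is a minimal sketch and not the database's own code: the function name, the use of one-letter codes, the mapping of every non-standard letter to X (Xaa), and the normalisation by the total number of counted dipeptides are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_frequencies_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Hypothetical helper: pairs are counted with overlap inside each protein,
    never across protein boundaries, and any non-standard letter becomes "X".
    """
    counts = Counter()
    for seq in proteins:
        residues = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # zip(residues, residues[1:]) yields every adjacent, overlapping pair.
        for first, second in zip(residues, residues[1:]):
            counts[first + second] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    # Assumption: normalise by the number of dipeptides counted.
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    freqs = dipeptide_frequencies_permille(["AAC", "CA"])
    for pair in sorted(freqs):
        print(pair, round(freqs[pair], 3))
```

Counting within each protein separately avoids creating artificial dipeptides from the last residue of one protein and the first residue of the next.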

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each dipeptide would have a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
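To compare a table value with the uniform expectation, both have to be in the same unit. The sketch below does this arithmetic; the function names are hypothetical, and the simple greater-than or less-than comparison is an assumption, since the exact rule used for the red and blue marking is not stated on this page.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected frequency of one dipeptide, in per mille, if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def permille_to_fraction(value_permille):
    """Convert a table value in per mille to a plain fraction."""
    return value_permille * 1e-3


def representation(value_permille, alphabet_size=20):
    """Label a value relative to the uniform expectation (assumed rule)."""
    expected = uniform_expectation_permille(alphabet_size)
    if value_permille > expected:
        return "over"
    if value_permille < expected:
        return "under"
    return "as expected"


if __name__ == "__main__":
    print(uniform_expectation_permille(20))  # 400 dipeptides
    print(uniform_expectation_permille(21))  # 441 dipeptides, Xaa included
    print(representation(13.613))            # AlaAla value from the table
    print(representation(0.06))              # CysCys value from the table
```

With 20 amino acids the expectation is 1000/400 = 2.5‰, which matches the 0.25% quoted above; with Xaa included it is 1000/441, roughly 2.27‰.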

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.613 ± 1.326
AlaCys: 0.723 ± 0.2
AlaAsp: 6.084 ± 0.659
AlaGlu: 6.264 ± 0.642
AlaPhe: 3.132 ± 0.51
AlaGly: 6.867 ± 0.695
AlaHis: 1.446 ± 0.336
AlaIle: 4.578 ± 0.581
AlaLys: 4.036 ± 0.499
AlaLeu: 8.674 ± 0.8
AlaMet: 2.47 ± 0.374
AlaAsn: 2.65 ± 0.419
AlaPro: 5.12 ± 0.829
AlaGln: 2.891 ± 0.403
AlaArg: 6.144 ± 0.555
AlaSer: 4.758 ± 0.535
AlaThr: 6.264 ± 0.702
AlaVal: 8.794 ± 0.742
AlaTrp: 1.747 ± 0.324
AlaTyr: 2.831 ± 0.38
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.904 ± 0.267
CysCys: 0.06 ± 0.066
CysAsp: 0.723 ± 0.209
CysGlu: 0.602 ± 0.177
CysPhe: 0.181 ± 0.103
CysGly: 0.663 ± 0.24
CysHis: 0.181 ± 0.101
CysIle: 0.12 ± 0.084
CysLys: 0.181 ± 0.104
CysLeu: 0.422 ± 0.187
CysMet: 0.181 ± 0.094
CysAsn: 0.241 ± 0.116
CysPro: 0.361 ± 0.149
CysGln: 0.241 ± 0.116
CysArg: 0.422 ± 0.139
CysSer: 0.361 ± 0.153
CysThr: 0.361 ± 0.161
CysVal: 0.361 ± 0.182
CysTrp: 0.181 ± 0.105
CysTyr: 0.06 ± 0.056
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.746 ± 0.609
AspCys: 0.482 ± 0.181
AspAsp: 5.06 ± 0.535
AspGlu: 3.975 ± 0.54
AspPhe: 2.47 ± 0.312
AspGly: 6.204 ± 0.7
AspHis: 1.024 ± 0.239
AspIle: 2.711 ± 0.378
AspLys: 2.409 ± 0.402
AspLeu: 6.746 ± 0.699
AspMet: 1.446 ± 0.307
AspAsn: 1.927 ± 0.337
AspPro: 4.939 ± 0.597
AspGln: 1.747 ± 0.379
AspArg: 3.915 ± 0.536
AspSer: 3.192 ± 0.55
AspThr: 3.795 ± 0.431
AspVal: 4.277 ± 0.468
AspTrp: 1.988 ± 0.344
AspTyr: 1.867 ± 0.316
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.903 ± 0.74
GluCys: 0.422 ± 0.184
GluAsp: 4.758 ± 0.595
GluGlu: 5.301 ± 0.539
GluPhe: 2.229 ± 0.411
GluGly: 4.337 ± 0.495
GluHis: 1.265 ± 0.283
GluIle: 3.554 ± 0.48
GluLys: 2.951 ± 0.471
GluLeu: 6.867 ± 0.583
GluMet: 1.807 ± 0.31
GluAsn: 1.747 ± 0.365
GluPro: 2.53 ± 0.503
GluGln: 2.409 ± 0.421
GluArg: 3.855 ± 0.509
GluSer: 3.614 ± 0.432
GluThr: 4.277 ± 0.576
GluVal: 5.542 ± 0.609
GluTrp: 1.446 ± 0.375
GluTyr: 2.229 ± 0.496
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.59 ± 0.314
PheCys: 0.422 ± 0.177
PheAsp: 2.59 ± 0.363
PheGlu: 1.988 ± 0.337
PhePhe: 0.482 ± 0.158
PheGly: 3.313 ± 0.481
PheHis: 0.723 ± 0.287
PheIle: 1.265 ± 0.282
PheLys: 1.265 ± 0.301
PheLeu: 2.53 ± 0.45
PheMet: 0.602 ± 0.214
PheAsn: 1.205 ± 0.272
PhePro: 1.385 ± 0.304
PheGln: 0.663 ± 0.181
PheArg: 1.988 ± 0.313
PheSer: 1.988 ± 0.465
PheThr: 2.108 ± 0.363
PheVal: 1.927 ± 0.328
PheTrp: 0.602 ± 0.187
PheTyr: 0.964 ± 0.237
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.168 ± 1.038
GlyCys: 0.663 ± 0.24
GlyAsp: 5.722 ± 0.513
GlyGlu: 5.12 ± 0.613
GlyPhe: 2.65 ± 0.465
GlyGly: 9.216 ± 2.058
GlyHis: 1.867 ± 0.341
GlyIle: 4.457 ± 0.776
GlyLys: 3.975 ± 0.465
GlyLeu: 7.349 ± 0.889
GlyMet: 1.747 ± 0.297
GlyAsn: 3.433 ± 0.62
GlyPro: 3.313 ± 0.524
GlyGln: 2.409 ± 0.368
GlyArg: 4.518 ± 0.556
GlySer: 5.361 ± 0.62
GlyThr: 4.638 ± 0.548
GlyVal: 5.782 ± 0.606
GlyTrp: 2.711 ± 0.419
GlyTyr: 2.831 ± 0.391
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.687 ± 0.393
HisCys: 0.181 ± 0.123
HisAsp: 1.385 ± 0.274
HisGlu: 1.626 ± 0.346
HisPhe: 0.602 ± 0.196
HisGly: 1.626 ± 0.358
HisHis: 0.542 ± 0.192
HisIle: 0.904 ± 0.243
HisLys: 0.964 ± 0.298
HisLeu: 1.325 ± 0.325
HisMet: 0.06 ± 0.062
HisAsn: 0.361 ± 0.133
HisPro: 1.325 ± 0.288
HisGln: 0.964 ± 0.241
HisArg: 1.205 ± 0.262
HisSer: 0.542 ± 0.19
HisThr: 1.144 ± 0.315
HisVal: 1.807 ± 0.322
HisTrp: 0.422 ± 0.152
HisTyr: 0.723 ± 0.21
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.144 ± 0.782
IleCys: 0.301 ± 0.153
IleAsp: 3.132 ± 0.366
IleGlu: 3.674 ± 0.459
IlePhe: 1.024 ± 0.251
IleGly: 3.915 ± 0.579
IleHis: 0.843 ± 0.245
IleIle: 1.626 ± 0.309
IleLys: 1.506 ± 0.284
IleLeu: 3.494 ± 0.403
IleMet: 0.783 ± 0.204
IleAsn: 2.048 ± 0.402
IlePro: 3.072 ± 0.408
IleGln: 1.747 ± 0.404
IleArg: 3.795 ± 0.537
IleSer: 3.373 ± 0.416
IleThr: 3.192 ± 0.471
IleVal: 3.192 ± 0.64
IleTrp: 0.723 ± 0.221
IleTyr: 1.807 ± 0.285
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.373 ± 0.461
LysCys: 0.301 ± 0.133
LysAsp: 2.289 ± 0.4
LysGlu: 2.048 ± 0.393
LysPhe: 1.446 ± 0.286
LysGly: 2.349 ± 0.404
LysHis: 1.325 ± 0.331
LysIle: 2.47 ± 0.51
LysLys: 2.168 ± 0.486
LysLeu: 3.975 ± 0.502
LysMet: 1.024 ± 0.201
LysAsn: 1.626 ± 0.291
LysPro: 2.59 ± 0.404
LysGln: 1.807 ± 0.386
LysArg: 3.253 ± 0.508
LysSer: 2.771 ± 0.39
LysThr: 2.229 ± 0.375
LysVal: 3.373 ± 0.459
LysTrp: 0.663 ± 0.234
LysTyr: 1.205 ± 0.308
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.457 ± 0.888
LeuCys: 0.361 ± 0.202
LeuAsp: 6.204 ± 0.625
LeuGlu: 5.903 ± 0.651
LeuPhe: 1.988 ± 0.354
LeuGly: 7.65 ± 0.894
LeuHis: 1.747 ± 0.324
LeuIle: 4.518 ± 0.488
LeuLys: 4.096 ± 0.447
LeuLeu: 5.421 ± 0.536
LeuMet: 1.927 ± 0.338
LeuAsn: 2.53 ± 0.4
LeuPro: 5.361 ± 0.646
LeuGln: 2.409 ± 0.451
LeuArg: 5.963 ± 0.583
LeuSer: 5.722 ± 0.592
LeuThr: 6.505 ± 0.53
LeuVal: 4.337 ± 0.539
LeuTrp: 1.144 ± 0.31
LeuTyr: 2.65 ± 0.435
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.409 ± 0.291
MetCys: 0.0 ± 0.0
MetAsp: 1.205 ± 0.305
MetGlu: 1.446 ± 0.329
MetPhe: 0.602 ± 0.177
MetGly: 1.566 ± 0.323
MetHis: 0.241 ± 0.12
MetIle: 0.904 ± 0.231
MetLys: 1.024 ± 0.225
MetLeu: 1.024 ± 0.219
MetMet: 0.12 ± 0.074
MetAsn: 1.144 ± 0.234
MetPro: 0.964 ± 0.249
MetGln: 0.602 ± 0.152
MetArg: 1.265 ± 0.267
MetSer: 2.168 ± 0.422
MetThr: 2.59 ± 0.316
MetVal: 1.205 ± 0.281
MetTrp: 0.241 ± 0.111
MetTyr: 0.542 ± 0.227
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.192 ± 0.462
AsnCys: 0.241 ± 0.192
AsnAsp: 1.927 ± 0.326
AsnGlu: 1.506 ± 0.288
AsnPhe: 0.964 ± 0.267
AsnGly: 3.614 ± 0.546
AsnHis: 0.602 ± 0.181
AsnIle: 1.687 ± 0.351
AsnLys: 0.723 ± 0.241
AsnLeu: 2.409 ± 0.375
AsnMet: 0.542 ± 0.143
AsnAsn: 1.024 ± 0.302
AsnPro: 2.951 ± 0.373
AsnGln: 1.024 ± 0.248
AsnArg: 1.325 ± 0.323
AsnSer: 1.927 ± 0.401
AsnThr: 1.687 ± 0.327
AsnVal: 2.47 ± 0.446
AsnTrp: 0.663 ± 0.179
AsnTyr: 1.265 ± 0.306
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.542 ± 0.713
ProCys: 0.542 ± 0.205
ProAsp: 4.216 ± 0.436
ProGlu: 4.397 ± 0.574
ProPhe: 2.108 ± 0.367
ProGly: 4.999 ± 0.643
ProHis: 0.783 ± 0.201
ProIle: 2.47 ± 0.458
ProLys: 1.927 ± 0.308
ProLeu: 4.457 ± 0.517
ProMet: 0.964 ± 0.272
ProAsn: 1.446 ± 0.327
ProPro: 2.891 ± 0.484
ProGln: 1.385 ± 0.313
ProArg: 2.59 ± 0.442
ProSer: 3.554 ± 0.432
ProThr: 3.795 ± 0.546
ProVal: 3.975 ± 0.494
ProTrp: 0.904 ± 0.334
ProTyr: 1.385 ± 0.35
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.012 ± 0.452
GlnCys: 0.06 ± 0.068
GlnAsp: 1.385 ± 0.326
GlnGlu: 1.867 ± 0.346
GlnPhe: 1.084 ± 0.237
GlnGly: 2.711 ± 0.353
GlnHis: 0.663 ± 0.182
GlnIle: 2.951 ± 0.544
GlnLys: 1.084 ± 0.278
GlnLeu: 3.674 ± 0.4
GlnMet: 0.843 ± 0.229
GlnAsn: 0.422 ± 0.138
GlnPro: 2.048 ± 0.447
GlnGln: 1.807 ± 0.395
GlnArg: 1.927 ± 0.365
GlnSer: 1.446 ± 0.306
GlnThr: 1.747 ± 0.263
GlnVal: 2.711 ± 0.379
GlnTrp: 0.542 ± 0.155
GlnTyr: 0.542 ± 0.163
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.361 ± 0.676
ArgCys: 0.723 ± 0.23
ArgAsp: 3.614 ± 0.535
ArgGlu: 4.518 ± 0.666
ArgPhe: 1.867 ± 0.341
ArgGly: 4.638 ± 0.603
ArgHis: 1.084 ± 0.265
ArgIle: 3.253 ± 0.497
ArgLys: 3.313 ± 0.549
ArgLeu: 6.385 ± 0.747
ArgMet: 1.988 ± 0.34
ArgAsn: 2.409 ± 0.411
ArgPro: 2.47 ± 0.337
ArgGln: 1.807 ± 0.297
ArgArg: 5.301 ± 0.658
ArgSer: 3.855 ± 0.533
ArgThr: 3.192 ± 0.552
ArgVal: 4.638 ± 0.523
ArgTrp: 1.265 ± 0.247
ArgTyr: 1.446 ± 0.261
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.662 ± 0.663
SerCys: 0.482 ± 0.181
SerAsp: 3.192 ± 0.475
SerGlu: 4.337 ± 0.645
SerPhe: 1.927 ± 0.395
SerGly: 5.843 ± 0.758
SerHis: 1.385 ± 0.273
SerIle: 2.65 ± 0.409
SerLys: 2.65 ± 0.431
SerLeu: 5.662 ± 0.659
SerMet: 1.626 ± 0.397
SerAsn: 1.988 ± 0.363
SerPro: 3.012 ± 0.475
SerGln: 2.168 ± 0.289
SerArg: 3.192 ± 0.441
SerSer: 3.253 ± 0.545
SerThr: 2.891 ± 0.454
SerVal: 3.674 ± 0.435
SerTrp: 1.385 ± 0.338
SerTyr: 1.385 ± 0.319
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.084 ± 0.798
ThrCys: 0.181 ± 0.092
ThrAsp: 4.578 ± 0.487
ThrGlu: 4.337 ± 0.591
ThrPhe: 2.229 ± 0.406
ThrGly: 6.686 ± 0.703
ThrHis: 1.024 ± 0.262
ThrIle: 2.831 ± 0.577
ThrLys: 2.711 ± 0.346
ThrLeu: 6.204 ± 0.626
ThrMet: 0.723 ± 0.197
ThrAsn: 1.446 ± 0.304
ThrPro: 3.674 ± 0.512
ThrGln: 1.747 ± 0.323
ThrArg: 3.192 ± 0.533
ThrSer: 3.313 ± 0.476
ThrThr: 4.518 ± 0.602
ThrVal: 5.662 ± 0.621
ThrTrp: 1.084 ± 0.298
ThrTyr: 2.048 ± 0.397
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.325 ± 0.635
ValCys: 0.241 ± 0.116
ValAsp: 5.903 ± 0.564
ValGlu: 5.06 ± 0.569
ValPhe: 2.108 ± 0.332
ValGly: 4.578 ± 0.62
ValHis: 1.687 ± 0.339
ValIle: 3.554 ± 0.44
ValLys: 3.614 ± 0.567
ValLeu: 4.819 ± 0.571
ValMet: 1.265 ± 0.321
ValAsn: 2.409 ± 0.388
ValPro: 3.855 ± 0.45
ValGln: 2.53 ± 0.39
ValArg: 5.481 ± 0.608
ValSer: 4.879 ± 0.474
ValThr: 5.843 ± 0.661
ValVal: 5.18 ± 0.69
ValTrp: 1.144 ± 0.237
ValTyr: 2.229 ± 0.404
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.506 ± 0.367
TrpCys: 0.181 ± 0.106
TrpAsp: 1.446 ± 0.318
TrpGlu: 0.964 ± 0.229
TrpPhe: 0.843 ± 0.253
TrpGly: 1.807 ± 0.341
TrpHis: 0.482 ± 0.163
TrpIle: 1.205 ± 0.232
TrpLys: 0.361 ± 0.194
TrpLeu: 2.048 ± 0.34
TrpMet: 0.422 ± 0.207
TrpAsn: 0.422 ± 0.184
TrpPro: 0.783 ± 0.238
TrpGln: 0.964 ± 0.225
TrpArg: 1.205 ± 0.335
TrpSer: 1.084 ± 0.282
TrpThr: 1.566 ± 0.309
TrpVal: 1.867 ± 0.304
TrpTrp: 0.663 ± 0.256
TrpTyr: 0.422 ± 0.161
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.409 ± 0.384
TyrCys: 0.241 ± 0.132
TyrAsp: 1.506 ± 0.314
TyrGlu: 2.108 ± 0.342
TyrPhe: 0.663 ± 0.202
TyrGly: 2.349 ± 0.436
TyrHis: 0.482 ± 0.166
TyrIle: 1.687 ± 0.321
TyrLys: 1.325 ± 0.287
TyrLeu: 2.409 ± 0.369
TyrMet: 0.663 ± 0.177
TyrAsn: 1.265 ± 0.336
TyrPro: 1.687 ± 0.367
TyrGln: 1.205 ± 0.328
TyrArg: 2.59 ± 0.457
TyrSer: 1.265 ± 0.26
TyrThr: 1.927 ± 0.379
TyrVal: 1.927 ± 0.373
TyrTrp: 0.723 ± 0.276
TyrTyr: 0.663 ± 0.193
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 101 proteins (16603 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
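
Bootstrapping at the protein level can be sketched roughly as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported. This is an illustrative sketch only; the function names, the fixed seed, the use of one-letter codes, and the choice of the sample standard deviation as the reported error are assumptions, not details confirmed by the database.

```python
import random
import statistics


def dipeptide_permille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    total = 0
    hits = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins.

    Assumption: the error is the sample standard deviation over resamples.
    """
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_resamples):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MAAGL", "MKAAV", "MGLSA", "MAALA"]  # made-up sequences
    print(dipeptide_permille(toy_proteome, "AA"))
    print(bootstrap_error(toy_proteome, "AA"))
```

Resampling whole proteins rather than individual residues keeps the dipeptides of each protein together, so the estimate reflects variation between proteins.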

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski