Amino acid dipepetide frequency for Mycobacterium phage Terror

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
19.316AlaAla: 19.316 ± 1.954
0.898AlaCys: 0.898 ± 0.237
8.235AlaAsp: 8.235 ± 0.958
8.61AlaGlu: 8.61 ± 1.136
4.193AlaPhe: 4.193 ± 0.755
14.899AlaGly: 14.899 ± 1.624
1.947AlaHis: 1.947 ± 0.456
6.064AlaIle: 6.064 ± 0.854
5.241AlaLys: 5.241 ± 0.803
9.808AlaLeu: 9.808 ± 0.934
3.07AlaMet: 3.07 ± 0.51
3.743AlaAsn: 3.743 ± 0.569
7.636AlaPro: 7.636 ± 1.174
4.642AlaGln: 4.642 ± 0.451
7.936AlaArg: 7.936 ± 1.06
5.465AlaSer: 5.465 ± 1.002
7.636AlaThr: 7.636 ± 0.731
9.209AlaVal: 9.209 ± 0.737
2.321AlaTrp: 2.321 ± 0.402
1.647AlaTyr: 1.647 ± 0.344
0.0AlaXaa: 0.0 ± 0.0
Cys
0.524CysAla: 0.524 ± 0.191
0.075CysCys: 0.075 ± 0.078
0.299CysAsp: 0.299 ± 0.149
0.599CysGlu: 0.599 ± 0.238
0.15CysPhe: 0.15 ± 0.088
1.872CysGly: 1.872 ± 0.519
0.15CysHis: 0.15 ± 0.114
0.374CysIle: 0.374 ± 0.152
0.15CysLys: 0.15 ± 0.108
0.225CysLeu: 0.225 ± 0.138
0.075CysMet: 0.075 ± 0.067
0.225CysAsn: 0.225 ± 0.134
0.674CysPro: 0.674 ± 0.31
0.524CysGln: 0.524 ± 0.184
1.048CysArg: 1.048 ± 0.289
0.449CysSer: 0.449 ± 0.212
0.225CysThr: 0.225 ± 0.136
0.449CysVal: 0.449 ± 0.17
0.374CysTrp: 0.374 ± 0.193
0.075CysTyr: 0.075 ± 0.077
0.0CysXaa: 0.0 ± 0.0
Asp
8.61AspAla: 8.61 ± 0.867
0.524AspCys: 0.524 ± 0.281
5.615AspAsp: 5.615 ± 0.771
4.866AspGlu: 4.866 ± 0.661
2.545AspPhe: 2.545 ± 0.423
8.31AspGly: 8.31 ± 0.857
0.674AspHis: 0.674 ± 0.191
1.797AspIle: 1.797 ± 0.423
2.021AspLys: 2.021 ± 0.48
5.091AspLeu: 5.091 ± 0.651
1.198AspMet: 1.198 ± 0.315
1.422AspAsn: 1.422 ± 0.336
4.791AspPro: 4.791 ± 0.821
3.144AspGln: 3.144 ± 0.54
4.492AspArg: 4.492 ± 0.682
2.545AspSer: 2.545 ± 0.388
3.594AspThr: 3.594 ± 0.461
4.267AspVal: 4.267 ± 0.519
1.647AspTrp: 1.647 ± 0.31
2.171AspTyr: 2.171 ± 0.45
0.0AspXaa: 0.0 ± 0.0
Glu
6.439GluAla: 6.439 ± 0.938
0.449GluCys: 0.449 ± 0.18
4.417GluAsp: 4.417 ± 0.57
3.369GluGlu: 3.369 ± 0.605
1.722GluPhe: 1.722 ± 0.487
3.818GluGly: 3.818 ± 0.436
2.92GluHis: 2.92 ± 0.486
2.096GluIle: 2.096 ± 0.419
1.497GluLys: 1.497 ± 0.345
5.091GluLeu: 5.091 ± 0.715
1.123GluMet: 1.123 ± 0.269
2.246GluAsn: 2.246 ± 0.491
3.818GluPro: 3.818 ± 0.906
2.62GluGln: 2.62 ± 0.472
3.594GluArg: 3.594 ± 0.901
2.321GluSer: 2.321 ± 0.5
2.77GluThr: 2.77 ± 0.531
4.417GluVal: 4.417 ± 0.538
1.348GluTrp: 1.348 ± 0.335
0.449GluTyr: 0.449 ± 0.161
0.0GluXaa: 0.0 ± 0.0
Phe
3.144PheAla: 3.144 ± 0.384
0.674PheCys: 0.674 ± 0.232
3.144PheAsp: 3.144 ± 0.536
2.321PheGlu: 2.321 ± 0.598
0.749PhePhe: 0.749 ± 0.244
3.668PheGly: 3.668 ± 0.707
0.449PheHis: 0.449 ± 0.192
0.749PheIle: 0.749 ± 0.237
0.898PheLys: 0.898 ± 0.222
2.396PheLeu: 2.396 ± 0.468
0.524PheMet: 0.524 ± 0.212
1.123PheAsn: 1.123 ± 0.333
1.497PhePro: 1.497 ± 0.405
0.674PheGln: 0.674 ± 0.202
1.497PheArg: 1.497 ± 0.382
2.171PheSer: 2.171 ± 0.365
1.273PheThr: 1.273 ± 0.341
2.096PheVal: 2.096 ± 0.357
0.374PheTrp: 0.374 ± 0.174
0.449PheTyr: 0.449 ± 0.165
0.0PheXaa: 0.0 ± 0.0
Gly
11.155GlyAla: 11.155 ± 1.163
0.824GlyCys: 0.824 ± 0.214
6.963GlyAsp: 6.963 ± 0.946
4.791GlyGlu: 4.791 ± 0.397
3.144GlyPhe: 3.144 ± 0.39
7.487GlyGly: 7.487 ± 1.403
2.321GlyHis: 2.321 ± 0.407
4.866GlyIle: 4.866 ± 0.724
4.717GlyLys: 4.717 ± 0.712
7.711GlyLeu: 7.711 ± 1.041
1.572GlyMet: 1.572 ± 0.311
3.294GlyAsn: 3.294 ± 0.522
4.717GlyPro: 4.717 ± 0.797
2.096GlyGln: 2.096 ± 0.473
5.69GlyArg: 5.69 ± 0.68
5.465GlySer: 5.465 ± 0.723
5.69GlyThr: 5.69 ± 0.809
5.241GlyVal: 5.241 ± 0.636
2.021GlyTrp: 2.021 ± 0.311
2.396GlyTyr: 2.396 ± 0.432
0.0GlyXaa: 0.0 ± 0.0
His
2.471HisAla: 2.471 ± 0.557
0.299HisCys: 0.299 ± 0.157
1.348HisAsp: 1.348 ± 0.351
0.674HisGlu: 0.674 ± 0.219
0.449HisPhe: 0.449 ± 0.216
1.872HisGly: 1.872 ± 0.339
0.599HisHis: 0.599 ± 0.199
1.198HisIle: 1.198 ± 0.278
0.749HisLys: 0.749 ± 0.207
1.348HisLeu: 1.348 ± 0.299
0.674HisMet: 0.674 ± 0.193
0.749HisAsn: 0.749 ± 0.24
1.872HisPro: 1.872 ± 0.359
0.898HisGln: 0.898 ± 0.242
1.947HisArg: 1.947 ± 0.406
0.449HisSer: 0.449 ± 0.196
1.273HisThr: 1.273 ± 0.302
1.422HisVal: 1.422 ± 0.302
0.674HisTrp: 0.674 ± 0.21
0.299HisTyr: 0.299 ± 0.144
0.0HisXaa: 0.0 ± 0.0
Ile
6.738IleAla: 6.738 ± 0.755
0.524IleCys: 0.524 ± 0.169
3.07IleAsp: 3.07 ± 0.536
2.995IleGlu: 2.995 ± 0.604
0.898IlePhe: 0.898 ± 0.185
3.594IleGly: 3.594 ± 0.727
0.898IleHis: 0.898 ± 0.238
1.273IleIle: 1.273 ± 0.334
1.422IleLys: 1.422 ± 0.475
2.77IleLeu: 2.77 ± 0.439
0.674IleMet: 0.674 ± 0.294
1.198IleAsn: 1.198 ± 0.305
4.118IlePro: 4.118 ± 0.71
1.048IleGln: 1.048 ± 0.256
3.519IleArg: 3.519 ± 0.451
1.722IleSer: 1.722 ± 0.43
3.07IleThr: 3.07 ± 0.556
2.92IleVal: 2.92 ± 0.556
0.973IleTrp: 0.973 ± 0.25
1.048IleTyr: 1.048 ± 0.272
0.0IleXaa: 0.0 ± 0.0
Lys
6.663LysAla: 6.663 ± 0.832
0.374LysCys: 0.374 ± 0.193
1.572LysAsp: 1.572 ± 0.368
0.973LysGlu: 0.973 ± 0.251
0.973LysPhe: 0.973 ± 0.303
2.096LysGly: 2.096 ± 0.46
0.674LysHis: 0.674 ± 0.215
2.246LysIle: 2.246 ± 0.395
1.872LysLys: 1.872 ± 0.447
2.77LysLeu: 2.77 ± 0.524
0.973LysMet: 0.973 ± 0.279
1.722LysAsn: 1.722 ± 0.423
3.07LysPro: 3.07 ± 0.513
1.048LysGln: 1.048 ± 0.274
2.545LysArg: 2.545 ± 0.511
2.021LysSer: 2.021 ± 0.341
2.62LysThr: 2.62 ± 0.368
1.797LysVal: 1.797 ± 0.304
0.15LysTrp: 0.15 ± 0.121
0.824LysTyr: 0.824 ± 0.281
0.0LysXaa: 0.0 ± 0.0
Leu
9.508LeuAla: 9.508 ± 0.877
0.524LeuCys: 0.524 ± 0.241
6.364LeuAsp: 6.364 ± 0.632
3.594LeuGlu: 3.594 ± 0.48
2.321LeuPhe: 2.321 ± 0.388
7.112LeuGly: 7.112 ± 1.048
1.497LeuHis: 1.497 ± 0.306
4.193LeuIle: 4.193 ± 0.497
2.995LeuLys: 2.995 ± 0.597
4.791LeuLeu: 4.791 ± 0.638
1.572LeuMet: 1.572 ± 0.333
2.246LeuAsn: 2.246 ± 0.435
4.941LeuPro: 4.941 ± 0.617
1.797LeuGln: 1.797 ± 0.526
5.091LeuArg: 5.091 ± 0.7
3.968LeuSer: 3.968 ± 0.616
4.642LeuThr: 4.642 ± 0.664
5.615LeuVal: 5.615 ± 0.821
1.048LeuTrp: 1.048 ± 0.513
1.647LeuTyr: 1.647 ± 0.389
0.0LeuXaa: 0.0 ± 0.0
Met
2.62MetAla: 2.62 ± 0.354
0.075MetCys: 0.075 ± 0.065
1.123MetAsp: 1.123 ± 0.27
0.599MetGlu: 0.599 ± 0.207
0.973MetPhe: 0.973 ± 0.286
1.947MetGly: 1.947 ± 0.373
0.599MetHis: 0.599 ± 0.238
1.198MetIle: 1.198 ± 0.249
0.599MetLys: 0.599 ± 0.177
2.096MetLeu: 2.096 ± 0.426
0.599MetMet: 0.599 ± 0.205
0.674MetAsn: 0.674 ± 0.2
0.449MetPro: 0.449 ± 0.194
0.449MetGln: 0.449 ± 0.194
1.497MetArg: 1.497 ± 0.355
2.171MetSer: 2.171 ± 0.341
2.096MetThr: 2.096 ± 0.301
1.273MetVal: 1.273 ± 0.364
0.374MetTrp: 0.374 ± 0.168
0.225MetTyr: 0.225 ± 0.113
0.0MetXaa: 0.0 ± 0.0
Asn
5.016AsnAla: 5.016 ± 0.732
0.524AsnCys: 0.524 ± 0.233
1.198AsnAsp: 1.198 ± 0.359
1.797AsnGlu: 1.797 ± 0.353
0.674AsnPhe: 0.674 ± 0.292
3.594AsnGly: 3.594 ± 0.539
0.674AsnHis: 0.674 ± 0.213
1.348AsnIle: 1.348 ± 0.371
0.599AsnLys: 0.599 ± 0.218
1.872AsnLeu: 1.872 ± 0.353
0.599AsnMet: 0.599 ± 0.208
0.749AsnAsn: 0.749 ± 0.286
3.444AsnPro: 3.444 ± 0.451
1.348AsnGln: 1.348 ± 0.278
1.947AsnArg: 1.947 ± 0.332
0.824AsnSer: 0.824 ± 0.249
1.797AsnThr: 1.797 ± 0.312
1.947AsnVal: 1.947 ± 0.623
0.374AsnTrp: 0.374 ± 0.142
0.674AsnTyr: 0.674 ± 0.179
0.0AsnXaa: 0.0 ± 0.0
Pro
11.38ProAla: 11.38 ± 1.225
0.299ProCys: 0.299 ± 0.142
5.465ProAsp: 5.465 ± 0.861
4.193ProGlu: 4.193 ± 0.811
1.497ProPhe: 1.497 ± 0.338
5.39ProGly: 5.39 ± 0.756
1.422ProHis: 1.422 ± 0.326
2.695ProIle: 2.695 ± 0.441
2.171ProLys: 2.171 ± 0.361
4.043ProLeu: 4.043 ± 0.543
0.973ProMet: 0.973 ± 0.379
1.572ProAsn: 1.572 ± 0.378
3.444ProPro: 3.444 ± 0.539
1.872ProGln: 1.872 ± 0.468
3.369ProArg: 3.369 ± 0.66
3.444ProSer: 3.444 ± 0.418
3.519ProThr: 3.519 ± 0.645
3.968ProVal: 3.968 ± 0.609
0.824ProTrp: 0.824 ± 0.244
0.973ProTyr: 0.973 ± 0.269
0.0ProXaa: 0.0 ± 0.0
Gln
3.968GlnAla: 3.968 ± 0.58
0.15GlnCys: 0.15 ± 0.108
2.096GlnAsp: 2.096 ± 0.361
1.123GlnGlu: 1.123 ± 0.24
1.048GlnPhe: 1.048 ± 0.259
2.321GlnGly: 2.321 ± 0.467
1.048GlnHis: 1.048 ± 0.34
1.797GlnIle: 1.797 ± 0.413
0.824GlnLys: 0.824 ± 0.265
3.893GlnLeu: 3.893 ± 0.53
0.824GlnMet: 0.824 ± 0.209
1.273GlnAsn: 1.273 ± 0.535
1.872GlnPro: 1.872 ± 0.439
1.722GlnGln: 1.722 ± 0.382
2.695GlnArg: 2.695 ± 0.481
2.021GlnSer: 2.021 ± 0.489
1.722GlnThr: 1.722 ± 0.281
2.171GlnVal: 2.171 ± 0.414
0.449GlnTrp: 0.449 ± 0.165
0.524GlnTyr: 0.524 ± 0.164
0.0GlnXaa: 0.0 ± 0.0
Arg
8.235ArgAla: 8.235 ± 1.112
0.824ArgCys: 0.824 ± 0.23
4.342ArgAsp: 4.342 ± 0.443
3.743ArgGlu: 3.743 ± 0.65
1.722ArgPhe: 1.722 ± 0.399
4.567ArgGly: 4.567 ± 0.521
1.273ArgHis: 1.273 ± 0.36
2.995ArgIle: 2.995 ± 0.626
2.995ArgLys: 2.995 ± 0.518
5.465ArgLeu: 5.465 ± 0.746
1.647ArgMet: 1.647 ± 0.338
2.396ArgAsn: 2.396 ± 0.459
4.118ArgPro: 4.118 ± 0.615
2.92ArgGln: 2.92 ± 0.504
7.038ArgArg: 7.038 ± 1.097
2.995ArgSer: 2.995 ± 0.542
3.893ArgThr: 3.893 ± 0.445
3.294ArgVal: 3.294 ± 0.493
1.497ArgTrp: 1.497 ± 0.371
1.422ArgTyr: 1.422 ± 0.349
0.0ArgXaa: 0.0 ± 0.0
Ser
6.663SerAla: 6.663 ± 1.344
0.374SerCys: 0.374 ± 0.164
3.444SerAsp: 3.444 ± 0.457
3.07SerGlu: 3.07 ± 0.609
1.422SerPhe: 1.422 ± 0.324
4.417SerGly: 4.417 ± 0.953
0.449SerHis: 0.449 ± 0.173
2.471SerIle: 2.471 ± 0.468
2.471SerLys: 2.471 ± 0.347
3.219SerLeu: 3.219 ± 0.618
1.722SerMet: 1.722 ± 0.323
1.497SerAsn: 1.497 ± 0.389
2.695SerPro: 2.695 ± 0.501
1.048SerGln: 1.048 ± 0.303
2.845SerArg: 2.845 ± 0.35
3.668SerSer: 3.668 ± 0.539
3.369SerThr: 3.369 ± 0.493
3.668SerVal: 3.668 ± 0.619
1.572SerTrp: 1.572 ± 0.389
1.198SerTyr: 1.198 ± 0.347
0.0SerXaa: 0.0 ± 0.0
Thr
8.385ThrAla: 8.385 ± 1.024
0.15ThrCys: 0.15 ± 0.094
3.893ThrAsp: 3.893 ± 0.529
2.995ThrGlu: 2.995 ± 0.426
2.171ThrPhe: 2.171 ± 0.399
6.139ThrGly: 6.139 ± 0.792
1.123ThrHis: 1.123 ± 0.24
2.695ThrIle: 2.695 ± 0.494
2.246ThrLys: 2.246 ± 0.349
3.818ThrLeu: 3.818 ± 0.564
1.947ThrMet: 1.947 ± 0.448
1.497ThrAsn: 1.497 ± 0.315
3.519ThrPro: 3.519 ± 0.614
1.497ThrGln: 1.497 ± 0.315
3.144ThrArg: 3.144 ± 0.513
3.818ThrSer: 3.818 ± 0.563
3.818ThrThr: 3.818 ± 0.679
5.316ThrVal: 5.316 ± 0.59
0.898ThrTrp: 0.898 ± 0.279
0.749ThrTyr: 0.749 ± 0.296
0.0ThrXaa: 0.0 ± 0.0
Val
7.636ValAla: 7.636 ± 0.692
0.824ValCys: 0.824 ± 0.284
5.166ValAsp: 5.166 ± 0.711
4.717ValGlu: 4.717 ± 0.558
2.471ValPhe: 2.471 ± 0.396
5.39ValGly: 5.39 ± 0.729
1.422ValHis: 1.422 ± 0.239
2.77ValIle: 2.77 ± 0.528
2.096ValLys: 2.096 ± 0.375
5.016ValLeu: 5.016 ± 0.552
1.048ValMet: 1.048 ± 0.299
2.171ValAsn: 2.171 ± 0.43
3.444ValPro: 3.444 ± 0.465
2.096ValGln: 2.096 ± 0.388
4.642ValArg: 4.642 ± 0.813
2.845ValSer: 2.845 ± 0.599
4.342ValThr: 4.342 ± 0.666
5.091ValVal: 5.091 ± 0.629
2.096ValTrp: 2.096 ± 0.437
1.722ValTyr: 1.722 ± 0.33
0.0ValXaa: 0.0 ± 0.0
Trp
1.947TrpAla: 1.947 ± 0.366
0.075TrpCys: 0.075 ± 0.078
0.973TrpAsp: 0.973 ± 0.24
0.524TrpGlu: 0.524 ± 0.182
0.449TrpPhe: 0.449 ± 0.174
1.497TrpGly: 1.497 ± 0.43
0.599TrpHis: 0.599 ± 0.198
0.898TrpIle: 0.898 ± 0.224
0.898TrpLys: 0.898 ± 0.263
2.471TrpLeu: 2.471 ± 0.584
0.524TrpMet: 0.524 ± 0.258
0.674TrpAsn: 0.674 ± 0.208
1.198TrpPro: 1.198 ± 0.274
1.048TrpGln: 1.048 ± 0.217
1.422TrpArg: 1.422 ± 0.289
1.572TrpSer: 1.572 ± 0.373
1.273TrpThr: 1.273 ± 0.308
0.898TrpVal: 0.898 ± 0.228
0.15TrpTrp: 0.15 ± 0.102
0.524TrpTyr: 0.524 ± 0.198
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.171TyrAla: 2.171 ± 0.462
0.075TyrCys: 0.075 ± 0.075
0.599TyrAsp: 0.599 ± 0.232
0.973TyrGlu: 0.973 ± 0.267
0.299TyrPhe: 0.299 ± 0.165
2.171TyrGly: 2.171 ± 0.427
0.524TyrHis: 0.524 ± 0.196
0.749TyrIle: 0.749 ± 0.224
0.524TyrLys: 0.524 ± 0.216
1.572TyrLeu: 1.572 ± 0.292
0.225TyrMet: 0.225 ± 0.141
0.524TyrAsn: 0.524 ± 0.2
1.198TyrPro: 1.198 ± 0.286
1.048TyrGln: 1.048 ± 0.326
1.422TyrArg: 1.422 ± 0.339
1.348TyrSer: 1.348 ± 0.373
1.198TyrThr: 1.198 ± 0.317
1.947TyrVal: 1.947 ± 0.353
0.449TyrTrp: 0.449 ± 0.157
0.075TyrTyr: 0.075 ± 0.063
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 60 proteins (13358 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski