Amino acid dipeptide frequency for Mycobacterium phage Alsfro

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
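As a minimal sketch of how such a frequency could be computed, the snippet below counts overlapping pairs of adjacent residues within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, the one-letter codes and the choice of normalizing by the total number of counted pairs are all assumptions made for illustration; the page does not state the exact procedure used by Proteome-pI.

```python
from collections import Counter


def dipeptide_frequencies(proteins):
    """Return per-mille frequencies of adjacent residue pairs.

    Pairs are counted with overlap inside each sequence and never
    across the boundary between two proteins (an assumption).
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy proteome in one-letter code (not real Alsfro data).
toy = ["MAAG", "AAL"]
freqs = dipeptide_frequencies(toy)
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.1f} per mille")
```

In this toy input the pair "GA" never appears, because the last residue of one protein is not paired with the first residue of the next.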

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility, dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
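The arithmetic behind the expected value can be sketched as follows. The three observed values are taken from the table on this page; the `classify` helper and the use of the uniform expectation as the cut-off between over- and underrepresented dipeptides are assumptions for illustration, since the page does not specify the exact threshold used for colouring.

```python
N_STANDARD = 20  # standard amino acids, Xaa excluded

# 20 * 20 = 400 possible dipeptides, so 1/400 = 0.25% = 2.5 per mille.
expected_permille = 1000.0 / (N_STANDARD ** 2)


def classify(observed_permille, expected=expected_permille):
    """Label a dipeptide relative to the uniform expectation."""
    if observed_permille > expected:
        return "over"
    if observed_permille < expected:
        return "under"
    return "as expected"


# Observed values (per mille) copied from the table.
examples = {"AlaAla": 13.755, "CysCys": 0.063, "AspLys": 2.512}
labels = {pair: classify(value) for pair, value in examples.items()}
print(expected_permille, labels)
```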

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
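A short sketch of the unit conversion, using the AlaAla value from the table as input. The helper names `permille_to_fraction` and `permille_to_percent` are hypothetical.

```python
def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_permille / 10.0


ala_ala = 13.755  # AlaAla, per mille, from the table
print(permille_to_fraction(ala_ala), permille_to_percent(ala_ala))
```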

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.755 ± 1.331
AlaCys: 0.691 ± 0.218
AlaAsp: 6.469 ± 0.671
AlaGlu: 5.904 ± 0.731
AlaPhe: 2.952 ± 0.439
AlaGly: 7.286 ± 0.768
AlaHis: 1.319 ± 0.299
AlaIle: 4.208 ± 0.55
AlaLys: 4.522 ± 0.525
AlaLeu: 8.856 ± 0.845
AlaMet: 2.638 ± 0.411
AlaAsn: 2.198 ± 0.335
AlaPro: 4.836 ± 0.572
AlaGln: 2.45 ± 0.368
AlaArg: 6.783 ± 0.665
AlaSer: 5.339 ± 0.615
AlaThr: 5.653 ± 0.72
AlaVal: 8.542 ± 0.836
AlaTrp: 2.01 ± 0.34
AlaTyr: 2.575 ± 0.415
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.691 ± 0.182
CysCys: 0.063 ± 0.068
CysAsp: 0.502 ± 0.171
CysGlu: 0.691 ± 0.185
CysPhe: 0.188 ± 0.088
CysGly: 0.502 ± 0.177
CysHis: 0.188 ± 0.103
CysIle: 0.251 ± 0.132
CysLys: 0.251 ± 0.161
CysLeu: 0.502 ± 0.206
CysMet: 0.063 ± 0.063
CysAsn: 0.314 ± 0.145
CysPro: 0.188 ± 0.092
CysGln: 0.251 ± 0.125
CysArg: 0.377 ± 0.128
CysSer: 0.502 ± 0.197
CysThr: 0.314 ± 0.129
CysVal: 0.377 ± 0.128
CysTrp: 0.251 ± 0.122
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.218 ± 0.615
AspCys: 0.502 ± 0.166
AspAsp: 4.522 ± 0.44
AspGlu: 3.706 ± 0.555
AspPhe: 2.512 ± 0.334
AspGly: 5.841 ± 0.607
AspHis: 1.193 ± 0.294
AspIle: 2.764 ± 0.448
AspLys: 2.512 ± 0.38
AspLeu: 6.972 ± 0.651
AspMet: 1.382 ± 0.255
AspAsn: 2.136 ± 0.315
AspPro: 4.774 ± 0.605
AspGln: 1.57 ± 0.384
AspArg: 3.706 ± 0.337
AspSer: 3.329 ± 0.466
AspThr: 3.58 ± 0.434
AspVal: 4.648 ± 0.509
AspTrp: 1.759 ± 0.36
AspTyr: 1.821 ± 0.263
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.967 ± 0.643
GluCys: 0.188 ± 0.142
GluAsp: 5.276 ± 0.541
GluGlu: 5.025 ± 0.698
GluPhe: 2.01 ± 0.332
GluGly: 4.774 ± 0.483
GluHis: 1.319 ± 0.31
GluIle: 3.455 ± 0.47
GluLys: 2.638 ± 0.369
GluLeu: 6.721 ± 0.593
GluMet: 1.759 ± 0.317
GluAsn: 1.633 ± 0.404
GluPro: 2.952 ± 0.52
GluGln: 3.015 ± 0.408
GluArg: 3.706 ± 0.573
GluSer: 3.455 ± 0.328
GluThr: 3.58 ± 0.476
GluVal: 5.841 ± 0.621
GluTrp: 1.696 ± 0.322
GluTyr: 2.387 ± 0.397
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.638 ± 0.45
PheCys: 0.251 ± 0.163
PheAsp: 2.701 ± 0.413
PheGlu: 2.826 ± 0.411
PhePhe: 0.691 ± 0.226
PheGly: 3.455 ± 0.486
PheHis: 0.502 ± 0.202
PheIle: 1.445 ± 0.26
PheLys: 1.131 ± 0.274
PheLeu: 2.701 ± 0.399
PheMet: 0.565 ± 0.22
PheAsn: 1.445 ± 0.322
PhePro: 1.633 ± 0.314
PheGln: 0.754 ± 0.189
PheArg: 1.884 ± 0.354
PheSer: 1.821 ± 0.278
PheThr: 2.387 ± 0.423
PheVal: 1.947 ± 0.39
PheTrp: 0.628 ± 0.165
PheTyr: 0.817 ± 0.198
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.16 ± 0.764
GlyCys: 0.817 ± 0.285
GlyAsp: 5.59 ± 0.548
GlyGlu: 4.648 ± 0.431
GlyPhe: 3.078 ± 0.467
GlyGly: 9.045 ± 2.018
GlyHis: 1.759 ± 0.335
GlyIle: 4.774 ± 0.633
GlyLys: 4.208 ± 0.489
GlyLeu: 7.851 ± 0.848
GlyMet: 1.884 ± 0.351
GlyAsn: 3.141 ± 0.367
GlyPro: 4.02 ± 0.567
GlyGln: 2.324 ± 0.339
GlyArg: 4.711 ± 0.584
GlySer: 5.339 ± 0.786
GlyThr: 4.46 ± 0.469
GlyVal: 5.527 ± 0.541
GlyTrp: 2.764 ± 0.376
GlyTyr: 2.701 ± 0.358
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.507 ± 0.332
HisCys: 0.126 ± 0.121
HisAsp: 1.131 ± 0.223
HisGlu: 1.759 ± 0.357
HisPhe: 0.565 ± 0.17
HisGly: 1.633 ± 0.386
HisHis: 0.565 ± 0.214
HisIle: 0.691 ± 0.197
HisLys: 1.256 ± 0.318
HisLeu: 1.319 ± 0.278
HisMet: 0.126 ± 0.076
HisAsn: 0.188 ± 0.094
HisPro: 1.445 ± 0.277
HisGln: 0.817 ± 0.245
HisArg: 1.445 ± 0.306
HisSer: 0.628 ± 0.185
HisThr: 1.005 ± 0.279
HisVal: 1.445 ± 0.288
HisTrp: 0.44 ± 0.128
HisTyr: 0.754 ± 0.221
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.03 ± 0.581
IleCys: 0.251 ± 0.107
IleAsp: 3.266 ± 0.373
IleGlu: 3.706 ± 0.426
IlePhe: 1.068 ± 0.255
IleGly: 4.083 ± 0.464
IleHis: 0.879 ± 0.246
IleIle: 1.633 ± 0.317
IleLys: 1.633 ± 0.266
IleLeu: 3.329 ± 0.4
IleMet: 0.691 ± 0.183
IleAsn: 1.947 ± 0.301
IlePro: 3.015 ± 0.334
IleGln: 1.319 ± 0.303
IleArg: 3.266 ± 0.48
IleSer: 3.517 ± 0.458
IleThr: 3.58 ± 0.397
IleVal: 3.455 ± 0.483
IleTrp: 0.817 ± 0.207
IleTyr: 1.884 ± 0.309
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.706 ± 0.579
LysCys: 0.314 ± 0.136
LysAsp: 2.45 ± 0.451
LysGlu: 2.324 ± 0.413
LysPhe: 1.633 ± 0.286
LysGly: 2.764 ± 0.376
LysHis: 1.131 ± 0.292
LysIle: 2.387 ± 0.444
LysLys: 2.01 ± 0.458
LysLeu: 3.455 ± 0.479
LysMet: 1.256 ± 0.221
LysAsn: 1.382 ± 0.275
LysPro: 2.952 ± 0.495
LysGln: 1.57 ± 0.376
LysArg: 2.889 ± 0.394
LysSer: 3.015 ± 0.431
LysThr: 2.324 ± 0.46
LysVal: 3.392 ± 0.456
LysTrp: 0.754 ± 0.218
LysTyr: 0.817 ± 0.234
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.542 ± 0.83
LeuCys: 0.44 ± 0.149
LeuAsp: 6.218 ± 0.657
LeuGlu: 5.59 ± 0.589
LeuPhe: 2.387 ± 0.373
LeuGly: 7.16 ± 0.706
LeuHis: 1.445 ± 0.332
LeuIle: 4.648 ± 0.509
LeuLys: 4.334 ± 0.458
LeuLeu: 5.464 ± 0.629
LeuMet: 1.759 ± 0.292
LeuAsn: 2.701 ± 0.354
LeuPro: 5.716 ± 0.609
LeuGln: 2.638 ± 0.521
LeuArg: 6.344 ± 0.509
LeuSer: 5.779 ± 0.62
LeuThr: 6.155 ± 0.492
LeuVal: 4.774 ± 0.616
LeuTrp: 0.942 ± 0.286
LeuTyr: 2.324 ± 0.432
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.387 ± 0.351
MetCys: 0.063 ± 0.054
MetAsp: 0.942 ± 0.243
MetGlu: 1.445 ± 0.35
MetPhe: 0.691 ± 0.206
MetGly: 1.319 ± 0.277
MetHis: 0.314 ± 0.132
MetIle: 0.691 ± 0.195
MetLys: 1.256 ± 0.253
MetLeu: 1.193 ± 0.266
MetMet: 0.063 ± 0.058
MetAsn: 1.382 ± 0.282
MetPro: 1.256 ± 0.256
MetGln: 0.691 ± 0.202
MetArg: 1.256 ± 0.265
MetSer: 2.261 ± 0.417
MetThr: 2.512 ± 0.264
MetVal: 1.131 ± 0.302
MetTrp: 0.251 ± 0.11
MetTyr: 0.502 ± 0.169
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.015 ± 0.443
AsnCys: 0.063 ± 0.065
AsnAsp: 1.696 ± 0.341
AsnGlu: 1.821 ± 0.386
AsnPhe: 1.193 ± 0.264
AsnGly: 3.706 ± 0.502
AsnHis: 0.817 ± 0.189
AsnIle: 1.57 ± 0.316
AsnLys: 0.879 ± 0.242
AsnLeu: 2.261 ± 0.329
AsnMet: 0.565 ± 0.162
AsnAsn: 0.754 ± 0.195
AsnPro: 2.764 ± 0.38
AsnGln: 1.068 ± 0.25
AsnArg: 1.57 ± 0.374
AsnSer: 1.821 ± 0.363
AsnThr: 2.01 ± 0.366
AsnVal: 2.324 ± 0.349
AsnTrp: 0.565 ± 0.171
AsnTyr: 1.193 ± 0.292
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.59 ± 0.593
ProCys: 0.44 ± 0.143
ProAsp: 4.46 ± 0.472
ProGlu: 4.46 ± 0.532
ProPhe: 2.198 ± 0.399
ProGly: 5.15 ± 0.53
ProHis: 1.068 ± 0.25
ProIle: 2.261 ± 0.4
ProLys: 2.073 ± 0.279
ProLeu: 4.208 ± 0.538
ProMet: 0.942 ± 0.281
ProAsn: 1.507 ± 0.286
ProPro: 2.764 ± 0.415
ProGln: 1.319 ± 0.304
ProArg: 2.764 ± 0.451
ProSer: 3.831 ± 0.467
ProThr: 3.831 ± 0.568
ProVal: 4.083 ± 0.443
ProTrp: 0.879 ± 0.285
ProTyr: 1.57 ± 0.318
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.889 ± 0.388
GlnCys: 0.063 ± 0.054
GlnAsp: 1.507 ± 0.279
GlnGlu: 1.884 ± 0.352
GlnPhe: 0.942 ± 0.242
GlnGly: 2.45 ± 0.352
GlnHis: 0.565 ± 0.16
GlnIle: 2.764 ± 0.504
GlnLys: 1.319 ± 0.287
GlnLeu: 3.706 ± 0.447
GlnMet: 1.005 ± 0.275
GlnAsn: 0.44 ± 0.141
GlnPro: 1.57 ± 0.367
GlnGln: 1.759 ± 0.41
GlnArg: 1.821 ± 0.394
GlnSer: 1.445 ± 0.26
GlnThr: 1.821 ± 0.316
GlnVal: 2.638 ± 0.355
GlnTrp: 0.565 ± 0.164
GlnTyr: 0.754 ± 0.23
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.841 ± 0.724
ArgCys: 0.754 ± 0.214
ArgAsp: 2.889 ± 0.394
ArgGlu: 4.774 ± 0.569
ArgPhe: 2.01 ± 0.338
ArgGly: 4.899 ± 0.654
ArgHis: 0.942 ± 0.227
ArgIle: 3.266 ± 0.579
ArgLys: 3.078 ± 0.495
ArgLeu: 6.03 ± 0.65
ArgMet: 2.198 ± 0.317
ArgAsn: 2.261 ± 0.408
ArgPro: 2.324 ± 0.36
ArgGln: 1.947 ± 0.329
ArgArg: 5.276 ± 0.686
ArgSer: 3.706 ± 0.495
ArgThr: 2.701 ± 0.519
ArgVal: 4.836 ± 0.526
ArgTrp: 1.507 ± 0.351
ArgTyr: 1.696 ± 0.299
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.653 ± 0.636
SerCys: 0.44 ± 0.182
SerAsp: 3.203 ± 0.416
SerGlu: 4.145 ± 0.48
SerPhe: 2.073 ± 0.45
SerGly: 6.469 ± 0.721
SerHis: 1.382 ± 0.341
SerIle: 2.575 ± 0.352
SerLys: 2.198 ± 0.344
SerLeu: 5.088 ± 0.466
SerMet: 1.57 ± 0.277
SerAsn: 2.073 ± 0.37
SerPro: 3.517 ± 0.515
SerGln: 2.261 ± 0.327
SerArg: 2.638 ± 0.345
SerSer: 4.02 ± 0.743
SerThr: 3.517 ± 0.507
SerVal: 3.894 ± 0.571
SerTrp: 1.382 ± 0.325
SerTyr: 1.382 ± 0.294
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.281 ± 0.755
ThrCys: 0.314 ± 0.148
ThrAsp: 4.208 ± 0.486
ThrGlu: 4.334 ± 0.475
ThrPhe: 2.261 ± 0.413
ThrGly: 6.155 ± 0.542
ThrHis: 1.382 ± 0.332
ThrIle: 3.266 ± 0.584
ThrLys: 2.512 ± 0.394
ThrLeu: 6.03 ± 0.755
ThrMet: 0.817 ± 0.216
ThrAsn: 1.633 ± 0.302
ThrPro: 3.266 ± 0.459
ThrGln: 1.507 ± 0.292
ThrArg: 3.392 ± 0.481
ThrSer: 3.015 ± 0.622
ThrThr: 3.894 ± 0.604
ThrVal: 4.899 ± 0.613
ThrTrp: 1.005 ± 0.281
ThrTyr: 2.073 ± 0.422
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.846 ± 0.737
ValCys: 0.314 ± 0.111
ValAsp: 5.464 ± 0.535
ValGlu: 5.15 ± 0.53
ValPhe: 2.01 ± 0.308
ValGly: 4.962 ± 0.661
ValHis: 1.131 ± 0.262
ValIle: 4.271 ± 0.517
ValLys: 3.015 ± 0.399
ValLeu: 5.464 ± 0.55
ValMet: 1.131 ± 0.265
ValAsn: 2.952 ± 0.365
ValPro: 4.02 ± 0.509
ValGln: 2.512 ± 0.419
ValArg: 5.088 ± 0.686
ValSer: 4.397 ± 0.443
ValThr: 5.464 ± 0.561
ValVal: 5.213 ± 0.641
ValTrp: 1.507 ± 0.275
ValTyr: 2.198 ± 0.424
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.759 ± 0.352
TrpCys: 0.188 ± 0.093
TrpAsp: 1.57 ± 0.295
TrpGlu: 1.131 ± 0.248
TrpPhe: 1.005 ± 0.239
TrpGly: 1.759 ± 0.334
TrpHis: 0.377 ± 0.165
TrpIle: 1.005 ± 0.221
TrpLys: 0.314 ± 0.132
TrpLeu: 1.821 ± 0.319
TrpMet: 0.314 ± 0.143
TrpAsn: 0.377 ± 0.15
TrpPro: 0.754 ± 0.232
TrpGln: 0.942 ± 0.231
TrpArg: 1.382 ± 0.334
TrpSer: 1.068 ± 0.26
TrpThr: 1.696 ± 0.35
TrpVal: 2.136 ± 0.353
TrpTrp: 0.565 ± 0.189
TrpTyr: 0.502 ± 0.17
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.512 ± 0.412
TyrCys: 0.188 ± 0.135
TyrAsp: 1.382 ± 0.31
TyrGlu: 2.073 ± 0.343
TyrPhe: 0.754 ± 0.211
TyrGly: 2.387 ± 0.395
TyrHis: 0.565 ± 0.185
TyrIle: 1.445 ± 0.346
TyrLys: 1.382 ± 0.279
TyrLeu: 2.387 ± 0.438
TyrMet: 0.817 ± 0.215
TyrAsn: 1.193 ± 0.317
TyrPro: 1.633 ± 0.287
TyrGln: 1.193 ± 0.284
TyrArg: 2.701 ± 0.454
TyrSer: 1.193 ± 0.22
TyrThr: 1.759 ± 0.385
TyrVal: 2.01 ± 0.355
TyrTrp: 0.377 ± 0.155
TyrTyr: 0.628 ± 0.189
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (15922 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
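A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is taken as the error. The function names, the toy sequences, the fixed seed and the use of the sample standard deviation as the error measure are assumptions for illustration; the page only states that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch repeatable
    estimates = []
    for _ in range(n_rounds):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.stdev(estimates)


# Hypothetical toy proteome in one-letter code (not real Alsfro data).
toy = ["MAAGL", "AALKV", "MKVAA", "GLAAG"]
err = bootstrap_error(toy, "AA")
print(f"AA: {pair_permille(toy, 'AA'):.1f} ± {err:.1f} per mille")
```

Resampling whole proteins rather than individual residues keeps each sequence intact, so the replicates reflect variation between proteins.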

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski