Amino acid dipepetide frequency for Mycobacterium phage Muddy

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
9.825AlaAla: 9.825 ± 1.778
0.702AlaCys: 0.702 ± 0.21
5.869AlaAsp: 5.869 ± 0.646
6.124AlaGlu: 6.124 ± 0.675
3.126AlaPhe: 3.126 ± 0.519
7.974AlaGly: 7.974 ± 1.171
1.595AlaHis: 1.595 ± 0.347
4.083AlaIle: 4.083 ± 0.818
5.231AlaLys: 5.231 ± 0.774
8.102AlaLeu: 8.102 ± 0.77
2.233AlaMet: 2.233 ± 0.397
2.233AlaAsn: 2.233 ± 0.32
4.274AlaPro: 4.274 ± 0.599
2.616AlaGln: 2.616 ± 0.375
6.061AlaArg: 6.061 ± 0.65
4.211AlaSer: 4.211 ± 0.534
5.231AlaThr: 5.231 ± 0.677
5.614AlaVal: 5.614 ± 0.595
1.786AlaTrp: 1.786 ± 0.37
2.041AlaTyr: 2.041 ± 0.461
0.0AlaXaa: 0.0 ± 0.0
Cys
0.766CysAla: 0.766 ± 0.194
0.0CysCys: 0.0 ± 0.0
0.51CysAsp: 0.51 ± 0.193
0.447CysGlu: 0.447 ± 0.237
0.51CysPhe: 0.51 ± 0.2
1.212CysGly: 1.212 ± 0.328
0.255CysHis: 0.255 ± 0.152
0.51CysIle: 0.51 ± 0.165
0.51CysLys: 0.51 ± 0.196
0.766CysLeu: 0.766 ± 0.213
0.064CysMet: 0.064 ± 0.068
0.447CysAsn: 0.447 ± 0.196
0.829CysPro: 0.829 ± 0.244
0.128CysGln: 0.128 ± 0.087
0.893CysArg: 0.893 ± 0.244
0.255CysSer: 0.255 ± 0.111
0.638CysThr: 0.638 ± 0.26
0.51CysVal: 0.51 ± 0.172
0.255CysTrp: 0.255 ± 0.138
0.319CysTyr: 0.319 ± 0.139
0.0CysXaa: 0.0 ± 0.0
Asp
6.188AspAla: 6.188 ± 0.692
0.383AspCys: 0.383 ± 0.155
3.955AspAsp: 3.955 ± 0.684
5.55AspGlu: 5.55 ± 0.669
1.914AspPhe: 1.914 ± 0.379
5.359AspGly: 5.359 ± 0.489
1.404AspHis: 1.404 ± 0.274
2.807AspIle: 2.807 ± 0.383
2.679AspLys: 2.679 ± 0.384
7.145AspLeu: 7.145 ± 0.733
1.212AspMet: 1.212 ± 0.266
1.404AspAsn: 1.404 ± 0.309
4.53AspPro: 4.53 ± 0.634
2.424AspGln: 2.424 ± 0.419
2.998AspArg: 2.998 ± 0.475
3.062AspSer: 3.062 ± 0.51
2.998AspThr: 2.998 ± 0.466
3.573AspVal: 3.573 ± 0.466
1.467AspTrp: 1.467 ± 0.288
2.105AspTyr: 2.105 ± 0.484
0.0AspXaa: 0.0 ± 0.0
Glu
6.89GluAla: 6.89 ± 0.856
0.829GluCys: 0.829 ± 0.218
5.231GluAsp: 5.231 ± 0.693
6.124GluGlu: 6.124 ± 1.523
1.595GluPhe: 1.595 ± 0.344
5.104GluGly: 5.104 ± 0.402
1.467GluHis: 1.467 ± 0.331
3.062GluIle: 3.062 ± 0.524
2.998GluLys: 2.998 ± 0.528
7.273GluLeu: 7.273 ± 0.708
1.85GluMet: 1.85 ± 0.413
1.659GluAsn: 1.659 ± 0.349
2.233GluPro: 2.233 ± 0.517
2.424GluGln: 2.424 ± 0.39
4.338GluArg: 4.338 ± 0.559
2.935GluSer: 2.935 ± 0.431
3.126GluThr: 3.126 ± 0.476
3.19GluVal: 3.19 ± 0.561
2.041GluTrp: 2.041 ± 0.388
2.297GluTyr: 2.297 ± 0.47
0.0GluXaa: 0.0 ± 0.0
Phe
2.424PheAla: 2.424 ± 0.374
0.319PheCys: 0.319 ± 0.152
2.041PheAsp: 2.041 ± 0.392
1.978PheGlu: 1.978 ± 0.277
1.085PhePhe: 1.085 ± 0.308
3.445PheGly: 3.445 ± 0.616
0.51PheHis: 0.51 ± 0.171
1.722PheIle: 1.722 ± 0.246
1.786PheLys: 1.786 ± 0.361
2.233PheLeu: 2.233 ± 0.42
0.638PheMet: 0.638 ± 0.159
1.34PheAsn: 1.34 ± 0.349
1.404PhePro: 1.404 ± 0.354
1.085PheGln: 1.085 ± 0.206
2.233PheArg: 2.233 ± 0.454
1.914PheSer: 1.914 ± 0.326
2.424PheThr: 2.424 ± 0.473
2.169PheVal: 2.169 ± 0.354
0.766PheTrp: 0.766 ± 0.209
0.766PheTyr: 0.766 ± 0.254
0.0PheXaa: 0.0 ± 0.0
Gly
6.316GlyAla: 6.316 ± 0.836
0.829GlyCys: 0.829 ± 0.213
4.402GlyAsp: 4.402 ± 0.711
5.359GlyGlu: 5.359 ± 0.721
3.381GlyPhe: 3.381 ± 0.541
8.868GlyGly: 8.868 ± 1.609
1.467GlyHis: 1.467 ± 0.359
5.04GlyIle: 5.04 ± 0.578
4.657GlyLys: 4.657 ± 0.735
5.742GlyLeu: 5.742 ± 0.527
2.233GlyMet: 2.233 ± 0.353
3.764GlyAsn: 3.764 ± 0.541
3.7GlyPro: 3.7 ± 0.625
2.36GlyGln: 2.36 ± 0.399
4.721GlyArg: 4.721 ± 0.513
4.721GlySer: 4.721 ± 0.764
5.423GlyThr: 5.423 ± 0.666
6.252GlyVal: 6.252 ± 0.774
2.807GlyTrp: 2.807 ± 0.435
2.297GlyTyr: 2.297 ± 0.396
0.0GlyXaa: 0.0 ± 0.0
His
1.212HisAla: 1.212 ± 0.338
0.383HisCys: 0.383 ± 0.173
1.467HisAsp: 1.467 ± 0.298
1.404HisGlu: 1.404 ± 0.341
0.893HisPhe: 0.893 ± 0.33
0.766HisGly: 0.766 ± 0.245
0.702HisHis: 0.702 ± 0.233
1.021HisIle: 1.021 ± 0.292
1.212HisLys: 1.212 ± 0.331
2.169HisLeu: 2.169 ± 0.488
0.51HisMet: 0.51 ± 0.175
1.148HisAsn: 1.148 ± 0.315
1.531HisPro: 1.531 ± 0.381
0.638HisGln: 0.638 ± 0.265
1.531HisArg: 1.531 ± 0.293
1.212HisSer: 1.212 ± 0.284
1.148HisThr: 1.148 ± 0.266
1.085HisVal: 1.085 ± 0.244
0.702HisTrp: 0.702 ± 0.286
0.574HisTyr: 0.574 ± 0.175
0.0HisXaa: 0.0 ± 0.0
Ile
4.274IleAla: 4.274 ± 0.716
0.191IleCys: 0.191 ± 0.105
3.254IleAsp: 3.254 ± 0.399
2.743IleGlu: 2.743 ± 0.437
1.531IlePhe: 1.531 ± 0.319
3.509IleGly: 3.509 ± 0.553
0.638IleHis: 0.638 ± 0.231
2.233IleIle: 2.233 ± 0.389
2.935IleLys: 2.935 ± 0.422
2.998IleLeu: 2.998 ± 0.438
0.766IleMet: 0.766 ± 0.223
1.978IleAsn: 1.978 ± 0.464
3.317IlePro: 3.317 ± 0.431
1.531IleGln: 1.531 ± 0.326
3.445IleArg: 3.445 ± 0.413
2.616IleSer: 2.616 ± 0.33
4.083IleThr: 4.083 ± 0.488
3.573IleVal: 3.573 ± 0.438
1.148IleTrp: 1.148 ± 0.243
1.595IleTyr: 1.595 ± 0.349
0.0IleXaa: 0.0 ± 0.0
Lys
5.55LysAla: 5.55 ± 0.893
0.51LysCys: 0.51 ± 0.191
2.616LysAsp: 2.616 ± 0.41
3.573LysGlu: 3.573 ± 0.461
1.531LysPhe: 1.531 ± 0.325
3.509LysGly: 3.509 ± 0.464
1.467LysHis: 1.467 ± 0.336
2.616LysIle: 2.616 ± 0.437
3.509LysLys: 3.509 ± 0.629
4.019LysLeu: 4.019 ± 0.604
1.467LysMet: 1.467 ± 0.29
1.978LysAsn: 1.978 ± 0.355
2.616LysPro: 2.616 ± 0.387
2.552LysGln: 2.552 ± 0.407
3.828LysArg: 3.828 ± 0.534
2.807LysSer: 2.807 ± 0.421
3.19LysThr: 3.19 ± 0.486
3.828LysVal: 3.828 ± 0.659
1.212LysTrp: 1.212 ± 0.309
1.85LysTyr: 1.85 ± 0.439
0.0LysXaa: 0.0 ± 0.0
Leu
7.018LeuAla: 7.018 ± 0.603
0.702LeuCys: 0.702 ± 0.211
5.805LeuAsp: 5.805 ± 0.628
5.614LeuGlu: 5.614 ± 0.585
2.743LeuPhe: 2.743 ± 0.377
7.209LeuGly: 7.209 ± 0.822
1.786LeuHis: 1.786 ± 0.462
4.466LeuIle: 4.466 ± 0.508
3.764LeuLys: 3.764 ± 0.489
6.954LeuLeu: 6.954 ± 0.807
2.297LeuMet: 2.297 ± 0.41
2.807LeuAsn: 2.807 ± 0.446
5.167LeuPro: 5.167 ± 0.454
2.743LeuGln: 2.743 ± 0.48
5.55LeuArg: 5.55 ± 0.672
4.211LeuSer: 4.211 ± 0.583
4.848LeuThr: 4.848 ± 0.702
5.295LeuVal: 5.295 ± 0.614
1.467LeuTrp: 1.467 ± 0.262
2.743LeuTyr: 2.743 ± 0.457
0.0LeuXaa: 0.0 ± 0.0
Met
2.297MetAla: 2.297 ± 0.36
0.064MetCys: 0.064 ± 0.067
1.34MetAsp: 1.34 ± 0.295
1.659MetGlu: 1.659 ± 0.295
0.255MetPhe: 0.255 ± 0.156
1.531MetGly: 1.531 ± 0.3
0.319MetHis: 0.319 ± 0.139
0.893MetIle: 0.893 ± 0.236
1.34MetLys: 1.34 ± 0.257
1.722MetLeu: 1.722 ± 0.304
0.319MetMet: 0.319 ± 0.155
0.574MetAsn: 0.574 ± 0.176
1.404MetPro: 1.404 ± 0.306
0.829MetGln: 0.829 ± 0.237
1.978MetArg: 1.978 ± 0.353
1.85MetSer: 1.85 ± 0.315
1.978MetThr: 1.978 ± 0.382
1.404MetVal: 1.404 ± 0.313
0.447MetTrp: 0.447 ± 0.177
0.319MetTyr: 0.319 ± 0.141
0.0MetXaa: 0.0 ± 0.0
Asn
3.445AsnAla: 3.445 ± 0.716
0.447AsnCys: 0.447 ± 0.164
2.041AsnAsp: 2.041 ± 0.41
2.169AsnGlu: 2.169 ± 0.485
0.893AsnPhe: 0.893 ± 0.33
3.636AsnGly: 3.636 ± 0.483
0.893AsnHis: 0.893 ± 0.22
1.148AsnIle: 1.148 ± 0.284
1.659AsnLys: 1.659 ± 0.305
2.36AsnLeu: 2.36 ± 0.353
0.51AsnMet: 0.51 ± 0.137
1.34AsnAsn: 1.34 ± 0.303
3.317AsnPro: 3.317 ± 0.441
0.893AsnGln: 0.893 ± 0.223
1.786AsnArg: 1.786 ± 0.464
2.297AsnSer: 2.297 ± 0.489
1.34AsnThr: 1.34 ± 0.263
2.679AsnVal: 2.679 ± 0.35
1.085AsnTrp: 1.085 ± 0.228
1.467AsnTyr: 1.467 ± 0.304
0.0AsnXaa: 0.0 ± 0.0
Pro
5.423ProAla: 5.423 ± 0.804
0.766ProCys: 0.766 ± 0.227
3.381ProAsp: 3.381 ± 0.623
3.764ProGlu: 3.764 ± 0.595
1.467ProPhe: 1.467 ± 0.303
5.933ProGly: 5.933 ± 0.519
1.085ProHis: 1.085 ± 0.297
2.233ProIle: 2.233 ± 0.308
2.935ProLys: 2.935 ± 0.487
4.466ProLeu: 4.466 ± 0.638
1.085ProMet: 1.085 ± 0.22
2.105ProAsn: 2.105 ± 0.366
3.573ProPro: 3.573 ± 0.615
2.233ProGln: 2.233 ± 0.366
3.636ProArg: 3.636 ± 0.53
3.317ProSer: 3.317 ± 0.451
3.573ProThr: 3.573 ± 0.399
4.53ProVal: 4.53 ± 0.615
1.148ProTrp: 1.148 ± 0.337
1.276ProTyr: 1.276 ± 0.244
0.0ProXaa: 0.0 ± 0.0
Gln
3.7GlnAla: 3.7 ± 0.624
0.064GlnCys: 0.064 ± 0.065
2.105GlnAsp: 2.105 ± 0.402
2.233GlnGlu: 2.233 ± 0.352
1.34GlnPhe: 1.34 ± 0.236
2.36GlnGly: 2.36 ± 0.413
0.893GlnHis: 0.893 ± 0.268
1.722GlnIle: 1.722 ± 0.343
2.041GlnLys: 2.041 ± 0.393
2.871GlnLeu: 2.871 ± 0.413
1.021GlnMet: 1.021 ± 0.305
1.34GlnAsn: 1.34 ± 0.263
2.105GlnPro: 2.105 ± 0.334
1.531GlnGln: 1.531 ± 0.295
2.36GlnArg: 2.36 ± 0.364
1.914GlnSer: 1.914 ± 0.385
1.404GlnThr: 1.404 ± 0.292
2.807GlnVal: 2.807 ± 0.414
0.702GlnTrp: 0.702 ± 0.227
1.021GlnTyr: 1.021 ± 0.303
0.0GlnXaa: 0.0 ± 0.0
Arg
4.976ArgAla: 4.976 ± 0.615
0.957ArgCys: 0.957 ± 0.322
3.509ArgAsp: 3.509 ± 0.452
4.466ArgGlu: 4.466 ± 0.555
2.169ArgPhe: 2.169 ± 0.37
4.657ArgGly: 4.657 ± 0.555
1.531ArgHis: 1.531 ± 0.283
3.317ArgIle: 3.317 ± 0.466
4.976ArgLys: 4.976 ± 0.774
5.55ArgLeu: 5.55 ± 0.69
1.276ArgMet: 1.276 ± 0.277
2.105ArgAsn: 2.105 ± 0.585
3.381ArgPro: 3.381 ± 0.47
2.807ArgGln: 2.807 ± 0.446
4.848ArgArg: 4.848 ± 0.763
2.935ArgSer: 2.935 ± 0.4
3.381ArgThr: 3.381 ± 0.575
4.083ArgVal: 4.083 ± 0.643
1.276ArgTrp: 1.276 ± 0.299
1.722ArgTyr: 1.722 ± 0.403
0.0ArgXaa: 0.0 ± 0.0
Ser
4.211SerAla: 4.211 ± 0.851
0.51SerCys: 0.51 ± 0.178
3.636SerAsp: 3.636 ± 0.578
2.679SerGlu: 2.679 ± 0.375
1.595SerPhe: 1.595 ± 0.244
5.805SerGly: 5.805 ± 0.666
0.766SerHis: 0.766 ± 0.24
2.424SerIle: 2.424 ± 0.345
2.297SerLys: 2.297 ± 0.458
4.211SerLeu: 4.211 ± 0.465
1.212SerMet: 1.212 ± 0.237
1.659SerAsn: 1.659 ± 0.261
3.254SerPro: 3.254 ± 0.39
2.36SerGln: 2.36 ± 0.437
2.743SerArg: 2.743 ± 0.474
3.062SerSer: 3.062 ± 0.465
3.828SerThr: 3.828 ± 0.577
4.019SerVal: 4.019 ± 0.502
1.148SerTrp: 1.148 ± 0.259
1.722SerTyr: 1.722 ± 0.362
0.0SerXaa: 0.0 ± 0.0
Thr
5.486ThrAla: 5.486 ± 0.676
0.702ThrCys: 0.702 ± 0.234
4.338ThrAsp: 4.338 ± 0.627
3.254ThrGlu: 3.254 ± 0.542
2.233ThrPhe: 2.233 ± 0.372
4.848ThrGly: 4.848 ± 0.681
1.276ThrHis: 1.276 ± 0.314
2.488ThrIle: 2.488 ± 0.429
2.935ThrLys: 2.935 ± 0.47
4.721ThrLeu: 4.721 ± 0.454
1.531ThrMet: 1.531 ± 0.345
2.36ThrAsn: 2.36 ± 0.476
4.019ThrPro: 4.019 ± 0.575
1.914ThrGln: 1.914 ± 0.318
3.636ThrArg: 3.636 ± 0.577
3.573ThrSer: 3.573 ± 0.435
3.126ThrThr: 3.126 ± 0.416
4.466ThrVal: 4.466 ± 0.592
1.085ThrTrp: 1.085 ± 0.286
1.722ThrTyr: 1.722 ± 0.388
0.0ThrXaa: 0.0 ± 0.0
Val
5.04ValAla: 5.04 ± 0.583
0.638ValCys: 0.638 ± 0.185
4.211ValAsp: 4.211 ± 0.505
4.53ValGlu: 4.53 ± 0.502
2.36ValPhe: 2.36 ± 0.424
4.593ValGly: 4.593 ± 0.593
1.722ValHis: 1.722 ± 0.445
4.53ValIle: 4.53 ± 0.518
4.402ValLys: 4.402 ± 0.839
4.657ValLeu: 4.657 ± 0.575
1.085ValMet: 1.085 ± 0.289
2.935ValAsn: 2.935 ± 0.478
4.657ValPro: 4.657 ± 0.59
1.978ValGln: 1.978 ± 0.313
3.828ValArg: 3.828 ± 0.438
3.764ValSer: 3.764 ± 0.47
4.466ValThr: 4.466 ± 0.495
4.53ValVal: 4.53 ± 0.521
1.595ValTrp: 1.595 ± 0.312
1.404ValTyr: 1.404 ± 0.31
0.0ValXaa: 0.0 ± 0.0
Trp
1.595TrpAla: 1.595 ± 0.333
0.319TrpCys: 0.319 ± 0.152
1.659TrpAsp: 1.659 ± 0.352
1.786TrpGlu: 1.786 ± 0.344
0.893TrpPhe: 0.893 ± 0.21
1.786TrpGly: 1.786 ± 0.393
0.574TrpHis: 0.574 ± 0.212
1.021TrpIle: 1.021 ± 0.243
0.957TrpLys: 0.957 ± 0.222
2.488TrpLeu: 2.488 ± 0.382
0.51TrpMet: 0.51 ± 0.191
0.957TrpAsn: 0.957 ± 0.251
1.276TrpPro: 1.276 ± 0.299
1.404TrpGln: 1.404 ± 0.317
1.212TrpArg: 1.212 ± 0.353
1.021TrpSer: 1.021 ± 0.315
1.34TrpThr: 1.34 ± 0.35
1.404TrpVal: 1.404 ± 0.29
0.766TrpTrp: 0.766 ± 0.235
0.574TrpTyr: 0.574 ± 0.172
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.297TyrAla: 2.297 ± 0.358
0.51TyrCys: 0.51 ± 0.212
1.786TyrAsp: 1.786 ± 0.304
1.276TyrGlu: 1.276 ± 0.289
0.766TyrPhe: 0.766 ± 0.206
2.233TyrGly: 2.233 ± 0.334
0.957TyrHis: 0.957 ± 0.256
1.021TyrIle: 1.021 ± 0.244
1.404TyrLys: 1.404 ± 0.338
2.743TyrLeu: 2.743 ± 0.353
0.638TyrMet: 0.638 ± 0.176
1.276TyrAsn: 1.276 ± 0.271
1.34TyrPro: 1.34 ± 0.296
1.085TyrGln: 1.085 ± 0.3
2.297TyrArg: 2.297 ± 0.461
1.467TyrSer: 1.467 ± 0.301
2.169TyrThr: 2.169 ± 0.401
1.914TyrVal: 1.914 ± 0.481
0.638TyrTrp: 0.638 ± 0.207
1.276TyrTyr: 1.276 ± 0.283
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 71 proteins (15676 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski