Amino acid dipepetide frequency for Mycobacterium phage Ebony

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
11.897AlaAla: 11.897 ± 1.117
0.929AlaCys: 0.929 ± 0.193
5.762AlaAsp: 5.762 ± 0.697
7.807AlaGlu: 7.807 ± 0.706
3.594AlaPhe: 3.594 ± 0.482
7.435AlaGly: 7.435 ± 0.722
1.425AlaHis: 1.425 ± 0.278
4.895AlaIle: 4.895 ± 0.512
3.78AlaLys: 3.78 ± 0.598
9.108AlaLeu: 9.108 ± 1.022
2.355AlaMet: 2.355 ± 0.393
3.284AlaAsn: 3.284 ± 0.549
4.771AlaPro: 4.771 ± 0.72
2.974AlaGln: 2.974 ± 0.526
5.948AlaArg: 5.948 ± 0.656
4.585AlaSer: 4.585 ± 0.485
5.143AlaThr: 5.143 ± 0.633
7.002AlaVal: 7.002 ± 0.675
1.797AlaTrp: 1.797 ± 0.268
2.54AlaTyr: 2.54 ± 0.43
0.0AlaXaa: 0.0 ± 0.0
Cys
0.62CysAla: 0.62 ± 0.202
0.0CysCys: 0.0 ± 0.0
0.682CysAsp: 0.682 ± 0.232
0.496CysGlu: 0.496 ± 0.228
0.372CysPhe: 0.372 ± 0.152
1.053CysGly: 1.053 ± 0.246
0.186CysHis: 0.186 ± 0.1
0.496CysIle: 0.496 ± 0.178
0.062CysLys: 0.062 ± 0.058
0.991CysLeu: 0.991 ± 0.26
0.31CysMet: 0.31 ± 0.163
0.31CysAsn: 0.31 ± 0.144
0.558CysPro: 0.558 ± 0.218
0.372CysGln: 0.372 ± 0.15
0.867CysArg: 0.867 ± 0.236
0.434CysSer: 0.434 ± 0.159
0.434CysThr: 0.434 ± 0.153
0.496CysVal: 0.496 ± 0.163
0.31CysTrp: 0.31 ± 0.149
0.31CysTyr: 0.31 ± 0.14
0.0CysXaa: 0.0 ± 0.0
Asp
5.639AspAla: 5.639 ± 0.592
0.62AspCys: 0.62 ± 0.219
3.284AspAsp: 3.284 ± 0.469
5.143AspGlu: 5.143 ± 0.561
2.602AspPhe: 2.602 ± 0.502
5.391AspGly: 5.391 ± 0.633
1.797AspHis: 1.797 ± 0.324
3.098AspIle: 3.098 ± 0.56
3.036AspLys: 3.036 ± 0.363
5.205AspLeu: 5.205 ± 0.508
1.673AspMet: 1.673 ± 0.318
1.487AspAsn: 1.487 ± 0.289
5.143AspPro: 5.143 ± 0.544
2.045AspGln: 2.045 ± 0.312
4.028AspArg: 4.028 ± 0.538
2.788AspSer: 2.788 ± 0.361
3.656AspThr: 3.656 ± 0.42
4.585AspVal: 4.585 ± 0.565
1.053AspTrp: 1.053 ± 0.252
2.107AspTyr: 2.107 ± 0.381
0.0AspXaa: 0.0 ± 0.0
Glu
7.745GluAla: 7.745 ± 0.634
0.372GluCys: 0.372 ± 0.174
3.842GluAsp: 3.842 ± 0.458
5.886GluGlu: 5.886 ± 0.559
2.664GluPhe: 2.664 ± 0.409
6.01GluGly: 6.01 ± 0.544
1.921GluHis: 1.921 ± 0.384
3.78GluIle: 3.78 ± 0.494
2.417GluLys: 2.417 ± 0.378
7.621GluLeu: 7.621 ± 0.682
2.293GluMet: 2.293 ± 0.336
2.107GluAsn: 2.107 ± 0.351
2.602GluPro: 2.602 ± 0.458
2.169GluGln: 2.169 ± 0.415
4.647GluArg: 4.647 ± 0.524
2.664GluSer: 2.664 ± 0.385
3.904GluThr: 3.904 ± 0.463
5.205GluVal: 5.205 ± 0.521
1.549GluTrp: 1.549 ± 0.31
2.231GluTyr: 2.231 ± 0.322
0.0GluXaa: 0.0 ± 0.0
Phe
2.417PheAla: 2.417 ± 0.327
0.372PheCys: 0.372 ± 0.131
2.974PheAsp: 2.974 ± 0.361
2.726PheGlu: 2.726 ± 0.426
0.806PhePhe: 0.806 ± 0.247
2.602PheGly: 2.602 ± 0.425
0.496PheHis: 0.496 ± 0.165
1.611PheIle: 1.611 ± 0.31
1.487PheLys: 1.487 ± 0.285
2.726PheLeu: 2.726 ± 0.487
0.558PheMet: 0.558 ± 0.175
1.673PheAsn: 1.673 ± 0.278
2.54PhePro: 2.54 ± 0.368
0.991PheGln: 0.991 ± 0.284
2.231PheArg: 2.231 ± 0.375
2.478PheSer: 2.478 ± 0.489
1.363PheThr: 1.363 ± 0.317
2.478PheVal: 2.478 ± 0.424
0.496PheTrp: 0.496 ± 0.184
0.62PheTyr: 0.62 ± 0.212
0.0PheXaa: 0.0 ± 0.0
Gly
6.444GlyAla: 6.444 ± 0.946
0.62GlyCys: 0.62 ± 0.192
5.7GlyAsp: 5.7 ± 0.684
4.647GlyGlu: 4.647 ± 0.514
2.788GlyPhe: 2.788 ± 0.45
9.852GlyGly: 9.852 ± 2.72
2.169GlyHis: 2.169 ± 0.39
4.275GlyIle: 4.275 ± 0.557
3.842GlyLys: 3.842 ± 0.51
6.32GlyLeu: 6.32 ± 0.73
1.549GlyMet: 1.549 ± 0.29
2.85GlyAsn: 2.85 ± 0.408
3.47GlyPro: 3.47 ± 0.526
3.16GlyGln: 3.16 ± 0.502
4.213GlyArg: 4.213 ± 0.573
4.028GlySer: 4.028 ± 0.572
5.205GlyThr: 5.205 ± 0.611
5.824GlyVal: 5.824 ± 0.692
1.859GlyTrp: 1.859 ± 0.354
2.788GlyTyr: 2.788 ± 0.405
0.0GlyXaa: 0.0 ± 0.0
His
1.239HisAla: 1.239 ± 0.262
0.372HisCys: 0.372 ± 0.154
1.549HisAsp: 1.549 ± 0.362
1.673HisGlu: 1.673 ± 0.297
0.558HisPhe: 0.558 ± 0.263
1.797HisGly: 1.797 ± 0.358
0.744HisHis: 0.744 ± 0.203
1.115HisIle: 1.115 ± 0.297
1.115HisLys: 1.115 ± 0.305
1.611HisLeu: 1.611 ± 0.316
0.558HisMet: 0.558 ± 0.168
0.496HisAsn: 0.496 ± 0.178
0.991HisPro: 0.991 ± 0.246
1.177HisGln: 1.177 ± 0.295
1.735HisArg: 1.735 ± 0.379
0.744HisSer: 0.744 ± 0.212
0.991HisThr: 0.991 ± 0.234
1.673HisVal: 1.673 ± 0.334
0.434HisTrp: 0.434 ± 0.173
0.62HisTyr: 0.62 ± 0.247
0.0HisXaa: 0.0 ± 0.0
Ile
4.399IleAla: 4.399 ± 0.491
0.558IleCys: 0.558 ± 0.188
3.594IleAsp: 3.594 ± 0.429
5.329IleGlu: 5.329 ± 0.596
1.239IlePhe: 1.239 ± 0.275
3.594IleGly: 3.594 ± 0.423
0.867IleHis: 0.867 ± 0.226
2.045IleIle: 2.045 ± 0.389
2.293IleLys: 2.293 ± 0.389
3.16IleLeu: 3.16 ± 0.418
0.867IleMet: 0.867 ± 0.23
2.664IleAsn: 2.664 ± 0.363
3.594IlePro: 3.594 ± 0.444
1.425IleGln: 1.425 ± 0.306
3.78IleArg: 3.78 ± 0.426
2.912IleSer: 2.912 ± 0.469
3.532IleThr: 3.532 ± 0.439
3.532IleVal: 3.532 ± 0.489
0.744IleTrp: 0.744 ± 0.169
0.929IleTyr: 0.929 ± 0.278
0.0IleXaa: 0.0 ± 0.0
Lys
4.585LysAla: 4.585 ± 0.564
0.31LysCys: 0.31 ± 0.155
2.85LysAsp: 2.85 ± 0.337
2.788LysGlu: 2.788 ± 0.427
1.177LysPhe: 1.177 ± 0.209
3.222LysGly: 3.222 ± 0.403
1.053LysHis: 1.053 ± 0.239
1.797LysIle: 1.797 ± 0.242
2.788LysLys: 2.788 ± 0.624
3.222LysLeu: 3.222 ± 0.49
1.115LysMet: 1.115 ± 0.242
1.859LysAsn: 1.859 ± 0.371
2.85LysPro: 2.85 ± 0.506
1.797LysGln: 1.797 ± 0.297
3.284LysArg: 3.284 ± 0.477
2.602LysSer: 2.602 ± 0.401
2.231LysThr: 2.231 ± 0.373
4.213LysVal: 4.213 ± 0.511
0.682LysTrp: 0.682 ± 0.191
1.115LysTyr: 1.115 ± 0.255
0.0LysXaa: 0.0 ± 0.0
Leu
8.179LeuAla: 8.179 ± 0.668
0.744LeuCys: 0.744 ± 0.22
5.019LeuAsp: 5.019 ± 0.458
6.258LeuGlu: 6.258 ± 0.708
2.417LeuPhe: 2.417 ± 0.313
5.639LeuGly: 5.639 ± 0.511
2.85LeuHis: 2.85 ± 0.462
3.78LeuIle: 3.78 ± 0.391
4.461LeuLys: 4.461 ± 0.804
4.833LeuLeu: 4.833 ± 0.646
2.355LeuMet: 2.355 ± 0.382
2.417LeuAsn: 2.417 ± 0.441
4.028LeuPro: 4.028 ± 0.504
2.355LeuGln: 2.355 ± 0.338
5.143LeuArg: 5.143 ± 0.551
5.081LeuSer: 5.081 ± 0.587
5.019LeuThr: 5.019 ± 0.575
5.391LeuVal: 5.391 ± 0.59
1.921LeuTrp: 1.921 ± 0.285
2.664LeuTyr: 2.664 ± 0.364
0.0LeuXaa: 0.0 ± 0.0
Met
2.169MetAla: 2.169 ± 0.343
0.124MetCys: 0.124 ± 0.097
0.991MetAsp: 0.991 ± 0.227
1.301MetGlu: 1.301 ± 0.218
0.558MetPhe: 0.558 ± 0.176
1.549MetGly: 1.549 ± 0.306
0.372MetHis: 0.372 ± 0.137
1.549MetIle: 1.549 ± 0.282
1.735MetLys: 1.735 ± 0.304
2.169MetLeu: 2.169 ± 0.381
0.248MetMet: 0.248 ± 0.105
0.62MetAsn: 0.62 ± 0.183
1.239MetPro: 1.239 ± 0.261
0.744MetGln: 0.744 ± 0.214
1.363MetArg: 1.363 ± 0.251
2.54MetSer: 2.54 ± 0.385
1.983MetThr: 1.983 ± 0.335
1.239MetVal: 1.239 ± 0.286
0.372MetTrp: 0.372 ± 0.153
0.496MetTyr: 0.496 ± 0.191
0.0MetXaa: 0.0 ± 0.0
Asn
2.664AsnAla: 2.664 ± 0.434
0.372AsnCys: 0.372 ± 0.161
1.921AsnAsp: 1.921 ± 0.312
2.355AsnGlu: 2.355 ± 0.4
1.239AsnPhe: 1.239 ± 0.293
3.594AsnGly: 3.594 ± 0.52
0.867AsnHis: 0.867 ± 0.211
1.921AsnIle: 1.921 ± 0.3
1.115AsnLys: 1.115 ± 0.234
3.16AsnLeu: 3.16 ± 0.417
0.434AsnMet: 0.434 ± 0.137
0.806AsnAsn: 0.806 ± 0.204
2.602AsnPro: 2.602 ± 0.394
1.363AsnGln: 1.363 ± 0.241
1.673AsnArg: 1.673 ± 0.33
1.735AsnSer: 1.735 ± 0.35
1.921AsnThr: 1.921 ± 0.339
2.54AsnVal: 2.54 ± 0.35
0.806AsnTrp: 0.806 ± 0.207
0.806AsnTyr: 0.806 ± 0.168
0.0AsnXaa: 0.0 ± 0.0
Pro
5.081ProAla: 5.081 ± 0.609
0.496ProCys: 0.496 ± 0.183
4.337ProAsp: 4.337 ± 0.485
4.709ProGlu: 4.709 ± 0.599
2.045ProPhe: 2.045 ± 0.347
4.089ProGly: 4.089 ± 0.589
0.806ProHis: 0.806 ± 0.234
2.231ProIle: 2.231 ± 0.342
2.912ProLys: 2.912 ± 0.561
3.718ProLeu: 3.718 ± 0.459
1.239ProMet: 1.239 ± 0.284
2.664ProAsn: 2.664 ± 0.439
2.169ProPro: 2.169 ± 0.355
2.355ProGln: 2.355 ± 0.47
3.16ProArg: 3.16 ± 0.491
3.098ProSer: 3.098 ± 0.419
3.408ProThr: 3.408 ± 0.524
3.904ProVal: 3.904 ± 0.477
1.053ProTrp: 1.053 ± 0.38
1.363ProTyr: 1.363 ± 0.287
0.0ProXaa: 0.0 ± 0.0
Gln
3.966GlnAla: 3.966 ± 0.498
0.248GlnCys: 0.248 ± 0.123
1.921GlnAsp: 1.921 ± 0.326
1.859GlnGlu: 1.859 ± 0.373
1.301GlnPhe: 1.301 ± 0.276
3.408GlnGly: 3.408 ± 0.475
0.496GlnHis: 0.496 ± 0.158
2.478GlnIle: 2.478 ± 0.41
1.611GlnLys: 1.611 ± 0.333
3.408GlnLeu: 3.408 ± 0.566
0.806GlnMet: 0.806 ± 0.201
0.434GlnAsn: 0.434 ± 0.142
1.921GlnPro: 1.921 ± 0.371
1.239GlnGln: 1.239 ± 0.369
2.54GlnArg: 2.54 ± 0.437
1.673GlnSer: 1.673 ± 0.356
1.797GlnThr: 1.797 ± 0.312
2.478GlnVal: 2.478 ± 0.381
0.806GlnTrp: 0.806 ± 0.283
1.301GlnTyr: 1.301 ± 0.266
0.0GlnXaa: 0.0 ± 0.0
Arg
6.258ArgAla: 6.258 ± 0.739
0.929ArgCys: 0.929 ± 0.315
4.151ArgAsp: 4.151 ± 0.51
3.78ArgGlu: 3.78 ± 0.536
2.169ArgPhe: 2.169 ± 0.385
4.337ArgGly: 4.337 ± 0.512
1.425ArgHis: 1.425 ± 0.27
3.656ArgIle: 3.656 ± 0.44
2.974ArgLys: 2.974 ± 0.419
5.453ArgLeu: 5.453 ± 0.56
1.797ArgMet: 1.797 ± 0.301
1.859ArgAsn: 1.859 ± 0.381
3.408ArgPro: 3.408 ± 0.456
2.355ArgGln: 2.355 ± 0.402
5.948ArgArg: 5.948 ± 0.763
2.788ArgSer: 2.788 ± 0.409
2.417ArgThr: 2.417 ± 0.336
5.081ArgVal: 5.081 ± 0.556
1.301ArgTrp: 1.301 ± 0.293
1.859ArgTyr: 1.859 ± 0.362
0.0ArgXaa: 0.0 ± 0.0
Ser
5.205SerAla: 5.205 ± 0.663
0.62SerCys: 0.62 ± 0.2
2.788SerAsp: 2.788 ± 0.393
3.346SerGlu: 3.346 ± 0.503
1.859SerPhe: 1.859 ± 0.321
4.709SerGly: 4.709 ± 0.599
0.806SerHis: 0.806 ± 0.215
2.417SerIle: 2.417 ± 0.354
1.983SerLys: 1.983 ± 0.435
3.78SerLeu: 3.78 ± 0.618
1.673SerMet: 1.673 ± 0.254
1.611SerAsn: 1.611 ± 0.308
3.408SerPro: 3.408 ± 0.421
2.478SerGln: 2.478 ± 0.394
3.222SerArg: 3.222 ± 0.407
2.726SerSer: 2.726 ± 0.459
2.974SerThr: 2.974 ± 0.469
3.78SerVal: 3.78 ± 0.535
1.425SerTrp: 1.425 ± 0.302
1.487SerTyr: 1.487 ± 0.264
0.0SerXaa: 0.0 ± 0.0
Thr
5.886ThrAla: 5.886 ± 0.474
0.434ThrCys: 0.434 ± 0.161
4.213ThrAsp: 4.213 ± 0.589
2.974ThrGlu: 2.974 ± 0.387
2.045ThrPhe: 2.045 ± 0.322
5.453ThrGly: 5.453 ± 0.642
0.867ThrHis: 0.867 ± 0.228
2.602ThrIle: 2.602 ± 0.418
2.355ThrLys: 2.355 ± 0.427
4.337ThrLeu: 4.337 ± 0.519
1.053ThrMet: 1.053 ± 0.294
1.921ThrAsn: 1.921 ± 0.393
4.275ThrPro: 4.275 ± 0.514
2.169ThrGln: 2.169 ± 0.382
2.664ThrArg: 2.664 ± 0.381
3.098ThrSer: 3.098 ± 0.497
3.284ThrThr: 3.284 ± 0.477
4.275ThrVal: 4.275 ± 0.425
0.806ThrTrp: 0.806 ± 0.189
2.045ThrTyr: 2.045 ± 0.379
0.0ThrXaa: 0.0 ± 0.0
Val
7.807ValAla: 7.807 ± 0.795
0.558ValCys: 0.558 ± 0.197
5.081ValAsp: 5.081 ± 0.783
5.143ValGlu: 5.143 ± 0.572
2.85ValPhe: 2.85 ± 0.46
4.523ValGly: 4.523 ± 0.487
0.929ValHis: 0.929 ± 0.245
4.213ValIle: 4.213 ± 0.474
3.346ValLys: 3.346 ± 0.379
5.577ValLeu: 5.577 ± 0.697
1.549ValMet: 1.549 ± 0.353
3.222ValAsn: 3.222 ± 0.425
3.16ValPro: 3.16 ± 0.511
2.478ValGln: 2.478 ± 0.329
4.399ValArg: 4.399 ± 0.522
3.78ValSer: 3.78 ± 0.477
4.833ValThr: 4.833 ± 0.519
6.072ValVal: 6.072 ± 0.709
1.549ValTrp: 1.549 ± 0.284
2.726ValTyr: 2.726 ± 0.446
0.0ValXaa: 0.0 ± 0.0
Trp
1.859TrpAla: 1.859 ± 0.434
0.434TrpCys: 0.434 ± 0.166
1.363TrpAsp: 1.363 ± 0.228
1.487TrpGlu: 1.487 ± 0.293
0.744TrpPhe: 0.744 ± 0.226
1.177TrpGly: 1.177 ± 0.331
0.558TrpHis: 0.558 ± 0.168
1.053TrpIle: 1.053 ± 0.244
0.867TrpLys: 0.867 ± 0.228
1.487TrpLeu: 1.487 ± 0.318
0.496TrpMet: 0.496 ± 0.2
0.682TrpAsn: 0.682 ± 0.227
0.867TrpPro: 0.867 ± 0.261
0.929TrpGln: 0.929 ± 0.181
1.115TrpArg: 1.115 ± 0.232
1.239TrpSer: 1.239 ± 0.254
1.053TrpThr: 1.053 ± 0.238
1.363TrpVal: 1.363 ± 0.279
0.496TrpTrp: 0.496 ± 0.173
0.62TrpTyr: 0.62 ± 0.19
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.974TyrAla: 2.974 ± 0.364
0.248TyrCys: 0.248 ± 0.126
2.355TyrAsp: 2.355 ± 0.308
1.921TyrGlu: 1.921 ± 0.386
0.682TyrPhe: 0.682 ± 0.199
2.169TyrGly: 2.169 ± 0.345
0.558TyrHis: 0.558 ± 0.212
2.045TyrIle: 2.045 ± 0.358
1.363TyrLys: 1.363 ± 0.266
2.664TyrLeu: 2.664 ± 0.395
0.372TyrMet: 0.372 ± 0.172
0.929TyrAsn: 0.929 ± 0.213
1.239TyrPro: 1.239 ± 0.254
1.115TyrGln: 1.115 ± 0.238
1.983TyrArg: 1.983 ± 0.39
1.301TyrSer: 1.301 ± 0.257
1.611TyrThr: 1.611 ± 0.32
2.664TyrVal: 2.664 ± 0.428
0.434TyrTrp: 0.434 ± 0.186
0.929TyrTyr: 0.929 ± 0.294
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 98 proteins (16140 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski