Amino acid dipepetide frequency for Mycobacterium phage Llij

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
13.675AlaAla: 13.675 ± 1.922
1.056AlaCys: 1.056 ± 0.22
6.671AlaAsp: 6.671 ± 0.518
6.726AlaGlu: 6.726 ± 0.689
2.835AlaPhe: 2.835 ± 0.431
9.728AlaGly: 9.728 ± 1.233
2.668AlaHis: 2.668 ± 0.429
3.891AlaIle: 3.891 ± 0.535
4.28AlaLys: 4.28 ± 0.357
7.56AlaLeu: 7.56 ± 0.795
2.668AlaMet: 2.668 ± 0.428
3.28AlaAsn: 3.28 ± 0.42
5.003AlaPro: 5.003 ± 0.547
3.224AlaGln: 3.224 ± 0.38
8.06AlaArg: 8.06 ± 0.767
4.725AlaSer: 4.725 ± 0.554
6.949AlaThr: 6.949 ± 0.597
7.227AlaVal: 7.227 ± 0.616
2.557AlaTrp: 2.557 ± 0.347
2.168AlaTyr: 2.168 ± 0.32
0.0AlaXaa: 0.0 ± 0.0
Cys
1.167CysAla: 1.167 ± 0.347
0.222CysCys: 0.222 ± 0.11
1.501CysAsp: 1.501 ± 0.36
0.834CysGlu: 0.834 ± 0.243
0.334CysPhe: 0.334 ± 0.122
2.168CysGly: 2.168 ± 0.469
0.334CysHis: 0.334 ± 0.15
0.056CysIle: 0.056 ± 0.053
0.5CysLys: 0.5 ± 0.17
0.667CysLeu: 0.667 ± 0.217
0.167CysMet: 0.167 ± 0.095
0.5CysAsn: 0.5 ± 0.169
1.001CysPro: 1.001 ± 0.254
0.445CysGln: 0.445 ± 0.142
0.834CysArg: 0.834 ± 0.253
1.001CysSer: 1.001 ± 0.283
0.834CysThr: 0.834 ± 0.279
0.889CysVal: 0.889 ± 0.202
0.334CysTrp: 0.334 ± 0.124
0.334CysTyr: 0.334 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
6.949AspAla: 6.949 ± 0.733
1.001AspCys: 1.001 ± 0.203
4.447AspAsp: 4.447 ± 0.524
3.002AspGlu: 3.002 ± 0.478
1.89AspPhe: 1.89 ± 0.272
6.448AspGly: 6.448 ± 0.651
1.501AspHis: 1.501 ± 0.258
2.279AspIle: 2.279 ± 0.303
1.668AspLys: 1.668 ± 0.293
5.448AspLeu: 5.448 ± 0.533
1.056AspMet: 1.056 ± 0.268
1.723AspAsn: 1.723 ± 0.342
5.003AspPro: 5.003 ± 0.585
2.168AspGln: 2.168 ± 0.301
5.392AspArg: 5.392 ± 0.675
3.335AspSer: 3.335 ± 0.502
4.67AspThr: 4.67 ± 0.472
4.447AspVal: 4.447 ± 0.473
1.668AspTrp: 1.668 ± 0.278
2.001AspTyr: 2.001 ± 0.31
0.0AspXaa: 0.0 ± 0.0
Glu
6.17GluAla: 6.17 ± 0.573
1.167GluCys: 1.167 ± 0.249
3.335GluAsp: 3.335 ± 0.37
3.057GluGlu: 3.057 ± 0.536
2.112GluPhe: 2.112 ± 0.333
3.558GluGly: 3.558 ± 0.407
1.723GluHis: 1.723 ± 0.43
2.168GluIle: 2.168 ± 0.328
2.224GluLys: 2.224 ± 0.311
5.503GluLeu: 5.503 ± 0.682
1.445GluMet: 1.445 ± 0.301
1.89GluAsn: 1.89 ± 0.294
3.002GluPro: 3.002 ± 0.434
2.502GluGln: 2.502 ± 0.373
5.114GluArg: 5.114 ± 0.587
2.891GluSer: 2.891 ± 0.46
3.724GluThr: 3.724 ± 0.514
3.558GluVal: 3.558 ± 0.432
1.39GluTrp: 1.39 ± 0.264
1.779GluTyr: 1.779 ± 0.311
0.0GluXaa: 0.0 ± 0.0
Phe
3.224PheAla: 3.224 ± 0.468
0.334PheCys: 0.334 ± 0.129
2.335PheAsp: 2.335 ± 0.401
1.779PheGlu: 1.779 ± 0.319
0.945PhePhe: 0.945 ± 0.278
2.835PheGly: 2.835 ± 0.562
0.5PheHis: 0.5 ± 0.157
1.501PheIle: 1.501 ± 0.349
1.223PheLys: 1.223 ± 0.265
1.946PheLeu: 1.946 ± 0.291
0.611PheMet: 0.611 ± 0.184
1.334PheAsn: 1.334 ± 0.352
1.612PhePro: 1.612 ± 0.338
1.001PheGln: 1.001 ± 0.275
1.723PheArg: 1.723 ± 0.26
1.612PheSer: 1.612 ± 0.265
2.446PheThr: 2.446 ± 0.35
2.335PheVal: 2.335 ± 0.273
0.667PheTrp: 0.667 ± 0.165
0.945PheTyr: 0.945 ± 0.256
0.0PheXaa: 0.0 ± 0.0
Gly
8.728GlyAla: 8.728 ± 1.148
1.445GlyCys: 1.445 ± 0.347
6.059GlyAsp: 6.059 ± 0.531
4.336GlyGlu: 4.336 ± 0.479
2.779GlyPhe: 2.779 ± 0.426
10.451GlyGly: 10.451 ± 2.093
2.001GlyHis: 2.001 ± 0.253
3.78GlyIle: 3.78 ± 0.527
2.502GlyLys: 2.502 ± 0.375
6.059GlyLeu: 6.059 ± 0.595
2.39GlyMet: 2.39 ± 0.421
3.057GlyAsn: 3.057 ± 0.326
3.836GlyPro: 3.836 ± 0.472
2.112GlyGln: 2.112 ± 0.579
5.225GlyArg: 5.225 ± 0.665
5.837GlySer: 5.837 ± 0.705
6.615GlyThr: 6.615 ± 0.787
6.226GlyVal: 6.226 ± 0.498
2.724GlyTrp: 2.724 ± 0.441
2.279GlyTyr: 2.279 ± 0.348
0.0GlyXaa: 0.0 ± 0.0
His
2.001HisAla: 2.001 ± 0.324
0.445HisCys: 0.445 ± 0.168
1.167HisAsp: 1.167 ± 0.232
1.223HisGlu: 1.223 ± 0.261
0.334HisPhe: 0.334 ± 0.122
1.723HisGly: 1.723 ± 0.289
0.889HisHis: 0.889 ± 0.294
1.668HisIle: 1.668 ± 0.313
0.778HisLys: 0.778 ± 0.23
1.946HisLeu: 1.946 ± 0.294
0.445HisMet: 0.445 ± 0.147
1.001HisAsn: 1.001 ± 0.195
1.612HisPro: 1.612 ± 0.263
0.611HisGln: 0.611 ± 0.184
2.39HisArg: 2.39 ± 0.398
0.834HisSer: 0.834 ± 0.206
1.334HisThr: 1.334 ± 0.253
1.557HisVal: 1.557 ± 0.371
0.445HisTrp: 0.445 ± 0.112
0.889HisTyr: 0.889 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
4.836IleAla: 4.836 ± 0.596
0.723IleCys: 0.723 ± 0.204
3.78IleAsp: 3.78 ± 0.438
3.224IleGlu: 3.224 ± 0.418
0.834IlePhe: 0.834 ± 0.232
3.78IleGly: 3.78 ± 0.493
1.279IleHis: 1.279 ± 0.243
1.334IleIle: 1.334 ± 0.246
1.279IleLys: 1.279 ± 0.278
2.168IleLeu: 2.168 ± 0.309
0.334IleMet: 0.334 ± 0.13
1.946IleAsn: 1.946 ± 0.255
2.557IlePro: 2.557 ± 0.318
1.39IleGln: 1.39 ± 0.249
2.001IleArg: 2.001 ± 0.294
2.112IleSer: 2.112 ± 0.309
3.335IleThr: 3.335 ± 0.36
3.057IleVal: 3.057 ± 0.401
1.001IleTrp: 1.001 ± 0.239
0.723IleTyr: 0.723 ± 0.183
0.0IleXaa: 0.0 ± 0.0
Lys
4.169LysAla: 4.169 ± 0.45
0.445LysCys: 0.445 ± 0.142
1.612LysAsp: 1.612 ± 0.279
1.501LysGlu: 1.501 ± 0.3
1.279LysPhe: 1.279 ± 0.189
2.668LysGly: 2.668 ± 0.347
1.167LysHis: 1.167 ± 0.265
1.056LysIle: 1.056 ± 0.219
1.501LysLys: 1.501 ± 0.314
2.613LysLeu: 2.613 ± 0.459
0.556LysMet: 0.556 ± 0.14
0.778LysAsn: 0.778 ± 0.211
2.057LysPro: 2.057 ± 0.293
1.946LysGln: 1.946 ± 0.292
2.891LysArg: 2.891 ± 0.395
2.057LysSer: 2.057 ± 0.294
1.834LysThr: 1.834 ± 0.308
2.446LysVal: 2.446 ± 0.363
0.667LysTrp: 0.667 ± 0.197
0.778LysTyr: 0.778 ± 0.2
0.0LysXaa: 0.0 ± 0.0
Leu
7.727LeuAla: 7.727 ± 0.901
0.945LeuCys: 0.945 ± 0.272
4.836LeuAsp: 4.836 ± 0.477
3.669LeuGlu: 3.669 ± 0.419
2.446LeuPhe: 2.446 ± 0.287
5.281LeuGly: 5.281 ± 0.531
1.112LeuHis: 1.112 ± 0.239
2.891LeuIle: 2.891 ± 0.414
2.168LeuLys: 2.168 ± 0.352
4.892LeuLeu: 4.892 ± 0.587
1.89LeuMet: 1.89 ± 0.33
2.557LeuAsn: 2.557 ± 0.363
5.003LeuPro: 5.003 ± 0.585
2.779LeuGln: 2.779 ± 0.427
5.225LeuArg: 5.225 ± 0.574
4.781LeuSer: 4.781 ± 0.475
5.392LeuThr: 5.392 ± 0.476
4.67LeuVal: 4.67 ± 0.553
1.279LeuTrp: 1.279 ± 0.285
2.112LeuTyr: 2.112 ± 0.332
0.0LeuXaa: 0.0 ± 0.0
Met
2.168MetAla: 2.168 ± 0.302
0.278MetCys: 0.278 ± 0.199
1.112MetAsp: 1.112 ± 0.2
1.112MetGlu: 1.112 ± 0.177
0.667MetPhe: 0.667 ± 0.158
1.946MetGly: 1.946 ± 0.326
0.111MetHis: 0.111 ± 0.077
0.889MetIle: 0.889 ± 0.215
0.778MetLys: 0.778 ± 0.243
1.834MetLeu: 1.834 ± 0.29
0.445MetMet: 0.445 ± 0.181
0.723MetAsn: 0.723 ± 0.185
1.167MetPro: 1.167 ± 0.253
0.5MetGln: 0.5 ± 0.151
1.501MetArg: 1.501 ± 0.294
2.835MetSer: 2.835 ± 0.462
1.946MetThr: 1.946 ± 0.278
1.279MetVal: 1.279 ± 0.331
0.389MetTrp: 0.389 ± 0.137
0.389MetTyr: 0.389 ± 0.118
0.0MetXaa: 0.0 ± 0.0
Asn
3.558AsnAla: 3.558 ± 0.413
0.5AsnCys: 0.5 ± 0.169
1.89AsnAsp: 1.89 ± 0.292
1.557AsnGlu: 1.557 ± 0.326
1.001AsnPhe: 1.001 ± 0.313
3.947AsnGly: 3.947 ± 0.534
0.834AsnHis: 0.834 ± 0.182
1.501AsnIle: 1.501 ± 0.414
1.001AsnLys: 1.001 ± 0.228
2.224AsnLeu: 2.224 ± 0.32
0.611AsnMet: 0.611 ± 0.161
1.668AsnAsn: 1.668 ± 0.348
2.946AsnPro: 2.946 ± 0.392
1.056AsnGln: 1.056 ± 0.344
2.39AsnArg: 2.39 ± 0.414
1.723AsnSer: 1.723 ± 0.332
1.89AsnThr: 1.89 ± 0.275
1.946AsnVal: 1.946 ± 0.288
0.834AsnTrp: 0.834 ± 0.172
0.778AsnTyr: 0.778 ± 0.181
0.0AsnXaa: 0.0 ± 0.0
Pro
5.448ProAla: 5.448 ± 0.622
0.723ProCys: 0.723 ± 0.179
4.225ProAsp: 4.225 ± 0.478
4.614ProGlu: 4.614 ± 0.467
2.001ProPhe: 2.001 ± 0.335
6.726ProGly: 6.726 ± 0.722
1.223ProHis: 1.223 ± 0.236
1.723ProIle: 1.723 ± 0.327
1.89ProLys: 1.89 ± 0.296
4.225ProLeu: 4.225 ± 0.486
1.334ProMet: 1.334 ± 0.307
2.168ProAsn: 2.168 ± 0.327
4.392ProPro: 4.392 ± 0.681
2.112ProGln: 2.112 ± 0.335
3.502ProArg: 3.502 ± 0.535
3.169ProSer: 3.169 ± 0.479
3.28ProThr: 3.28 ± 0.37
4.447ProVal: 4.447 ± 0.563
1.056ProTrp: 1.056 ± 0.212
1.39ProTyr: 1.39 ± 0.243
0.0ProXaa: 0.0 ± 0.0
Gln
4.002GlnAla: 4.002 ± 0.511
0.334GlnCys: 0.334 ± 0.182
1.557GlnAsp: 1.557 ± 0.261
1.946GlnGlu: 1.946 ± 0.311
1.167GlnPhe: 1.167 ± 0.239
2.224GlnGly: 2.224 ± 0.422
0.834GlnHis: 0.834 ± 0.209
1.834GlnIle: 1.834 ± 0.305
1.612GlnLys: 1.612 ± 0.284
2.779GlnLeu: 2.779 ± 0.3
0.556GlnMet: 0.556 ± 0.16
0.834GlnAsn: 0.834 ± 0.183
2.39GlnPro: 2.39 ± 0.36
1.112GlnGln: 1.112 ± 0.265
2.557GlnArg: 2.557 ± 0.368
2.446GlnSer: 2.446 ± 0.36
1.612GlnThr: 1.612 ± 0.338
2.502GlnVal: 2.502 ± 0.346
0.556GlnTrp: 0.556 ± 0.177
0.667GlnTyr: 0.667 ± 0.24
0.0GlnXaa: 0.0 ± 0.0
Arg
7.783ArgAla: 7.783 ± 0.702
1.39ArgCys: 1.39 ± 0.358
4.503ArgAsp: 4.503 ± 0.556
5.392ArgGlu: 5.392 ± 0.763
2.224ArgPhe: 2.224 ± 0.347
4.002ArgGly: 4.002 ± 0.437
1.779ArgHis: 1.779 ± 0.436
4.002ArgIle: 4.002 ± 0.548
2.835ArgLys: 2.835 ± 0.373
4.447ArgLeu: 4.447 ± 0.666
2.724ArgMet: 2.724 ± 0.4
2.613ArgAsn: 2.613 ± 0.35
3.335ArgPro: 3.335 ± 0.392
2.057ArgGln: 2.057 ± 0.39
6.337ArgArg: 6.337 ± 0.926
4.002ArgSer: 4.002 ± 0.422
3.724ArgThr: 3.724 ± 0.561
5.448ArgVal: 5.448 ± 0.652
2.224ArgTrp: 2.224 ± 0.346
2.001ArgTyr: 2.001 ± 0.33
0.0ArgXaa: 0.0 ± 0.0
Ser
5.059SerAla: 5.059 ± 0.597
0.445SerCys: 0.445 ± 0.186
4.336SerAsp: 4.336 ± 0.519
3.558SerGlu: 3.558 ± 0.368
2.112SerPhe: 2.112 ± 0.47
5.892SerGly: 5.892 ± 0.638
1.445SerHis: 1.445 ± 0.279
2.502SerIle: 2.502 ± 0.383
2.279SerLys: 2.279 ± 0.42
3.669SerLeu: 3.669 ± 0.434
1.334SerMet: 1.334 ± 0.233
2.39SerAsn: 2.39 ± 0.465
3.28SerPro: 3.28 ± 0.391
1.612SerGln: 1.612 ± 0.238
3.947SerArg: 3.947 ± 0.467
3.724SerSer: 3.724 ± 0.614
3.613SerThr: 3.613 ± 0.442
4.392SerVal: 4.392 ± 0.503
1.334SerTrp: 1.334 ± 0.237
1.334SerTyr: 1.334 ± 0.234
0.0SerXaa: 0.0 ± 0.0
Thr
6.504ThrAla: 6.504 ± 0.614
1.112ThrCys: 1.112 ± 0.286
4.558ThrAsp: 4.558 ± 0.558
3.391ThrGlu: 3.391 ± 0.394
2.001ThrPhe: 2.001 ± 0.363
6.615ThrGly: 6.615 ± 0.656
1.612ThrHis: 1.612 ± 0.309
3.391ThrIle: 3.391 ± 0.4
1.89ThrLys: 1.89 ± 0.29
4.725ThrLeu: 4.725 ± 0.575
1.167ThrMet: 1.167 ± 0.265
2.057ThrAsn: 2.057 ± 0.318
4.392ThrPro: 4.392 ± 0.623
1.89ThrGln: 1.89 ± 0.294
4.28ThrArg: 4.28 ± 0.526
4.002ThrSer: 4.002 ± 0.458
4.725ThrThr: 4.725 ± 0.6
5.559ThrVal: 5.559 ± 0.606
1.112ThrTrp: 1.112 ± 0.304
1.445ThrTyr: 1.445 ± 0.242
0.0ThrXaa: 0.0 ± 0.0
Val
7.227ValAla: 7.227 ± 0.603
1.056ValCys: 1.056 ± 0.285
5.059ValAsp: 5.059 ± 0.559
4.614ValGlu: 4.614 ± 0.539
2.335ValPhe: 2.335 ± 0.377
5.559ValGly: 5.559 ± 0.661
1.39ValHis: 1.39 ± 0.253
2.835ValIle: 2.835 ± 0.399
2.224ValLys: 2.224 ± 0.328
4.836ValLeu: 4.836 ± 0.541
1.167ValMet: 1.167 ± 0.195
2.168ValAsn: 2.168 ± 0.364
4.503ValPro: 4.503 ± 0.437
3.113ValGln: 3.113 ± 0.356
4.947ValArg: 4.947 ± 0.621
4.614ValSer: 4.614 ± 0.537
5.114ValThr: 5.114 ± 0.502
6.004ValVal: 6.004 ± 0.648
1.779ValTrp: 1.779 ± 0.325
1.334ValTyr: 1.334 ± 0.305
0.0ValXaa: 0.0 ± 0.0
Trp
2.112TrpAla: 2.112 ± 0.323
0.222TrpCys: 0.222 ± 0.099
1.445TrpAsp: 1.445 ± 0.308
1.112TrpGlu: 1.112 ± 0.298
0.723TrpPhe: 0.723 ± 0.195
1.001TrpGly: 1.001 ± 0.223
0.556TrpHis: 0.556 ± 0.158
1.445TrpIle: 1.445 ± 0.252
0.667TrpLys: 0.667 ± 0.16
2.001TrpLeu: 2.001 ± 0.319
0.889TrpMet: 0.889 ± 0.214
0.611TrpAsn: 0.611 ± 0.193
1.223TrpPro: 1.223 ± 0.301
1.112TrpGln: 1.112 ± 0.26
2.279TrpArg: 2.279 ± 0.421
1.557TrpSer: 1.557 ± 0.229
1.557TrpThr: 1.557 ± 0.268
1.612TrpVal: 1.612 ± 0.346
1.001TrpTrp: 1.001 ± 0.212
0.445TrpTyr: 0.445 ± 0.157
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.446TyrAla: 2.446 ± 0.356
0.278TyrCys: 0.278 ± 0.131
1.668TyrAsp: 1.668 ± 0.359
1.668TyrGlu: 1.668 ± 0.249
0.889TyrPhe: 0.889 ± 0.206
1.89TyrGly: 1.89 ± 0.444
0.334TyrHis: 0.334 ± 0.113
0.945TyrIle: 0.945 ± 0.186
0.723TyrLys: 0.723 ± 0.199
2.112TyrLeu: 2.112 ± 0.321
0.222TyrMet: 0.222 ± 0.1
0.556TyrAsn: 0.556 ± 0.168
1.334TyrPro: 1.334 ± 0.243
0.778TyrGln: 0.778 ± 0.179
2.224TyrArg: 2.224 ± 0.357
1.001TyrSer: 1.001 ± 0.24
1.89TyrThr: 1.89 ± 0.351
2.279TyrVal: 2.279 ± 0.326
0.556TyrTrp: 0.556 ± 0.176
0.667TyrTyr: 0.667 ± 0.156
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 100 proteins (17990 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski