Amino acid dipepetide frequency for Mycobacterium phage JF4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.677AlaAla: 10.677 ± 1.23
0.837AlaCys: 0.837 ± 0.245
5.513AlaAsp: 5.513 ± 0.671
6.839AlaGlu: 6.839 ± 0.763
3.001AlaPhe: 3.001 ± 0.553
7.397AlaGly: 7.397 ± 0.799
1.954AlaHis: 1.954 ± 0.406
3.629AlaIle: 3.629 ± 0.593
4.536AlaLys: 4.536 ± 0.678
9.351AlaLeu: 9.351 ± 0.928
2.512AlaMet: 2.512 ± 0.358
2.163AlaAsn: 2.163 ± 0.406
5.722AlaPro: 5.722 ± 0.782
4.466AlaGln: 4.466 ± 0.507
5.722AlaArg: 5.722 ± 0.53
5.304AlaSer: 5.304 ± 0.59
5.304AlaThr: 5.304 ± 0.563
7.537AlaVal: 7.537 ± 0.693
2.094AlaTrp: 2.094 ± 0.335
2.094AlaTyr: 2.094 ± 0.368
0.0AlaXaa: 0.0 ± 0.0
Cys
0.419CysAla: 0.419 ± 0.172
0.14CysCys: 0.14 ± 0.166
0.419CysAsp: 0.419 ± 0.173
0.907CysGlu: 0.907 ± 0.27
0.14CysPhe: 0.14 ± 0.092
0.768CysGly: 0.768 ± 0.204
0.279CysHis: 0.279 ± 0.12
0.279CysIle: 0.279 ± 0.174
0.349CysLys: 0.349 ± 0.156
0.698CysLeu: 0.698 ± 0.238
0.279CysMet: 0.279 ± 0.129
0.419CysAsn: 0.419 ± 0.16
0.558CysPro: 0.558 ± 0.254
0.279CysGln: 0.279 ± 0.135
0.698CysArg: 0.698 ± 0.204
0.349CysSer: 0.349 ± 0.168
0.279CysThr: 0.279 ± 0.201
0.419CysVal: 0.419 ± 0.152
0.07CysTrp: 0.07 ± 0.065
0.349CysTyr: 0.349 ± 0.164
0.0CysXaa: 0.0 ± 0.0
Asp
6.211AspAla: 6.211 ± 0.813
0.558AspCys: 0.558 ± 0.173
4.396AspAsp: 4.396 ± 0.632
4.396AspGlu: 4.396 ± 0.572
2.512AspPhe: 2.512 ± 0.44
4.885AspGly: 4.885 ± 0.461
1.605AspHis: 1.605 ± 0.36
2.652AspIle: 2.652 ± 0.422
1.675AspLys: 1.675 ± 0.323
5.513AspLeu: 5.513 ± 0.567
1.465AspMet: 1.465 ± 0.316
2.024AspAsn: 2.024 ± 0.366
4.327AspPro: 4.327 ± 0.688
1.884AspGln: 1.884 ± 0.37
4.257AspArg: 4.257 ± 0.678
3.14AspSer: 3.14 ± 0.42
4.047AspThr: 4.047 ± 0.522
3.908AspVal: 3.908 ± 0.438
1.535AspTrp: 1.535 ± 0.305
2.163AspTyr: 2.163 ± 0.4
0.0AspXaa: 0.0 ± 0.0
Glu
7.467GluAla: 7.467 ± 0.747
0.488GluCys: 0.488 ± 0.186
4.676GluAsp: 4.676 ± 0.709
4.396GluGlu: 4.396 ± 0.594
2.861GluPhe: 2.861 ± 0.426
5.094GluGly: 5.094 ± 0.612
1.465GluHis: 1.465 ± 0.377
2.582GluIle: 2.582 ± 0.461
2.442GluLys: 2.442 ± 0.369
6.42GluLeu: 6.42 ± 0.693
1.954GluMet: 1.954 ± 0.356
2.442GluAsn: 2.442 ± 0.408
3.07GluPro: 3.07 ± 0.561
2.512GluGln: 2.512 ± 0.402
4.396GluArg: 4.396 ± 0.634
3.978GluSer: 3.978 ± 0.544
3.768GluThr: 3.768 ± 0.492
5.443GluVal: 5.443 ± 0.61
1.117GluTrp: 1.117 ± 0.273
2.303GluTyr: 2.303 ± 0.47
0.0GluXaa: 0.0 ± 0.0
Phe
3.489PheAla: 3.489 ± 0.552
0.279PheCys: 0.279 ± 0.184
2.931PheAsp: 2.931 ± 0.506
2.652PheGlu: 2.652 ± 0.404
0.488PhePhe: 0.488 ± 0.218
2.861PheGly: 2.861 ± 0.524
0.558PheHis: 0.558 ± 0.213
1.465PheIle: 1.465 ± 0.283
0.977PheLys: 0.977 ± 0.276
3.14PheLeu: 3.14 ± 0.467
0.628PheMet: 0.628 ± 0.206
1.814PheAsn: 1.814 ± 0.391
1.675PhePro: 1.675 ± 0.323
1.117PheGln: 1.117 ± 0.243
1.954PheArg: 1.954 ± 0.356
1.745PheSer: 1.745 ± 0.409
1.814PheThr: 1.814 ± 0.362
2.303PheVal: 2.303 ± 0.397
0.488PheTrp: 0.488 ± 0.178
0.698PheTyr: 0.698 ± 0.211
0.0PheXaa: 0.0 ± 0.0
Gly
6.909GlyAla: 6.909 ± 0.813
0.977GlyCys: 0.977 ± 0.238
4.955GlyAsp: 4.955 ± 0.684
4.327GlyGlu: 4.327 ± 0.505
3.489GlyPhe: 3.489 ± 0.519
7.537GlyGly: 7.537 ± 1.118
1.745GlyHis: 1.745 ± 0.343
4.117GlyIle: 4.117 ± 0.546
3.559GlyLys: 3.559 ± 0.496
6.071GlyLeu: 6.071 ± 0.922
1.954GlyMet: 1.954 ± 0.404
3.629GlyAsn: 3.629 ± 0.622
5.443GlyPro: 5.443 ± 1.997
3.21GlyGln: 3.21 ± 0.519
4.955GlyArg: 4.955 ± 0.591
4.466GlySer: 4.466 ± 0.545
4.955GlyThr: 4.955 ± 0.682
5.304GlyVal: 5.304 ± 0.526
1.396GlyTrp: 1.396 ± 0.256
2.861GlyTyr: 2.861 ± 0.386
0.0GlyXaa: 0.0 ± 0.0
His
1.954HisAla: 1.954 ± 0.353
0.14HisCys: 0.14 ± 0.096
1.396HisAsp: 1.396 ± 0.315
1.186HisGlu: 1.186 ± 0.265
0.698HisPhe: 0.698 ± 0.199
2.512HisGly: 2.512 ± 0.481
0.628HisHis: 0.628 ± 0.18
1.186HisIle: 1.186 ± 0.23
0.768HisLys: 0.768 ± 0.209
1.326HisLeu: 1.326 ± 0.362
0.279HisMet: 0.279 ± 0.115
0.628HisAsn: 0.628 ± 0.189
1.605HisPro: 1.605 ± 0.366
1.047HisGln: 1.047 ± 0.321
1.884HisArg: 1.884 ± 0.362
0.628HisSer: 0.628 ± 0.263
1.186HisThr: 1.186 ± 0.247
1.047HisVal: 1.047 ± 0.246
0.349HisTrp: 0.349 ± 0.173
0.837HisTyr: 0.837 ± 0.245
0.0HisXaa: 0.0 ± 0.0
Ile
5.024IleAla: 5.024 ± 0.69
0.349IleCys: 0.349 ± 0.134
3.489IleAsp: 3.489 ± 0.479
3.908IleGlu: 3.908 ± 0.482
1.117IlePhe: 1.117 ± 0.306
3.838IleGly: 3.838 ± 0.394
1.047IleHis: 1.047 ± 0.267
1.745IleIle: 1.745 ± 0.335
1.256IleLys: 1.256 ± 0.311
2.512IleLeu: 2.512 ± 0.477
0.628IleMet: 0.628 ± 0.243
1.745IleAsn: 1.745 ± 0.323
3.35IlePro: 3.35 ± 0.416
1.814IleGln: 1.814 ± 0.417
3.768IleArg: 3.768 ± 0.472
2.233IleSer: 2.233 ± 0.579
3.629IleThr: 3.629 ± 0.439
3.28IleVal: 3.28 ± 0.415
0.768IleTrp: 0.768 ± 0.241
1.186IleTyr: 1.186 ± 0.289
0.0IleXaa: 0.0 ± 0.0
Lys
4.047LysAla: 4.047 ± 0.462
0.279LysCys: 0.279 ± 0.179
2.512LysAsp: 2.512 ± 0.409
2.931LysGlu: 2.931 ± 0.446
0.837LysPhe: 0.837 ± 0.26
3.838LysGly: 3.838 ± 0.562
0.977LysHis: 0.977 ± 0.292
1.954LysIle: 1.954 ± 0.384
2.582LysLys: 2.582 ± 0.469
3.35LysLeu: 3.35 ± 0.383
0.837LysMet: 0.837 ± 0.181
1.117LysAsn: 1.117 ± 0.307
2.442LysPro: 2.442 ± 0.54
1.326LysGln: 1.326 ± 0.264
2.722LysArg: 2.722 ± 0.526
2.652LysSer: 2.652 ± 0.475
2.582LysThr: 2.582 ± 0.59
3.908LysVal: 3.908 ± 0.557
0.837LysTrp: 0.837 ± 0.252
1.117LysTyr: 1.117 ± 0.275
0.0LysXaa: 0.0 ± 0.0
Leu
8.234LeuAla: 8.234 ± 0.801
0.349LeuCys: 0.349 ± 0.183
5.862LeuAsp: 5.862 ± 0.631
6.071LeuGlu: 6.071 ± 0.68
2.094LeuPhe: 2.094 ± 0.319
6.211LeuGly: 6.211 ± 0.827
1.396LeuHis: 1.396 ± 0.377
4.955LeuIle: 4.955 ± 0.568
3.629LeuLys: 3.629 ± 0.414
6.281LeuLeu: 6.281 ± 0.566
2.373LeuMet: 2.373 ± 0.322
2.722LeuAsn: 2.722 ± 0.438
4.187LeuPro: 4.187 ± 0.411
2.512LeuGln: 2.512 ± 0.792
6.839LeuArg: 6.839 ± 0.671
4.117LeuSer: 4.117 ± 0.544
5.234LeuThr: 5.234 ± 0.795
5.094LeuVal: 5.094 ± 0.481
1.396LeuTrp: 1.396 ± 0.3
2.582LeuTyr: 2.582 ± 0.496
0.0LeuXaa: 0.0 ± 0.0
Met
3.28MetAla: 3.28 ± 0.531
0.0MetCys: 0.0 ± 0.0
1.745MetAsp: 1.745 ± 0.338
1.396MetGlu: 1.396 ± 0.277
0.768MetPhe: 0.768 ± 0.244
1.814MetGly: 1.814 ± 0.4
0.209MetHis: 0.209 ± 0.143
1.326MetIle: 1.326 ± 0.292
1.535MetLys: 1.535 ± 0.262
1.396MetLeu: 1.396 ± 0.292
0.558MetMet: 0.558 ± 0.149
1.256MetAsn: 1.256 ± 0.286
1.535MetPro: 1.535 ± 0.324
0.628MetGln: 0.628 ± 0.278
1.535MetArg: 1.535 ± 0.358
1.884MetSer: 1.884 ± 0.341
1.605MetThr: 1.605 ± 0.379
1.256MetVal: 1.256 ± 0.291
0.209MetTrp: 0.209 ± 0.122
0.628MetTyr: 0.628 ± 0.199
0.0MetXaa: 0.0 ± 0.0
Asn
3.419AsnAla: 3.419 ± 0.492
0.279AsnCys: 0.279 ± 0.149
2.024AsnAsp: 2.024 ± 0.351
1.814AsnGlu: 1.814 ± 0.335
1.326AsnPhe: 1.326 ± 0.366
3.699AsnGly: 3.699 ± 0.706
1.047AsnHis: 1.047 ± 0.238
1.745AsnIle: 1.745 ± 0.437
1.117AsnLys: 1.117 ± 0.259
2.582AsnLeu: 2.582 ± 0.396
0.768AsnMet: 0.768 ± 0.22
1.047AsnAsn: 1.047 ± 0.292
2.652AsnPro: 2.652 ± 0.581
1.326AsnGln: 1.326 ± 0.296
1.954AsnArg: 1.954 ± 0.37
1.535AsnSer: 1.535 ± 0.308
1.884AsnThr: 1.884 ± 0.324
3.001AsnVal: 3.001 ± 0.573
0.768AsnTrp: 0.768 ± 0.22
1.396AsnTyr: 1.396 ± 0.327
0.0AsnXaa: 0.0 ± 0.0
Pro
5.652ProAla: 5.652 ± 0.527
0.419ProCys: 0.419 ± 0.212
3.559ProAsp: 3.559 ± 0.503
4.117ProGlu: 4.117 ± 0.517
1.745ProPhe: 1.745 ± 0.287
5.373ProGly: 5.373 ± 0.692
1.117ProHis: 1.117 ± 0.23
2.442ProIle: 2.442 ± 0.338
3.35ProLys: 3.35 ± 0.647
4.606ProLeu: 4.606 ± 0.599
0.768ProMet: 0.768 ± 0.245
2.722ProAsn: 2.722 ± 0.604
3.001ProPro: 3.001 ± 0.569
3.28ProGln: 3.28 ± 1.218
2.931ProArg: 2.931 ± 0.601
2.861ProSer: 2.861 ± 0.346
3.559ProThr: 3.559 ± 0.531
4.396ProVal: 4.396 ± 0.574
0.977ProTrp: 0.977 ± 0.432
1.047ProTyr: 1.047 ± 0.267
0.0ProXaa: 0.0 ± 0.0
Gln
4.257GlnAla: 4.257 ± 0.552
0.14GlnCys: 0.14 ± 0.093
1.326GlnAsp: 1.326 ± 0.284
2.512GlnGlu: 2.512 ± 0.579
1.465GlnPhe: 1.465 ± 0.369
4.047GlnGly: 4.047 ± 1.713
0.837GlnHis: 0.837 ± 0.248
1.884GlnIle: 1.884 ± 0.363
1.256GlnLys: 1.256 ± 0.299
3.768GlnLeu: 3.768 ± 0.632
1.326GlnMet: 1.326 ± 0.342
0.768GlnAsn: 0.768 ± 0.248
1.675GlnPro: 1.675 ± 0.358
2.233GlnGln: 2.233 ± 0.618
2.861GlnArg: 2.861 ± 0.473
1.675GlnSer: 1.675 ± 0.364
1.814GlnThr: 1.814 ± 0.416
2.652GlnVal: 2.652 ± 0.513
0.907GlnTrp: 0.907 ± 0.221
0.768GlnTyr: 0.768 ± 0.187
0.0GlnXaa: 0.0 ± 0.0
Arg
5.513ArgAla: 5.513 ± 0.577
0.837ArgCys: 0.837 ± 0.295
3.35ArgAsp: 3.35 ± 0.572
4.955ArgGlu: 4.955 ± 0.652
2.373ArgPhe: 2.373 ± 0.387
4.466ArgGly: 4.466 ± 0.49
1.465ArgHis: 1.465 ± 0.305
3.908ArgIle: 3.908 ± 0.472
3.629ArgLys: 3.629 ± 0.602
5.304ArgLeu: 5.304 ± 0.607
2.442ArgMet: 2.442 ± 0.397
2.442ArgAsn: 2.442 ± 0.402
3.489ArgPro: 3.489 ± 0.537
2.024ArgGln: 2.024 ± 0.371
6.42ArgArg: 6.42 ± 0.91
3.768ArgSer: 3.768 ± 0.523
2.861ArgThr: 2.861 ± 0.418
5.792ArgVal: 5.792 ± 0.462
1.675ArgTrp: 1.675 ± 0.403
1.954ArgTyr: 1.954 ± 0.431
0.0ArgXaa: 0.0 ± 0.0
Ser
3.699SerAla: 3.699 ± 0.556
0.14SerCys: 0.14 ± 0.088
3.14SerAsp: 3.14 ± 0.427
4.187SerGlu: 4.187 ± 0.483
2.722SerPhe: 2.722 ± 0.402
4.885SerGly: 4.885 ± 0.716
1.256SerHis: 1.256 ± 0.31
2.442SerIle: 2.442 ± 0.358
2.373SerLys: 2.373 ± 0.497
4.047SerLeu: 4.047 ± 0.683
1.326SerMet: 1.326 ± 0.273
2.024SerAsn: 2.024 ± 0.33
2.094SerPro: 2.094 ± 0.336
1.814SerGln: 1.814 ± 0.453
3.768SerArg: 3.768 ± 0.453
2.652SerSer: 2.652 ± 0.435
2.303SerThr: 2.303 ± 0.375
4.047SerVal: 4.047 ± 0.472
1.326SerTrp: 1.326 ± 0.326
1.535SerTyr: 1.535 ± 0.291
0.0SerXaa: 0.0 ± 0.0
Thr
5.164ThrAla: 5.164 ± 0.547
0.698ThrCys: 0.698 ± 0.275
3.21ThrAsp: 3.21 ± 0.541
3.629ThrGlu: 3.629 ± 0.472
1.745ThrPhe: 1.745 ± 0.355
4.466ThrGly: 4.466 ± 0.656
1.186ThrHis: 1.186 ± 0.386
2.303ThrIle: 2.303 ± 0.485
2.373ThrLys: 2.373 ± 0.445
5.932ThrLeu: 5.932 ± 0.732
1.117ThrMet: 1.117 ± 0.236
1.745ThrAsn: 1.745 ± 0.455
4.047ThrPro: 4.047 ± 0.513
2.373ThrGln: 2.373 ± 0.443
3.489ThrArg: 3.489 ± 0.536
2.512ThrSer: 2.512 ± 0.508
3.001ThrThr: 3.001 ± 0.446
4.955ThrVal: 4.955 ± 0.655
1.047ThrTrp: 1.047 ± 0.301
1.535ThrTyr: 1.535 ± 0.334
0.0ThrXaa: 0.0 ± 0.0
Val
7.048ValAla: 7.048 ± 0.781
0.558ValCys: 0.558 ± 0.146
5.094ValAsp: 5.094 ± 0.68
5.583ValGlu: 5.583 ± 0.663
2.373ValPhe: 2.373 ± 0.499
4.466ValGly: 4.466 ± 0.658
1.605ValHis: 1.605 ± 0.378
3.14ValIle: 3.14 ± 0.479
3.978ValLys: 3.978 ± 0.535
5.583ValLeu: 5.583 ± 0.56
1.535ValMet: 1.535 ± 0.33
3.001ValAsn: 3.001 ± 0.404
4.676ValPro: 4.676 ± 0.602
2.512ValGln: 2.512 ± 0.368
5.234ValArg: 5.234 ± 0.706
4.257ValSer: 4.257 ± 0.49
4.047ValThr: 4.047 ± 0.564
6.141ValVal: 6.141 ± 0.519
1.186ValTrp: 1.186 ± 0.341
2.233ValTyr: 2.233 ± 0.46
0.0ValXaa: 0.0 ± 0.0
Trp
1.675TrpAla: 1.675 ± 0.395
0.349TrpCys: 0.349 ± 0.179
1.326TrpAsp: 1.326 ± 0.307
1.186TrpGlu: 1.186 ± 0.254
0.837TrpPhe: 0.837 ± 0.248
1.814TrpGly: 1.814 ± 0.25
0.279TrpHis: 0.279 ± 0.161
1.814TrpIle: 1.814 ± 0.316
0.628TrpLys: 0.628 ± 0.198
1.256TrpLeu: 1.256 ± 0.311
0.837TrpMet: 0.837 ± 0.213
0.558TrpAsn: 0.558 ± 0.207
0.837TrpPro: 0.837 ± 0.278
0.768TrpGln: 0.768 ± 0.202
0.907TrpArg: 0.907 ± 0.234
0.837TrpSer: 0.837 ± 0.211
1.117TrpThr: 1.117 ± 0.31
1.396TrpVal: 1.396 ± 0.287
0.628TrpTrp: 0.628 ± 0.191
0.419TrpTyr: 0.419 ± 0.179
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.512TyrAla: 2.512 ± 0.465
0.279TyrCys: 0.279 ± 0.125
2.233TyrAsp: 2.233 ± 0.528
1.884TyrGlu: 1.884 ± 0.424
0.488TyrPhe: 0.488 ± 0.156
1.884TyrGly: 1.884 ± 0.38
0.628TyrHis: 0.628 ± 0.194
1.117TyrIle: 1.117 ± 0.297
0.837TyrLys: 0.837 ± 0.271
3.001TyrLeu: 3.001 ± 0.442
0.907TyrMet: 0.907 ± 0.254
1.117TyrAsn: 1.117 ± 0.268
1.535TyrPro: 1.535 ± 0.27
1.047TyrGln: 1.047 ± 0.23
2.373TyrArg: 2.373 ± 0.496
1.256TyrSer: 1.256 ± 0.278
1.535TyrThr: 1.535 ± 0.328
2.373TyrVal: 2.373 ± 0.387
0.698TyrTrp: 0.698 ± 0.231
0.907TyrTyr: 0.907 ± 0.327
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 79 proteins (14331 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski