Amino acid dipeptide frequency for Mycobacterium phage Solon

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
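As an illustration of how such a frequency could be computed, here is a minimal Python sketch. It is not the code used by the database. The function name `dipeptide_frequencies`, the one-letter alphabet, the mapping of unknown letters to X (Xaa), and the choice to count overlapping pairs within each protein only are all assumptions made for this example.

```python
from collections import Counter
from itertools import product

# One-letter codes in the table's residue order; X stands for Xaa.
ALPHABET = "ACDEFGHIKLMNPQRSTVWYX"

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: any letter outside the 20 standard ones is treated as X.
        seq = "".join(aa if aa in ALPHABET else "X" for aa in seq.upper())
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }

# Toy input, not taken from the phage proteome.
freqs = dipeptide_frequencies(["MAAAG", "MAGK"])
print(len(freqs))   # number of possible dipeptides, including Xaa
print(freqs["AA"])  # share of Ala-Ala among all counted pairs, in per mille
```

Every possible pair is returned, including those with a count of zero, which mirrors the layout of the table where absent dipeptides are listed as 0.0.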

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
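The comparison against the uniform expectation can be sketched in a few lines of Python. The exact rule the database uses for colouring is not stated here, so the simple greater-than/less-than test and the name `classify` are assumptions for illustration only.

```python
# Uniform baseline: each of the 400 standard dipeptides equally likely.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform baseline."""
    if freq_per_mille > expected:
        return "red"    # overrepresented
    if freq_per_mille < expected:
        return "blue"   # underrepresented
    return "none"

# Values copied from the table for this proteome.
observed = {"AlaAla": 12.656, "GlyGly": 8.975, "CysCys": 0.0, "MetMet": 0.065}
labels = {name: classify(value) for name, value in observed.items()}
print(labels)
```

With this baseline, AlaAla and GlyGly fall on the overrepresented side, while CysCys and MetMet fall on the underrepresented side.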

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
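A short sketch of the unit conversion, using the AlaAla entry from the table as input. The helper names are hypothetical and chosen only for this example.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value / 1000.0

def per_mille_to_percent(value):
    """Convert a per-mille value to percent."""
    return value / 10.0

ala_ala = 12.656  # AlaAla entry from the table, in per mille
print(per_mille_to_fraction(ala_ala))  # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))   # the same value in percent
```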

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰, listed as dipeptide: frequency ± error and grouped by first residue.
Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 12.656 ± 1.572
AlaCys: 0.517 ± 0.163
AlaAsp: 6.134 ± 0.648
AlaGlu: 6.78 ± 0.777
AlaPhe: 2.906 ± 0.473
AlaGly: 7.555 ± 0.719
AlaHis: 1.356 ± 0.324
AlaIle: 4.455 ± 0.704
AlaLys: 4.068 ± 0.497
AlaLeu: 9.04 ± 0.831
AlaMet: 2.195 ± 0.353
AlaAsn: 2.518 ± 0.358
AlaPro: 5.295 ± 0.771
AlaGln: 3.099 ± 0.465
AlaArg: 5.682 ± 0.642
AlaSer: 5.23 ± 0.684
AlaThr: 5.811 ± 0.583
AlaVal: 8.2 ± 0.871
AlaTrp: 1.873 ± 0.329
AlaTyr: 2.712 ± 0.411
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.71 ± 0.2
CysCys: 0.0 ± 0.0
CysAsp: 0.387 ± 0.146
CysGlu: 0.581 ± 0.182
CysPhe: 0.194 ± 0.108
CysGly: 0.581 ± 0.241
CysHis: 0.129 ± 0.086
CysIle: 0.258 ± 0.141
CysLys: 0.323 ± 0.162
CysLeu: 0.387 ± 0.175
CysMet: 0.129 ± 0.089
CysAsn: 0.258 ± 0.122
CysPro: 0.258 ± 0.117
CysGln: 0.194 ± 0.106
CysArg: 0.517 ± 0.202
CysSer: 0.387 ± 0.139
CysThr: 0.194 ± 0.098
CysVal: 0.323 ± 0.145
CysTrp: 0.129 ± 0.085
CysTyr: 0.194 ± 0.1
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.134 ± 0.622
AspCys: 0.517 ± 0.164
AspAsp: 4.262 ± 0.44
AspGlu: 4.068 ± 0.513
AspPhe: 2.325 ± 0.309
AspGly: 6.134 ± 0.576
AspHis: 1.162 ± 0.277
AspIle: 2.777 ± 0.492
AspLys: 2.389 ± 0.461
AspLeu: 6.392 ± 0.775
AspMet: 1.098 ± 0.246
AspAsn: 1.743 ± 0.331
AspPro: 5.166 ± 0.623
AspGln: 1.614 ± 0.369
AspArg: 3.681 ± 0.44
AspSer: 3.229 ± 0.476
AspThr: 3.81 ± 0.456
AspVal: 4.778 ± 0.524
AspTrp: 1.873 ± 0.362
AspTyr: 2.066 ± 0.318
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.811 ± 0.751
GluCys: 0.194 ± 0.105
GluAsp: 4.972 ± 0.558
GluGlu: 4.843 ± 0.575
GluPhe: 1.937 ± 0.339
GluGly: 3.81 ± 0.525
GluHis: 1.227 ± 0.3
GluIle: 3.358 ± 0.497
GluLys: 2.906 ± 0.452
GluLeu: 7.296 ± 0.611
GluMet: 1.55 ± 0.315
GluAsn: 1.743 ± 0.387
GluPro: 3.099 ± 0.47
GluGln: 2.712 ± 0.464
GluArg: 3.874 ± 0.592
GluSer: 3.358 ± 0.411
GluThr: 4.003 ± 0.577
GluVal: 5.747 ± 0.563
GluTrp: 1.485 ± 0.355
GluTyr: 2.325 ± 0.462
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.195 ± 0.381
PheCys: 0.387 ± 0.173
PheAsp: 2.583 ± 0.35
PheGlu: 1.679 ± 0.329
PhePhe: 0.387 ± 0.141
PheGly: 3.293 ± 0.477
PheHis: 0.775 ± 0.271
PheIle: 1.227 ± 0.267
PheLys: 1.356 ± 0.25
PheLeu: 2.195 ± 0.389
PheMet: 0.71 ± 0.211
PheAsn: 1.098 ± 0.239
PhePro: 1.55 ± 0.284
PheGln: 0.775 ± 0.189
PheArg: 1.743 ± 0.345
PheSer: 1.937 ± 0.482
PheThr: 2.325 ± 0.423
PheVal: 2.195 ± 0.383
PheTrp: 0.646 ± 0.162
PheTyr: 0.775 ± 0.198
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.296 ± 0.93
GlyCys: 0.452 ± 0.173
GlyAsp: 6.07 ± 0.599
GlyGlu: 4.262 ± 0.495
GlyPhe: 2.97 ± 0.506
GlyGly: 8.975 ± 2.076
GlyHis: 2.066 ± 0.439
GlyIle: 4.262 ± 0.589
GlyLys: 3.81 ± 0.517
GlyLeu: 7.619 ± 0.909
GlyMet: 1.808 ± 0.387
GlyAsn: 3.293 ± 0.455
GlyPro: 3.745 ± 0.598
GlyGln: 2.583 ± 0.387
GlyArg: 4.843 ± 0.573
GlySer: 6.07 ± 0.681
GlyThr: 5.036 ± 0.594
GlyVal: 5.811 ± 0.61
GlyTrp: 2.518 ± 0.469
GlyTyr: 2.712 ± 0.392
GlyXaa: 0.0 ± 0.0

His
HisAla: 1.743 ± 0.393
HisCys: 0.258 ± 0.175
HisAsp: 1.227 ± 0.26
HisGlu: 1.679 ± 0.351
HisPhe: 0.646 ± 0.223
HisGly: 1.485 ± 0.412
HisHis: 0.71 ± 0.201
HisIle: 0.904 ± 0.219
HisLys: 0.969 ± 0.275
HisLeu: 1.421 ± 0.341
HisMet: 0.129 ± 0.081
HisAsn: 0.258 ± 0.136
HisPro: 1.485 ± 0.268
HisGln: 0.839 ± 0.218
HisArg: 1.356 ± 0.351
HisSer: 0.775 ± 0.215
HisThr: 1.098 ± 0.259
HisVal: 1.743 ± 0.347
HisTrp: 0.517 ± 0.183
HisTyr: 0.646 ± 0.222
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.586 ± 0.856
IleCys: 0.258 ± 0.111
IleAsp: 3.229 ± 0.416
IleGlu: 3.487 ± 0.44
IlePhe: 0.904 ± 0.267
IleGly: 4.003 ± 0.464
IleHis: 0.839 ± 0.218
IleIle: 1.679 ± 0.29
IleLys: 1.614 ± 0.348
IleLeu: 3.681 ± 0.462
IleMet: 0.71 ± 0.19
IleAsn: 2.066 ± 0.338
IlePro: 3.164 ± 0.433
IleGln: 1.485 ± 0.388
IleArg: 3.939 ± 0.532
IleSer: 3.035 ± 0.423
IleThr: 3.358 ± 0.444
IleVal: 2.906 ± 0.486
IleTrp: 0.775 ± 0.193
IleTyr: 1.679 ± 0.287
IleXaa: 0.0 ± 0.0

Lys
LysAla: 3.874 ± 0.592
LysCys: 0.258 ± 0.126
LysAsp: 2.454 ± 0.439
LysGlu: 2.131 ± 0.387
LysPhe: 1.614 ± 0.272
LysGly: 2.454 ± 0.402
LysHis: 1.356 ± 0.312
LysIle: 2.26 ± 0.43
LysLys: 1.873 ± 0.368
LysLeu: 3.358 ± 0.47
LysMet: 0.969 ± 0.216
LysAsn: 1.485 ± 0.275
LysPro: 2.583 ± 0.368
LysGln: 1.485 ± 0.328
LysArg: 2.906 ± 0.486
LysSer: 2.195 ± 0.354
LysThr: 2.647 ± 0.474
LysVal: 3.422 ± 0.416
LysTrp: 0.775 ± 0.193
LysTyr: 1.098 ± 0.366
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 9.363 ± 0.879
LeuCys: 0.258 ± 0.133
LeuAsp: 6.199 ± 0.565
LeuGlu: 5.682 ± 0.614
LeuPhe: 1.873 ± 0.353
LeuGly: 7.49 ± 0.889
LeuHis: 1.485 ± 0.341
LeuIle: 4.714 ± 0.575
LeuLys: 3.939 ± 0.51
LeuLeu: 5.94 ± 0.555
LeuMet: 1.679 ± 0.283
LeuAsn: 3.229 ± 0.515
LeuPro: 5.295 ± 0.524
LeuGln: 2.583 ± 0.449
LeuArg: 5.811 ± 0.617
LeuSer: 6.07 ± 0.548
LeuThr: 6.07 ± 0.502
LeuVal: 5.295 ± 0.64
LeuTrp: 1.098 ± 0.326
LeuTyr: 2.26 ± 0.382
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.195 ± 0.325
MetCys: 0.0 ± 0.0
MetAsp: 1.291 ± 0.262
MetGlu: 1.55 ± 0.273
MetPhe: 0.517 ± 0.16
MetGly: 1.485 ± 0.285
MetHis: 0.258 ± 0.136
MetIle: 0.452 ± 0.149
MetLys: 0.904 ± 0.228
MetLeu: 1.162 ± 0.304
MetMet: 0.065 ± 0.071
MetAsn: 0.775 ± 0.179
MetPro: 1.162 ± 0.281
MetGln: 0.775 ± 0.191
MetArg: 1.098 ± 0.286
MetSer: 2.195 ± 0.509
MetThr: 2.066 ± 0.289
MetVal: 0.904 ± 0.213
MetTrp: 0.194 ± 0.112
MetTyr: 0.387 ± 0.148
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.099 ± 0.474
AsnCys: 0.065 ± 0.065
AsnAsp: 1.808 ± 0.393
AsnGlu: 1.614 ± 0.364
AsnPhe: 0.839 ± 0.271
AsnGly: 3.745 ± 0.582
AsnHis: 0.646 ± 0.199
AsnIle: 1.55 ± 0.308
AsnLys: 0.71 ± 0.211
AsnLeu: 2.583 ± 0.352
AsnMet: 0.646 ± 0.147
AsnAsn: 0.839 ± 0.249
AsnPro: 2.777 ± 0.4
AsnGln: 1.033 ± 0.245
AsnArg: 1.614 ± 0.348
AsnSer: 1.873 ± 0.41
AsnThr: 1.937 ± 0.359
AsnVal: 2.583 ± 0.402
AsnTrp: 0.775 ± 0.195
AsnTyr: 1.421 ± 0.327
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 5.036 ± 0.62
ProCys: 0.452 ± 0.175
ProAsp: 4.455 ± 0.559
ProGlu: 4.778 ± 0.564
ProPhe: 2.131 ± 0.413
ProGly: 5.036 ± 0.647
ProHis: 0.775 ± 0.243
ProIle: 2.454 ± 0.401
ProLys: 2.131 ± 0.303
ProLeu: 4.52 ± 0.594
ProMet: 0.839 ± 0.288
ProAsn: 1.55 ± 0.293
ProPro: 2.777 ± 0.458
ProGln: 1.356 ± 0.278
ProArg: 3.099 ± 0.632
ProSer: 4.132 ± 0.51
ProThr: 4.455 ± 0.638
ProVal: 3.81 ± 0.459
ProTrp: 0.839 ± 0.271
ProTyr: 1.614 ± 0.349
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 2.518 ± 0.424
GlnCys: 0.194 ± 0.104
GlnAsp: 1.227 ± 0.348
GlnGlu: 1.808 ± 0.315
GlnPhe: 1.098 ± 0.272
GlnGly: 2.518 ± 0.389
GlnHis: 0.581 ± 0.184
GlnIle: 3.293 ± 0.595
GlnLys: 0.969 ± 0.235
GlnLeu: 3.874 ± 0.487
GlnMet: 0.904 ± 0.238
GlnAsn: 0.517 ± 0.141
GlnPro: 1.743 ± 0.379
GlnGln: 1.808 ± 0.413
GlnArg: 1.937 ± 0.351
GlnSer: 1.679 ± 0.337
GlnThr: 1.873 ± 0.329
GlnVal: 2.647 ± 0.42
GlnTrp: 0.646 ± 0.184
GlnTyr: 0.581 ± 0.185
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 5.747 ± 0.617
ArgCys: 0.839 ± 0.293
ArgAsp: 2.97 ± 0.434
ArgGlu: 5.036 ± 0.67
ArgPhe: 1.808 ± 0.362
ArgGly: 4.907 ± 0.64
ArgHis: 1.162 ± 0.263
ArgIle: 3.293 ± 0.555
ArgLys: 3.616 ± 0.561
ArgLeu: 5.747 ± 0.744
ArgMet: 1.679 ± 0.324
ArgAsn: 2.066 ± 0.42
ArgPro: 2.712 ± 0.436
ArgGln: 2.26 ± 0.431
ArgArg: 5.424 ± 0.777
ArgSer: 3.745 ± 0.483
ArgThr: 3.293 ± 0.552
ArgVal: 4.843 ± 0.568
ArgTrp: 1.098 ± 0.254
ArgTyr: 1.55 ± 0.277
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 6.328 ± 0.809
SerCys: 0.517 ± 0.166
SerAsp: 3.616 ± 0.453
SerGlu: 3.81 ± 0.507
SerPhe: 1.679 ± 0.361
SerGly: 6.715 ± 0.794
SerHis: 1.743 ± 0.312
SerIle: 2.97 ± 0.528
SerLys: 2.389 ± 0.439
SerLeu: 4.972 ± 0.596
SerMet: 1.291 ± 0.287
SerAsn: 2.26 ± 0.39
SerPro: 3.229 ± 0.489
SerGln: 2.002 ± 0.295
SerArg: 3.229 ± 0.401
SerSer: 3.681 ± 0.76
SerThr: 3.422 ± 0.497
SerVal: 3.81 ± 0.398
SerTrp: 1.421 ± 0.345
SerTyr: 1.485 ± 0.27
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 5.94 ± 0.861
ThrCys: 0.258 ± 0.118
ThrAsp: 3.939 ± 0.535
ThrGlu: 4.068 ± 0.486
ThrPhe: 2.26 ± 0.399
ThrGly: 6.974 ± 0.694
ThrHis: 1.162 ± 0.276
ThrIle: 2.906 ± 0.587
ThrLys: 2.583 ± 0.39
ThrLeu: 6.328 ± 0.706
ThrMet: 0.775 ± 0.197
ThrAsn: 1.937 ± 0.389
ThrPro: 4.262 ± 0.53
ThrGln: 1.808 ± 0.384
ThrArg: 3.745 ± 0.57
ThrSer: 3.616 ± 0.508
ThrThr: 4.455 ± 0.642
ThrVal: 5.23 ± 0.565
ThrTrp: 0.969 ± 0.204
ThrTyr: 2.066 ± 0.342
ThrXaa: 0.0 ± 0.0

Val
ValAla: 6.974 ± 0.704
ValCys: 0.323 ± 0.113
ValAsp: 5.682 ± 0.675
ValGlu: 4.843 ± 0.493
ValPhe: 2.26 ± 0.362
ValGly: 4.778 ± 0.708
ValHis: 1.356 ± 0.271
ValIle: 3.874 ± 0.573
ValLys: 3.035 ± 0.371
ValLeu: 5.23 ± 0.653
ValMet: 1.227 ± 0.307
ValAsn: 2.583 ± 0.364
ValPro: 4.068 ± 0.506
ValGln: 2.325 ± 0.395
ValArg: 5.488 ± 0.693
ValSer: 4.907 ± 0.482
ValThr: 5.682 ± 0.612
ValVal: 5.166 ± 0.666
ValTrp: 1.291 ± 0.262
ValTyr: 2.002 ± 0.494
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.421 ± 0.265
TrpCys: 0.258 ± 0.116
TrpAsp: 1.421 ± 0.314
TrpGlu: 1.033 ± 0.227
TrpPhe: 0.904 ± 0.199
TrpGly: 1.743 ± 0.292
TrpHis: 0.452 ± 0.169
TrpIle: 1.227 ± 0.256
TrpLys: 0.323 ± 0.19
TrpLeu: 2.066 ± 0.334
TrpMet: 0.452 ± 0.178
TrpAsn: 0.646 ± 0.223
TrpPro: 0.71 ± 0.236
TrpGln: 0.839 ± 0.204
TrpArg: 1.227 ± 0.303
TrpSer: 0.839 ± 0.273
TrpThr: 1.614 ± 0.375
TrpVal: 1.937 ± 0.305
TrpTrp: 0.71 ± 0.266
TrpTyr: 0.258 ± 0.122
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.454 ± 0.393
TyrCys: 0.129 ± 0.091
TyrAsp: 1.291 ± 0.302
TyrGlu: 2.518 ± 0.405
TyrPhe: 0.581 ± 0.206
TyrGly: 2.583 ± 0.397
TyrHis: 0.646 ± 0.195
TyrIle: 1.485 ± 0.332
TyrLys: 1.291 ± 0.268
TyrLeu: 2.518 ± 0.43
TyrMet: 0.517 ± 0.156
TyrAsn: 1.291 ± 0.327
TyrPro: 1.162 ± 0.302
TyrGln: 0.839 ± 0.225
TyrArg: 2.583 ± 0.438
TyrSer: 1.614 ± 0.371
TyrThr: 2.131 ± 0.413
TyrVal: 1.743 ± 0.307
TyrTrp: 0.452 ± 0.174
TyrTyr: 0.581 ± 0.2
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 86 proteins (15488 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
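Protein-level bootstrapping can be sketched as follows. This is an assumption-laden illustration, not the database's code: it resamples whole proteins with replacement 100 times, recomputes the frequency each time, and reports the standard deviation of those estimates as the error. The function names, the fixed seed, and the use of the standard deviation as the error measure are all choices made for this example.

```python
import random
import statistics

def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == dipeptide
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the sample size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.stdev(estimates)

# Toy sequences, not taken from the phage proteome.
toy = ["MAAAGK", "MAGKL", "MKLAAV", "MSTAAR"]
point = dipeptide_per_mille(toy, "AA")
error = bootstrap_error(toy, "AA")
print(f"AA: {point:.3f} ± {error:.3f} per mille")
```

Resampling proteins rather than individual residues keeps each sequence intact, so the estimate reflects variation between proteins in the proteome.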

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski