Amino acid dipepetide frequency for Mycobacterium phage Newman

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
18.188AlaAla: 18.188 ± 1.536
1.163AlaCys: 1.163 ± 0.276
7.396AlaAsp: 7.396 ± 0.505
7.443AlaGlu: 7.443 ± 0.814
2.279AlaPhe: 2.279 ± 0.389
10.838AlaGly: 10.838 ± 1.237
2.512AlaHis: 2.512 ± 0.388
4.512AlaIle: 4.512 ± 0.593
3.628AlaLys: 3.628 ± 0.509
10.652AlaLeu: 10.652 ± 0.863
3.21AlaMet: 3.21 ± 0.418
3.117AlaAsn: 3.117 ± 0.398
6.791AlaPro: 6.791 ± 0.497
4.047AlaGln: 4.047 ± 0.502
8.373AlaArg: 8.373 ± 0.791
5.489AlaSer: 5.489 ± 0.555
7.954AlaThr: 7.954 ± 0.611
9.071AlaVal: 9.071 ± 0.866
2.0AlaTrp: 2.0 ± 0.346
3.024AlaTyr: 3.024 ± 0.301
0.0AlaXaa: 0.0 ± 0.0
Cys
0.744CysAla: 0.744 ± 0.195
0.0CysCys: 0.0 ± 0.0
0.977CysAsp: 0.977 ± 0.225
0.651CysGlu: 0.651 ± 0.205
0.326CysPhe: 0.326 ± 0.123
1.442CysGly: 1.442 ± 0.263
0.326CysHis: 0.326 ± 0.123
0.233CysIle: 0.233 ± 0.101
0.233CysLys: 0.233 ± 0.101
0.698CysLeu: 0.698 ± 0.185
0.233CysMet: 0.233 ± 0.106
0.279CysAsn: 0.279 ± 0.108
0.884CysPro: 0.884 ± 0.259
0.233CysGln: 0.233 ± 0.098
0.884CysArg: 0.884 ± 0.259
0.512CysSer: 0.512 ± 0.143
0.558CysThr: 0.558 ± 0.17
0.977CysVal: 0.977 ± 0.209
0.233CysTrp: 0.233 ± 0.118
0.326CysTyr: 0.326 ± 0.117
0.0CysXaa: 0.0 ± 0.0
Asp
7.256AspAla: 7.256 ± 0.501
1.023AspCys: 1.023 ± 0.244
4.977AspAsp: 4.977 ± 0.586
4.977AspGlu: 4.977 ± 0.636
1.675AspPhe: 1.675 ± 0.253
6.047AspGly: 6.047 ± 0.617
1.209AspHis: 1.209 ± 0.291
3.163AspIle: 3.163 ± 0.456
1.907AspLys: 1.907 ± 0.387
5.675AspLeu: 5.675 ± 0.527
0.93AspMet: 0.93 ± 0.3
2.047AspAsn: 2.047 ± 0.333
5.21AspPro: 5.21 ± 0.55
1.489AspGln: 1.489 ± 0.269
4.466AspArg: 4.466 ± 0.547
2.884AspSer: 2.884 ± 0.344
3.721AspThr: 3.721 ± 0.47
4.14AspVal: 4.14 ± 0.354
1.07AspTrp: 1.07 ± 0.218
1.861AspTyr: 1.861 ± 0.267
0.0AspXaa: 0.0 ± 0.0
Glu
7.024GluAla: 7.024 ± 0.732
0.698GluCys: 0.698 ± 0.215
3.489GluAsp: 3.489 ± 0.511
2.14GluGlu: 2.14 ± 0.283
2.186GluPhe: 2.186 ± 0.305
4.466GluGly: 4.466 ± 0.555
1.209GluHis: 1.209 ± 0.255
3.861GluIle: 3.861 ± 0.543
1.582GluLys: 1.582 ± 0.284
6.466GluLeu: 6.466 ± 0.647
0.884GluMet: 0.884 ± 0.168
1.116GluAsn: 1.116 ± 0.227
3.396GluPro: 3.396 ± 0.573
2.512GluGln: 2.512 ± 0.34
4.372GluArg: 4.372 ± 0.43
1.768GluSer: 1.768 ± 0.285
3.582GluThr: 3.582 ± 0.418
5.861GluVal: 5.861 ± 0.655
1.07GluTrp: 1.07 ± 0.23
1.07GluTyr: 1.07 ± 0.217
0.0GluXaa: 0.0 ± 0.0
Phe
2.931PheAla: 2.931 ± 0.349
0.279PheCys: 0.279 ± 0.099
1.582PheAsp: 1.582 ± 0.265
1.395PheGlu: 1.395 ± 0.249
0.419PhePhe: 0.419 ± 0.131
2.977PheGly: 2.977 ± 0.453
0.465PheHis: 0.465 ± 0.125
0.651PheIle: 0.651 ± 0.194
1.116PheLys: 1.116 ± 0.213
1.442PheLeu: 1.442 ± 0.286
0.279PheMet: 0.279 ± 0.118
0.884PheAsn: 0.884 ± 0.239
1.302PhePro: 1.302 ± 0.273
0.605PheGln: 0.605 ± 0.141
1.535PheArg: 1.535 ± 0.268
1.209PheSer: 1.209 ± 0.295
1.442PheThr: 1.442 ± 0.249
1.535PheVal: 1.535 ± 0.218
0.326PheTrp: 0.326 ± 0.11
0.651PheTyr: 0.651 ± 0.16
0.0PheXaa: 0.0 ± 0.0
Gly
9.675GlyAla: 9.675 ± 1.421
1.163GlyCys: 1.163 ± 0.295
5.303GlyAsp: 5.303 ± 0.612
5.256GlyGlu: 5.256 ± 0.538
2.047GlyPhe: 2.047 ± 0.314
13.304GlyGly: 13.304 ± 2.928
2.0GlyHis: 2.0 ± 0.356
4.791GlyIle: 4.791 ± 0.387
3.582GlyLys: 3.582 ± 0.455
7.908GlyLeu: 7.908 ± 0.894
2.186GlyMet: 2.186 ± 0.364
2.419GlyAsn: 2.419 ± 0.461
4.512GlyPro: 4.512 ± 0.573
3.535GlyGln: 3.535 ± 0.512
6.001GlyArg: 6.001 ± 0.578
5.07GlySer: 5.07 ± 0.535
6.559GlyThr: 6.559 ± 0.661
6.652GlyVal: 6.652 ± 0.601
1.861GlyTrp: 1.861 ± 0.314
2.605GlyTyr: 2.605 ± 0.322
0.0GlyXaa: 0.0 ± 0.0
His
1.907HisAla: 1.907 ± 0.344
0.233HisCys: 0.233 ± 0.127
1.209HisAsp: 1.209 ± 0.279
1.628HisGlu: 1.628 ± 0.312
0.512HisPhe: 0.512 ± 0.146
1.628HisGly: 1.628 ± 0.263
0.884HisHis: 0.884 ± 0.283
1.395HisIle: 1.395 ± 0.294
0.419HisLys: 0.419 ± 0.145
1.395HisLeu: 1.395 ± 0.315
0.419HisMet: 0.419 ± 0.133
0.512HisAsn: 0.512 ± 0.166
1.302HisPro: 1.302 ± 0.264
0.279HisGln: 0.279 ± 0.114
2.326HisArg: 2.326 ± 0.359
0.837HisSer: 0.837 ± 0.181
1.954HisThr: 1.954 ± 0.347
0.93HisVal: 0.93 ± 0.256
0.186HisTrp: 0.186 ± 0.1
0.558HisTyr: 0.558 ± 0.188
0.0HisXaa: 0.0 ± 0.0
Ile
5.721IleAla: 5.721 ± 0.536
0.512IleCys: 0.512 ± 0.146
4.047IleAsp: 4.047 ± 0.364
4.652IleGlu: 4.652 ± 0.509
0.884IlePhe: 0.884 ± 0.181
4.977IleGly: 4.977 ± 0.603
0.698IleHis: 0.698 ± 0.182
1.489IleIle: 1.489 ± 0.292
1.489IleLys: 1.489 ± 0.322
2.884IleLeu: 2.884 ± 0.348
0.791IleMet: 0.791 ± 0.156
1.349IleAsn: 1.349 ± 0.262
2.698IlePro: 2.698 ± 0.377
0.884IleGln: 0.884 ± 0.22
2.047IleArg: 2.047 ± 0.378
2.465IleSer: 2.465 ± 0.348
3.21IleThr: 3.21 ± 0.325
3.117IleVal: 3.117 ± 0.414
0.326IleTrp: 0.326 ± 0.119
0.93IleTyr: 0.93 ± 0.207
0.0IleXaa: 0.0 ± 0.0
Lys
4.186LysAla: 4.186 ± 0.463
0.14LysCys: 0.14 ± 0.081
1.395LysAsp: 1.395 ± 0.257
0.744LysGlu: 0.744 ± 0.242
0.698LysPhe: 0.698 ± 0.183
2.558LysGly: 2.558 ± 0.309
0.512LysHis: 0.512 ± 0.165
1.535LysIle: 1.535 ± 0.277
1.023LysLys: 1.023 ± 0.317
2.977LysLeu: 2.977 ± 0.384
0.372LysMet: 0.372 ± 0.133
0.744LysAsn: 0.744 ± 0.207
1.349LysPro: 1.349 ± 0.237
1.395LysGln: 1.395 ± 0.25
2.093LysArg: 2.093 ± 0.344
1.349LysSer: 1.349 ± 0.235
2.279LysThr: 2.279 ± 0.292
2.837LysVal: 2.837 ± 0.281
0.372LysTrp: 0.372 ± 0.134
0.651LysTyr: 0.651 ± 0.178
0.0LysXaa: 0.0 ± 0.0
Leu
11.676LeuAla: 11.676 ± 1.009
1.116LeuCys: 1.116 ± 0.214
6.326LeuAsp: 6.326 ± 0.599
3.303LeuGlu: 3.303 ± 0.35
1.907LeuPhe: 1.907 ± 0.353
7.396LeuGly: 7.396 ± 1.111
1.023LeuHis: 1.023 ± 0.204
3.768LeuIle: 3.768 ± 0.488
2.419LeuLys: 2.419 ± 0.427
5.349LeuLeu: 5.349 ± 0.619
1.768LeuMet: 1.768 ± 0.231
3.303LeuAsn: 3.303 ± 0.55
6.14LeuPro: 6.14 ± 0.514
1.675LeuGln: 1.675 ± 0.252
4.186LeuArg: 4.186 ± 0.497
4.047LeuSer: 4.047 ± 0.453
7.117LeuThr: 7.117 ± 0.637
4.977LeuVal: 4.977 ± 0.497
1.256LeuTrp: 1.256 ± 0.266
1.721LeuTyr: 1.721 ± 0.323
0.0LeuXaa: 0.0 ± 0.0
Met
2.605MetAla: 2.605 ± 0.394
0.233MetCys: 0.233 ± 0.104
0.651MetAsp: 0.651 ± 0.166
0.651MetGlu: 0.651 ± 0.13
0.837MetPhe: 0.837 ± 0.197
1.442MetGly: 1.442 ± 0.238
0.419MetHis: 0.419 ± 0.178
0.837MetIle: 0.837 ± 0.16
0.605MetLys: 0.605 ± 0.152
1.442MetLeu: 1.442 ± 0.217
0.372MetMet: 0.372 ± 0.125
0.465MetAsn: 0.465 ± 0.137
1.582MetPro: 1.582 ± 0.273
0.279MetGln: 0.279 ± 0.126
1.302MetArg: 1.302 ± 0.241
1.675MetSer: 1.675 ± 0.277
2.605MetThr: 2.605 ± 0.286
1.442MetVal: 1.442 ± 0.215
0.512MetTrp: 0.512 ± 0.156
0.558MetTyr: 0.558 ± 0.178
0.0MetXaa: 0.0 ± 0.0
Asn
3.535AsnAla: 3.535 ± 0.405
0.233AsnCys: 0.233 ± 0.098
1.954AsnAsp: 1.954 ± 0.354
1.209AsnGlu: 1.209 ± 0.252
0.326AsnPhe: 0.326 ± 0.117
3.489AsnGly: 3.489 ± 0.464
0.605AsnHis: 0.605 ± 0.185
1.349AsnIle: 1.349 ± 0.292
1.023AsnLys: 1.023 ± 0.252
1.861AsnLeu: 1.861 ± 0.268
0.419AsnMet: 0.419 ± 0.115
1.256AsnAsn: 1.256 ± 0.352
1.861AsnPro: 1.861 ± 0.289
0.419AsnGln: 0.419 ± 0.146
2.047AsnArg: 2.047 ± 0.345
1.582AsnSer: 1.582 ± 0.312
2.465AsnThr: 2.465 ± 0.369
1.675AsnVal: 1.675 ± 0.238
0.372AsnTrp: 0.372 ± 0.147
0.93AsnTyr: 0.93 ± 0.197
0.0AsnXaa: 0.0 ± 0.0
Pro
7.256ProAla: 7.256 ± 0.654
0.372ProCys: 0.372 ± 0.138
4.419ProAsp: 4.419 ± 0.537
5.675ProGlu: 5.675 ± 0.699
1.302ProPhe: 1.302 ± 0.23
5.908ProGly: 5.908 ± 0.72
1.349ProHis: 1.349 ± 0.313
2.512ProIle: 2.512 ± 0.424
1.116ProLys: 1.116 ± 0.216
3.582ProLeu: 3.582 ± 0.339
1.256ProMet: 1.256 ± 0.233
2.0ProAsn: 2.0 ± 0.212
4.698ProPro: 4.698 ± 0.579
1.907ProGln: 1.907 ± 0.295
3.814ProArg: 3.814 ± 0.52
2.605ProSer: 2.605 ± 0.381
4.0ProThr: 4.0 ± 0.343
4.977ProVal: 4.977 ± 0.468
1.349ProTrp: 1.349 ± 0.269
1.116ProTyr: 1.116 ± 0.276
0.0ProXaa: 0.0 ± 0.0
Gln
4.279GlnAla: 4.279 ± 0.668
0.279GlnCys: 0.279 ± 0.119
1.07GlnAsp: 1.07 ± 0.256
0.837GlnGlu: 0.837 ± 0.203
1.023GlnPhe: 1.023 ± 0.204
2.326GlnGly: 2.326 ± 0.359
0.884GlnHis: 0.884 ± 0.189
1.907GlnIle: 1.907 ± 0.279
0.791GlnLys: 0.791 ± 0.235
2.465GlnLeu: 2.465 ± 0.338
0.605GlnMet: 0.605 ± 0.141
0.651GlnAsn: 0.651 ± 0.204
1.535GlnPro: 1.535 ± 0.214
1.675GlnGln: 1.675 ± 0.244
3.07GlnArg: 3.07 ± 0.402
1.721GlnSer: 1.721 ± 0.318
2.233GlnThr: 2.233 ± 0.339
2.093GlnVal: 2.093 ± 0.346
0.698GlnTrp: 0.698 ± 0.188
0.744GlnTyr: 0.744 ± 0.215
0.0GlnXaa: 0.0 ± 0.0
Arg
7.35ArgAla: 7.35 ± 0.617
0.651ArgCys: 0.651 ± 0.181
4.372ArgAsp: 4.372 ± 0.438
5.024ArgGlu: 5.024 ± 0.626
1.489ArgPhe: 1.489 ± 0.274
4.233ArgGly: 4.233 ± 0.463
1.814ArgHis: 1.814 ± 0.382
2.419ArgIle: 2.419 ± 0.339
1.907ArgLys: 1.907 ± 0.319
6.047ArgLeu: 6.047 ± 0.492
1.628ArgMet: 1.628 ± 0.387
2.326ArgAsn: 2.326 ± 0.377
4.14ArgPro: 4.14 ± 0.506
2.558ArgGln: 2.558 ± 0.31
6.512ArgArg: 6.512 ± 0.759
2.558ArgSer: 2.558 ± 0.318
4.093ArgThr: 4.093 ± 0.431
5.396ArgVal: 5.396 ± 0.693
1.861ArgTrp: 1.861 ± 0.291
2.279ArgTyr: 2.279 ± 0.392
0.0ArgXaa: 0.0 ± 0.0
Ser
4.977SerAla: 4.977 ± 0.506
0.512SerCys: 0.512 ± 0.158
3.303SerAsp: 3.303 ± 0.291
2.233SerGlu: 2.233 ± 0.349
0.605SerPhe: 0.605 ± 0.186
5.675SerGly: 5.675 ± 0.774
0.791SerHis: 0.791 ± 0.192
1.721SerIle: 1.721 ± 0.295
1.209SerLys: 1.209 ± 0.232
4.186SerLeu: 4.186 ± 0.609
1.256SerMet: 1.256 ± 0.246
1.07SerAsn: 1.07 ± 0.187
3.163SerPro: 3.163 ± 0.383
2.093SerGln: 2.093 ± 0.26
3.21SerArg: 3.21 ± 0.278
2.651SerSer: 2.651 ± 0.479
3.907SerThr: 3.907 ± 0.451
3.907SerVal: 3.907 ± 0.464
1.116SerTrp: 1.116 ± 0.19
1.442SerTyr: 1.442 ± 0.23
0.0SerXaa: 0.0 ± 0.0
Thr
8.698ThrAla: 8.698 ± 0.563
0.419ThrCys: 0.419 ± 0.137
5.256ThrAsp: 5.256 ± 0.471
3.303ThrGlu: 3.303 ± 0.396
1.582ThrPhe: 1.582 ± 0.241
6.884ThrGly: 6.884 ± 0.543
1.209ThrHis: 1.209 ± 0.233
3.535ThrIle: 3.535 ± 0.462
2.372ThrLys: 2.372 ± 0.299
5.303ThrLeu: 5.303 ± 0.52
1.582ThrMet: 1.582 ± 0.255
1.907ThrAsn: 1.907 ± 0.363
5.024ThrPro: 5.024 ± 0.638
1.814ThrGln: 1.814 ± 0.308
4.0ThrArg: 4.0 ± 0.443
4.0ThrSer: 4.0 ± 0.379
3.814ThrThr: 3.814 ± 0.531
6.326ThrVal: 6.326 ± 0.589
1.302ThrTrp: 1.302 ± 0.284
1.675ThrTyr: 1.675 ± 0.311
0.0ThrXaa: 0.0 ± 0.0
Val
8.233ValAla: 8.233 ± 0.502
0.93ValCys: 0.93 ± 0.206
4.419ValAsp: 4.419 ± 0.491
5.628ValGlu: 5.628 ± 0.64
1.582ValPhe: 1.582 ± 0.244
7.582ValGly: 7.582 ± 0.613
1.675ValHis: 1.675 ± 0.269
3.535ValIle: 3.535 ± 0.341
1.907ValLys: 1.907 ± 0.298
6.605ValLeu: 6.605 ± 0.431
1.442ValMet: 1.442 ± 0.278
2.093ValAsn: 2.093 ± 0.314
3.721ValPro: 3.721 ± 0.365
1.814ValGln: 1.814 ± 0.284
4.884ValArg: 4.884 ± 0.588
4.186ValSer: 4.186 ± 0.388
5.442ValThr: 5.442 ± 0.554
6.001ValVal: 6.001 ± 0.676
2.0ValTrp: 2.0 ± 0.304
1.954ValTyr: 1.954 ± 0.415
0.0ValXaa: 0.0 ± 0.0
Trp
2.326TrpAla: 2.326 ± 0.379
0.512TrpCys: 0.512 ± 0.157
1.395TrpAsp: 1.395 ± 0.262
0.558TrpGlu: 0.558 ± 0.156
0.744TrpPhe: 0.744 ± 0.247
1.07TrpGly: 1.07 ± 0.209
0.465TrpHis: 0.465 ± 0.157
0.93TrpIle: 0.93 ± 0.239
0.279TrpLys: 0.279 ± 0.098
1.489TrpLeu: 1.489 ± 0.327
0.465TrpMet: 0.465 ± 0.144
0.279TrpAsn: 0.279 ± 0.1
0.837TrpPro: 0.837 ± 0.205
0.744TrpGln: 0.744 ± 0.223
1.349TrpArg: 1.349 ± 0.235
1.349TrpSer: 1.349 ± 0.271
1.442TrpThr: 1.442 ± 0.258
1.535TrpVal: 1.535 ± 0.328
0.465TrpTrp: 0.465 ± 0.15
0.419TrpTyr: 0.419 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.117TyrAla: 3.117 ± 0.34
0.279TyrCys: 0.279 ± 0.106
2.186TyrAsp: 2.186 ± 0.287
1.442TyrGlu: 1.442 ± 0.271
0.605TyrPhe: 0.605 ± 0.147
2.186TyrGly: 2.186 ± 0.392
0.465TyrHis: 0.465 ± 0.127
1.023TyrIle: 1.023 ± 0.217
0.605TyrLys: 0.605 ± 0.162
2.047TyrLeu: 2.047 ± 0.419
0.372TyrMet: 0.372 ± 0.131
0.791TyrAsn: 0.791 ± 0.16
1.07TyrPro: 1.07 ± 0.246
0.93TyrGln: 0.93 ± 0.258
2.186TyrArg: 2.186 ± 0.348
1.07TyrSer: 1.07 ± 0.211
1.628TyrThr: 1.628 ± 0.278
2.233TyrVal: 2.233 ± 0.251
0.279TyrTrp: 0.279 ± 0.114
0.558TyrTyr: 0.558 ± 0.161
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (21499 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski