Amino acid dipepetide frequency for Mycobacterium phage LilSpotty

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
17.066AlaAla: 17.066 ± 1.384
0.706AlaCys: 0.706 ± 0.18
8.212AlaAsp: 8.212 ± 0.767
7.827AlaGlu: 7.827 ± 0.702
3.272AlaPhe: 3.272 ± 0.493
9.559AlaGly: 9.559 ± 1.028
1.732AlaHis: 1.732 ± 0.405
5.325AlaIle: 5.325 ± 0.507
4.042AlaLys: 4.042 ± 0.444
9.688AlaLeu: 9.688 ± 0.888
3.4AlaMet: 3.4 ± 0.42
4.491AlaAsn: 4.491 ± 0.506
5.646AlaPro: 5.646 ± 0.681
5.261AlaGln: 5.261 ± 0.738
7.57AlaArg: 7.57 ± 0.891
6.672AlaSer: 6.672 ± 0.775
5.774AlaThr: 5.774 ± 0.629
6.993AlaVal: 6.993 ± 0.646
2.63AlaTrp: 2.63 ± 0.453
2.887AlaTyr: 2.887 ± 0.544
0.0AlaXaa: 0.0 ± 0.0
Cys
1.476CysAla: 1.476 ± 0.337
0.192CysCys: 0.192 ± 0.119
0.962CysAsp: 0.962 ± 0.289
0.706CysGlu: 0.706 ± 0.238
0.257CysPhe: 0.257 ± 0.119
1.091CysGly: 1.091 ± 0.308
0.257CysHis: 0.257 ± 0.135
0.257CysIle: 0.257 ± 0.13
0.128CysLys: 0.128 ± 0.091
0.706CysLeu: 0.706 ± 0.203
0.128CysMet: 0.128 ± 0.097
0.064CysAsn: 0.064 ± 0.069
0.962CysPro: 0.962 ± 0.276
0.513CysGln: 0.513 ± 0.225
0.962CysArg: 0.962 ± 0.288
0.577CysSer: 0.577 ± 0.206
0.642CysThr: 0.642 ± 0.192
0.513CysVal: 0.513 ± 0.181
0.257CysTrp: 0.257 ± 0.129
0.192CysTyr: 0.192 ± 0.102
0.0CysXaa: 0.0 ± 0.0
Asp
8.469AspAla: 8.469 ± 0.674
0.577AspCys: 0.577 ± 0.192
4.234AspAsp: 4.234 ± 0.611
4.555AspGlu: 4.555 ± 0.559
1.732AspPhe: 1.732 ± 0.334
6.48AspGly: 6.48 ± 0.771
1.347AspHis: 1.347 ± 0.341
2.117AspIle: 2.117 ± 0.358
1.604AspLys: 1.604 ± 0.265
5.453AspLeu: 5.453 ± 0.543
1.604AspMet: 1.604 ± 0.317
1.604AspAsn: 1.604 ± 0.351
4.812AspPro: 4.812 ± 0.581
3.208AspGln: 3.208 ± 0.472
5.389AspArg: 5.389 ± 0.529
2.759AspSer: 2.759 ± 0.592
3.4AspThr: 3.4 ± 0.443
4.363AspVal: 4.363 ± 0.499
1.411AspTrp: 1.411 ± 0.28
2.31AspTyr: 2.31 ± 0.418
0.0AspXaa: 0.0 ± 0.0
Glu
5.967GluAla: 5.967 ± 0.761
1.091GluCys: 1.091 ± 0.308
3.144GluAsp: 3.144 ± 0.518
2.566GluGlu: 2.566 ± 0.506
1.989GluPhe: 1.989 ± 0.435
4.042GluGly: 4.042 ± 0.544
1.925GluHis: 1.925 ± 0.367
2.759GluIle: 2.759 ± 0.543
1.604GluLys: 1.604 ± 0.332
5.902GluLeu: 5.902 ± 0.759
1.476GluMet: 1.476 ± 0.291
2.053GluAsn: 2.053 ± 0.343
3.4GluPro: 3.4 ± 0.556
2.63GluGln: 2.63 ± 0.495
4.619GluArg: 4.619 ± 0.64
3.785GluSer: 3.785 ± 0.511
3.464GluThr: 3.464 ± 0.545
4.363GluVal: 4.363 ± 0.532
1.155GluTrp: 1.155 ± 0.244
1.476GluTyr: 1.476 ± 0.305
0.0GluXaa: 0.0 ± 0.0
Phe
2.374PheAla: 2.374 ± 0.37
0.257PheCys: 0.257 ± 0.135
2.63PheAsp: 2.63 ± 0.539
1.668PheGlu: 1.668 ± 0.304
1.155PhePhe: 1.155 ± 0.273
2.823PheGly: 2.823 ± 0.517
0.513PheHis: 0.513 ± 0.194
1.219PheIle: 1.219 ± 0.328
0.898PheLys: 0.898 ± 0.262
2.438PheLeu: 2.438 ± 0.358
0.77PheMet: 0.77 ± 0.21
0.77PheAsn: 0.77 ± 0.254
1.54PhePro: 1.54 ± 0.371
0.77PheGln: 0.77 ± 0.209
1.476PheArg: 1.476 ± 0.316
1.796PheSer: 1.796 ± 0.364
1.796PheThr: 1.796 ± 0.381
2.181PheVal: 2.181 ± 0.378
0.513PheTrp: 0.513 ± 0.198
0.706PheTyr: 0.706 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
9.238GlyAla: 9.238 ± 1.295
1.026GlyCys: 1.026 ± 0.345
5.71GlyAsp: 5.71 ± 0.629
4.812GlyGlu: 4.812 ± 0.481
2.695GlyPhe: 2.695 ± 0.519
8.02GlyGly: 8.02 ± 1.104
1.604GlyHis: 1.604 ± 0.319
5.068GlyIle: 5.068 ± 0.657
3.144GlyLys: 3.144 ± 0.48
6.672GlyLeu: 6.672 ± 0.641
1.283GlyMet: 1.283 ± 0.365
2.566GlyAsn: 2.566 ± 0.479
3.593GlyPro: 3.593 ± 0.472
3.015GlyGln: 3.015 ± 0.58
5.197GlyArg: 5.197 ± 0.676
4.106GlySer: 4.106 ± 0.646
5.582GlyThr: 5.582 ± 0.783
6.287GlyVal: 6.287 ± 0.496
2.502GlyTrp: 2.502 ± 0.378
1.989GlyTyr: 1.989 ± 0.387
0.0GlyXaa: 0.0 ± 0.0
His
1.476HisAla: 1.476 ± 0.422
0.385HisCys: 0.385 ± 0.146
1.219HisAsp: 1.219 ± 0.295
1.54HisGlu: 1.54 ± 0.363
0.449HisPhe: 0.449 ± 0.148
2.31HisGly: 2.31 ± 0.37
0.064HisHis: 0.064 ± 0.063
1.283HisIle: 1.283 ± 0.347
0.385HisLys: 0.385 ± 0.154
1.54HisLeu: 1.54 ± 0.271
0.257HisMet: 0.257 ± 0.158
0.449HisAsn: 0.449 ± 0.153
0.77HisPro: 0.77 ± 0.227
0.513HisGln: 0.513 ± 0.182
1.989HisArg: 1.989 ± 0.45
1.091HisSer: 1.091 ± 0.26
1.155HisThr: 1.155 ± 0.291
1.604HisVal: 1.604 ± 0.353
0.513HisTrp: 0.513 ± 0.193
1.026HisTyr: 1.026 ± 0.315
0.0HisXaa: 0.0 ± 0.0
Ile
5.197IleAla: 5.197 ± 0.448
0.321IleCys: 0.321 ± 0.143
4.812IleAsp: 4.812 ± 0.535
4.17IleGlu: 4.17 ± 0.506
0.962IlePhe: 0.962 ± 0.205
3.272IleGly: 3.272 ± 0.504
1.026IleHis: 1.026 ± 0.289
1.411IleIle: 1.411 ± 0.388
0.834IleLys: 0.834 ± 0.219
2.566IleLeu: 2.566 ± 0.368
0.834IleMet: 0.834 ± 0.218
1.283IleAsn: 1.283 ± 0.29
2.63IlePro: 2.63 ± 0.42
1.796IleGln: 1.796 ± 0.341
3.464IleArg: 3.464 ± 0.515
2.502IleSer: 2.502 ± 0.391
3.464IleThr: 3.464 ± 0.524
2.374IleVal: 2.374 ± 0.439
0.962IleTrp: 0.962 ± 0.337
0.962IleTyr: 0.962 ± 0.258
0.0IleXaa: 0.0 ± 0.0
Lys
3.849LysAla: 3.849 ± 0.527
0.257LysCys: 0.257 ± 0.134
1.668LysAsp: 1.668 ± 0.299
1.604LysGlu: 1.604 ± 0.34
0.577LysPhe: 0.577 ± 0.183
2.31LysGly: 2.31 ± 0.333
0.321LysHis: 0.321 ± 0.164
1.026LysIle: 1.026 ± 0.259
1.54LysLys: 1.54 ± 0.381
2.951LysLeu: 2.951 ± 0.516
0.449LysMet: 0.449 ± 0.145
1.476LysAsn: 1.476 ± 0.32
2.31LysPro: 2.31 ± 0.395
1.091LysGln: 1.091 ± 0.265
3.015LysArg: 3.015 ± 0.451
2.053LysSer: 2.053 ± 0.432
2.117LysThr: 2.117 ± 0.416
3.079LysVal: 3.079 ± 0.48
1.026LysTrp: 1.026 ± 0.273
0.77LysTyr: 0.77 ± 0.228
0.0LysXaa: 0.0 ± 0.0
Leu
9.495LeuAla: 9.495 ± 0.897
1.091LeuCys: 1.091 ± 0.349
5.453LeuAsp: 5.453 ± 0.705
4.234LeuGlu: 4.234 ± 0.631
2.374LeuPhe: 2.374 ± 0.429
7.378LeuGly: 7.378 ± 0.675
1.604LeuHis: 1.604 ± 0.351
3.721LeuIle: 3.721 ± 0.455
3.208LeuLys: 3.208 ± 0.425
5.774LeuLeu: 5.774 ± 0.603
1.796LeuMet: 1.796 ± 0.289
2.181LeuAsn: 2.181 ± 0.384
5.004LeuPro: 5.004 ± 0.521
3.208LeuGln: 3.208 ± 0.54
5.004LeuArg: 5.004 ± 0.625
4.427LeuSer: 4.427 ± 0.595
6.608LeuThr: 6.608 ± 0.64
5.004LeuVal: 5.004 ± 0.727
1.283LeuTrp: 1.283 ± 0.334
1.411LeuTyr: 1.411 ± 0.322
0.0LeuXaa: 0.0 ± 0.0
Met
2.502MetAla: 2.502 ± 0.425
0.321MetCys: 0.321 ± 0.155
0.449MetAsp: 0.449 ± 0.168
0.77MetGlu: 0.77 ± 0.29
0.706MetPhe: 0.706 ± 0.279
1.283MetGly: 1.283 ± 0.321
0.577MetHis: 0.577 ± 0.219
0.706MetIle: 0.706 ± 0.225
0.577MetLys: 0.577 ± 0.268
1.476MetLeu: 1.476 ± 0.284
0.577MetMet: 0.577 ± 0.204
1.091MetAsn: 1.091 ± 0.245
1.861MetPro: 1.861 ± 0.445
0.834MetGln: 0.834 ± 0.219
1.411MetArg: 1.411 ± 0.269
2.245MetSer: 2.245 ± 0.367
1.925MetThr: 1.925 ± 0.305
1.476MetVal: 1.476 ± 0.297
0.321MetTrp: 0.321 ± 0.187
0.321MetTyr: 0.321 ± 0.138
0.0MetXaa: 0.0 ± 0.0
Asn
2.695AsnAla: 2.695 ± 0.362
0.192AsnCys: 0.192 ± 0.113
1.925AsnAsp: 1.925 ± 0.319
1.668AsnGlu: 1.668 ± 0.29
0.834AsnPhe: 0.834 ± 0.205
3.208AsnGly: 3.208 ± 0.439
0.449AsnHis: 0.449 ± 0.156
1.026AsnIle: 1.026 ± 0.3
1.155AsnLys: 1.155 ± 0.26
3.079AsnLeu: 3.079 ± 0.555
0.449AsnMet: 0.449 ± 0.151
1.155AsnAsn: 1.155 ± 0.352
2.502AsnPro: 2.502 ± 0.474
1.091AsnGln: 1.091 ± 0.261
1.925AsnArg: 1.925 ± 0.35
1.925AsnSer: 1.925 ± 0.334
1.411AsnThr: 1.411 ± 0.292
2.63AsnVal: 2.63 ± 0.486
0.834AsnTrp: 0.834 ± 0.255
0.449AsnTyr: 0.449 ± 0.149
0.0AsnXaa: 0.0 ± 0.0
Pro
6.736ProAla: 6.736 ± 0.615
0.577ProCys: 0.577 ± 0.246
4.491ProAsp: 4.491 ± 0.557
3.272ProGlu: 3.272 ± 0.437
1.283ProPhe: 1.283 ± 0.341
5.517ProGly: 5.517 ± 0.867
1.668ProHis: 1.668 ± 0.282
2.438ProIle: 2.438 ± 0.446
2.117ProLys: 2.117 ± 0.416
3.464ProLeu: 3.464 ± 0.488
1.283ProMet: 1.283 ± 0.279
2.181ProAsn: 2.181 ± 0.327
3.336ProPro: 3.336 ± 0.551
1.668ProGln: 1.668 ± 0.344
3.4ProArg: 3.4 ± 0.508
3.079ProSer: 3.079 ± 0.519
3.978ProThr: 3.978 ± 0.585
4.683ProVal: 4.683 ± 0.575
1.54ProTrp: 1.54 ± 0.347
1.476ProTyr: 1.476 ± 0.344
0.0ProXaa: 0.0 ± 0.0
Gln
4.876GlnAla: 4.876 ± 0.731
0.192GlnCys: 0.192 ± 0.124
1.347GlnAsp: 1.347 ± 0.317
2.245GlnGlu: 2.245 ± 0.465
1.347GlnPhe: 1.347 ± 0.222
2.245GlnGly: 2.245 ± 0.361
0.77GlnHis: 0.77 ± 0.19
1.668GlnIle: 1.668 ± 0.303
1.604GlnLys: 1.604 ± 0.345
3.529GlnLeu: 3.529 ± 0.532
0.706GlnMet: 0.706 ± 0.281
1.091GlnAsn: 1.091 ± 0.31
1.925GlnPro: 1.925 ± 0.392
1.989GlnGln: 1.989 ± 0.501
2.823GlnArg: 2.823 ± 0.524
2.053GlnSer: 2.053 ± 0.326
2.245GlnThr: 2.245 ± 0.374
3.015GlnVal: 3.015 ± 0.49
0.898GlnTrp: 0.898 ± 0.213
0.706GlnTyr: 0.706 ± 0.177
0.0GlnXaa: 0.0 ± 0.0
Arg
8.918ArgAla: 8.918 ± 0.974
0.898ArgCys: 0.898 ± 0.279
5.582ArgAsp: 5.582 ± 0.612
4.042ArgGlu: 4.042 ± 0.548
2.31ArgPhe: 2.31 ± 0.409
5.068ArgGly: 5.068 ± 0.573
1.604ArgHis: 1.604 ± 0.346
3.657ArgIle: 3.657 ± 0.496
2.695ArgLys: 2.695 ± 0.462
5.646ArgLeu: 5.646 ± 0.717
1.732ArgMet: 1.732 ± 0.433
2.245ArgAsn: 2.245 ± 0.341
3.593ArgPro: 3.593 ± 0.534
2.438ArgGln: 2.438 ± 0.383
6.608ArgArg: 6.608 ± 0.997
2.566ArgSer: 2.566 ± 0.505
3.657ArgThr: 3.657 ± 0.5
4.234ArgVal: 4.234 ± 0.699
2.053ArgTrp: 2.053 ± 0.327
1.604ArgTyr: 1.604 ± 0.32
0.0ArgXaa: 0.0 ± 0.0
Ser
8.02SerAla: 8.02 ± 0.708
0.513SerCys: 0.513 ± 0.185
3.721SerAsp: 3.721 ± 0.489
3.336SerGlu: 3.336 ± 0.477
1.411SerPhe: 1.411 ± 0.281
5.838SerGly: 5.838 ± 0.658
0.77SerHis: 0.77 ± 0.205
1.861SerIle: 1.861 ± 0.346
1.861SerLys: 1.861 ± 0.426
4.427SerLeu: 4.427 ± 0.545
1.219SerMet: 1.219 ± 0.266
1.476SerAsn: 1.476 ± 0.295
2.63SerPro: 2.63 ± 0.346
1.476SerGln: 1.476 ± 0.314
3.593SerArg: 3.593 ± 0.693
3.079SerSer: 3.079 ± 0.451
3.4SerThr: 3.4 ± 0.494
4.427SerVal: 4.427 ± 0.504
1.219SerTrp: 1.219 ± 0.248
0.77SerTyr: 0.77 ± 0.243
0.0SerXaa: 0.0 ± 0.0
Thr
8.02ThrAla: 8.02 ± 0.686
0.77ThrCys: 0.77 ± 0.224
4.298ThrAsp: 4.298 ± 0.533
3.336ThrGlu: 3.336 ± 0.463
1.732ThrPhe: 1.732 ± 0.361
4.363ThrGly: 4.363 ± 0.612
1.347ThrHis: 1.347 ± 0.327
3.4ThrIle: 3.4 ± 0.555
1.732ThrLys: 1.732 ± 0.302
5.582ThrLeu: 5.582 ± 0.655
0.77ThrMet: 0.77 ± 0.267
1.411ThrAsn: 1.411 ± 0.291
4.876ThrPro: 4.876 ± 0.444
1.796ThrGln: 1.796 ± 0.387
4.94ThrArg: 4.94 ± 0.543
3.079ThrSer: 3.079 ± 0.456
3.657ThrThr: 3.657 ± 0.62
5.325ThrVal: 5.325 ± 0.621
1.219ThrTrp: 1.219 ± 0.238
0.834ThrTyr: 0.834 ± 0.232
0.0ThrXaa: 0.0 ± 0.0
Val
8.276ValAla: 8.276 ± 0.754
0.577ValCys: 0.577 ± 0.182
4.619ValAsp: 4.619 ± 0.542
4.491ValGlu: 4.491 ± 0.654
1.989ValPhe: 1.989 ± 0.355
5.389ValGly: 5.389 ± 0.633
1.668ValHis: 1.668 ± 0.359
3.914ValIle: 3.914 ± 0.538
2.566ValLys: 2.566 ± 0.362
5.197ValLeu: 5.197 ± 0.572
1.219ValMet: 1.219 ± 0.28
2.181ValAsn: 2.181 ± 0.437
4.298ValPro: 4.298 ± 0.444
2.695ValGln: 2.695 ± 0.46
4.491ValArg: 4.491 ± 0.639
3.914ValSer: 3.914 ± 0.523
5.068ValThr: 5.068 ± 0.581
6.095ValVal: 6.095 ± 0.795
1.411ValTrp: 1.411 ± 0.345
1.283ValTyr: 1.283 ± 0.31
0.0ValXaa: 0.0 ± 0.0
Trp
2.31TrpAla: 2.31 ± 0.38
0.449TrpCys: 0.449 ± 0.17
1.54TrpAsp: 1.54 ± 0.376
1.091TrpGlu: 1.091 ± 0.249
0.577TrpPhe: 0.577 ± 0.234
1.411TrpGly: 1.411 ± 0.282
0.449TrpHis: 0.449 ± 0.171
0.962TrpIle: 0.962 ± 0.384
1.091TrpLys: 1.091 ± 0.281
2.245TrpLeu: 2.245 ± 0.38
0.834TrpMet: 0.834 ± 0.263
0.513TrpAsn: 0.513 ± 0.158
1.091TrpPro: 1.091 ± 0.288
0.706TrpGln: 0.706 ± 0.226
1.411TrpArg: 1.411 ± 0.291
1.989TrpSer: 1.989 ± 0.557
1.54TrpThr: 1.54 ± 0.293
1.283TrpVal: 1.283 ± 0.281
0.642TrpTrp: 0.642 ± 0.235
0.642TrpTyr: 0.642 ± 0.216
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.796TyrAla: 1.796 ± 0.39
0.449TyrCys: 0.449 ± 0.193
1.411TyrAsp: 1.411 ± 0.364
1.476TyrGlu: 1.476 ± 0.254
0.642TyrPhe: 0.642 ± 0.276
2.502TyrGly: 2.502 ± 0.399
0.321TyrHis: 0.321 ± 0.135
1.026TyrIle: 1.026 ± 0.219
0.706TyrLys: 0.706 ± 0.183
1.925TyrLeu: 1.925 ± 0.435
0.577TyrMet: 0.577 ± 0.169
0.321TyrAsn: 0.321 ± 0.149
1.411TyrPro: 1.411 ± 0.308
0.577TyrGln: 0.577 ± 0.178
1.925TyrArg: 1.925 ± 0.423
1.347TyrSer: 1.347 ± 0.266
1.604TyrThr: 1.604 ± 0.323
1.411TyrVal: 1.411 ± 0.372
0.385TyrTrp: 0.385 ± 0.196
0.449TyrTyr: 0.449 ± 0.204
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 85 proteins (15588 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski