Amino acid dipepetide frequency for Ralstonia phage phiITL-1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
14.311AlaAla: 14.311 ± 1.392
1.114AlaCys: 1.114 ± 0.347
6.941AlaAsp: 6.941 ± 0.925
7.37AlaGlu: 7.37 ± 0.788
3.856AlaPhe: 3.856 ± 0.534
9.084AlaGly: 9.084 ± 1.244
1.8AlaHis: 1.8 ± 0.512
4.799AlaIle: 4.799 ± 0.579
6.427AlaLys: 6.427 ± 0.713
7.627AlaLeu: 7.627 ± 0.793
2.828AlaMet: 2.828 ± 0.38
3.942AlaAsn: 3.942 ± 0.619
4.885AlaPro: 4.885 ± 1.046
5.142AlaGln: 5.142 ± 0.761
8.056AlaArg: 8.056 ± 0.962
5.999AlaSer: 5.999 ± 0.872
6.684AlaThr: 6.684 ± 1.148
7.456AlaVal: 7.456 ± 0.773
1.628AlaTrp: 1.628 ± 0.392
2.914AlaTyr: 2.914 ± 0.51
0.0AlaXaa: 0.0 ± 0.0
Cys
0.857CysAla: 0.857 ± 0.307
0.343CysCys: 0.343 ± 0.327
0.086CysAsp: 0.086 ± 0.088
0.857CysGlu: 0.857 ± 0.269
0.857CysPhe: 0.857 ± 0.287
0.857CysGly: 0.857 ± 0.296
0.343CysHis: 0.343 ± 0.164
0.171CysIle: 0.171 ± 0.117
0.514CysLys: 0.514 ± 0.236
1.2CysLeu: 1.2 ± 0.343
0.171CysMet: 0.171 ± 0.135
0.171CysAsn: 0.171 ± 0.12
0.514CysPro: 0.514 ± 0.27
0.171CysGln: 0.171 ± 0.104
0.6CysArg: 0.6 ± 0.234
0.6CysSer: 0.6 ± 0.319
0.686CysThr: 0.686 ± 0.248
0.6CysVal: 0.6 ± 0.214
0.257CysTrp: 0.257 ± 0.177
0.257CysTyr: 0.257 ± 0.166
0.0CysXaa: 0.0 ± 0.0
Asp
6.941AspAla: 6.941 ± 0.879
0.428AspCys: 0.428 ± 0.206
2.828AspAsp: 2.828 ± 0.524
3.685AspGlu: 3.685 ± 0.615
2.142AspPhe: 2.142 ± 0.381
5.142AspGly: 5.142 ± 0.663
1.028AspHis: 1.028 ± 0.284
2.657AspIle: 2.657 ± 0.484
3.599AspLys: 3.599 ± 0.721
4.713AspLeu: 4.713 ± 0.634
0.6AspMet: 0.6 ± 0.242
2.4AspAsn: 2.4 ± 0.479
3.856AspPro: 3.856 ± 0.713
1.714AspGln: 1.714 ± 0.4
3.942AspArg: 3.942 ± 0.485
2.742AspSer: 2.742 ± 0.541
3.514AspThr: 3.514 ± 0.54
4.028AspVal: 4.028 ± 0.546
0.686AspTrp: 0.686 ± 0.202
1.628AspTyr: 1.628 ± 0.251
0.0AspXaa: 0.0 ± 0.0
Glu
8.57GluAla: 8.57 ± 0.699
0.686GluCys: 0.686 ± 0.247
4.628GluAsp: 4.628 ± 0.863
4.456GluGlu: 4.456 ± 0.697
2.485GluPhe: 2.485 ± 0.428
6.77GluGly: 6.77 ± 1.004
0.943GluHis: 0.943 ± 0.281
1.714GluIle: 1.714 ± 0.328
2.999GluLys: 2.999 ± 0.562
4.542GluLeu: 4.542 ± 0.581
1.714GluMet: 1.714 ± 0.354
1.543GluAsn: 1.543 ± 0.431
1.714GluPro: 1.714 ± 0.413
2.999GluGln: 2.999 ± 0.575
4.028GluArg: 4.028 ± 0.608
2.485GluSer: 2.485 ± 0.495
3.685GluThr: 3.685 ± 0.529
4.542GluVal: 4.542 ± 0.647
1.028GluTrp: 1.028 ± 0.323
2.485GluTyr: 2.485 ± 0.506
0.0GluXaa: 0.0 ± 0.0
Phe
2.914PheAla: 2.914 ± 0.381
0.343PheCys: 0.343 ± 0.153
2.4PheAsp: 2.4 ± 0.515
1.885PheGlu: 1.885 ± 0.416
1.028PhePhe: 1.028 ± 0.261
1.885PheGly: 1.885 ± 0.475
0.943PheHis: 0.943 ± 0.383
1.714PheIle: 1.714 ± 0.337
2.057PheLys: 2.057 ± 0.412
2.142PheLeu: 2.142 ± 0.409
1.114PheMet: 1.114 ± 0.297
1.714PheAsn: 1.714 ± 0.352
1.8PhePro: 1.8 ± 0.37
1.543PheGln: 1.543 ± 0.409
2.4PheArg: 2.4 ± 0.442
1.714PheSer: 1.714 ± 0.539
1.971PheThr: 1.971 ± 0.38
2.999PheVal: 2.999 ± 0.603
0.514PheTrp: 0.514 ± 0.191
0.686PheTyr: 0.686 ± 0.208
0.0PheXaa: 0.0 ± 0.0
Gly
6.684GlyAla: 6.684 ± 0.814
0.686GlyCys: 0.686 ± 0.305
4.97GlyAsp: 4.97 ± 0.906
5.742GlyGlu: 5.742 ± 0.745
2.828GlyPhe: 2.828 ± 0.452
5.913GlyGly: 5.913 ± 0.581
1.714GlyHis: 1.714 ± 0.309
4.028GlyIle: 4.028 ± 0.839
6.17GlyLys: 6.17 ± 0.756
8.313GlyLeu: 8.313 ± 0.889
1.971GlyMet: 1.971 ± 0.471
3.085GlyAsn: 3.085 ± 0.644
3.085GlyPro: 3.085 ± 0.601
3.942GlyGln: 3.942 ± 0.603
4.456GlyArg: 4.456 ± 0.59
5.228GlySer: 5.228 ± 0.727
4.713GlyThr: 4.713 ± 0.631
6.599GlyVal: 6.599 ± 1.197
1.543GlyTrp: 1.543 ± 0.312
2.571GlyTyr: 2.571 ± 0.438
0.0GlyXaa: 0.0 ± 0.0
His
2.4HisAla: 2.4 ± 0.575
0.257HisCys: 0.257 ± 0.144
1.714HisAsp: 1.714 ± 0.45
1.2HisGlu: 1.2 ± 0.35
0.943HisPhe: 0.943 ± 0.306
1.2HisGly: 1.2 ± 0.3
0.514HisHis: 0.514 ± 0.218
1.285HisIle: 1.285 ± 0.31
0.771HisLys: 0.771 ± 0.224
1.543HisLeu: 1.543 ± 0.553
0.686HisMet: 0.686 ± 0.231
0.428HisAsn: 0.428 ± 0.175
0.857HisPro: 0.857 ± 0.286
0.771HisGln: 0.771 ± 0.285
1.8HisArg: 1.8 ± 0.58
0.514HisSer: 0.514 ± 0.186
1.028HisThr: 1.028 ± 0.342
1.2HisVal: 1.2 ± 0.44
0.6HisTrp: 0.6 ± 0.306
0.428HisTyr: 0.428 ± 0.197
0.0HisXaa: 0.0 ± 0.0
Ile
5.485IleAla: 5.485 ± 0.69
0.343IleCys: 0.343 ± 0.171
2.828IleAsp: 2.828 ± 0.437
3.856IleGlu: 3.856 ± 0.427
0.686IlePhe: 0.686 ± 0.228
3.599IleGly: 3.599 ± 0.589
1.028IleHis: 1.028 ± 0.293
2.742IleIle: 2.742 ± 0.587
2.4IleLys: 2.4 ± 0.591
2.828IleLeu: 2.828 ± 0.491
1.114IleMet: 1.114 ± 0.312
1.8IleAsn: 1.8 ± 0.403
1.971IlePro: 1.971 ± 0.409
2.4IleGln: 2.4 ± 0.318
2.4IleArg: 2.4 ± 0.428
1.114IleSer: 1.114 ± 0.331
2.828IleThr: 2.828 ± 0.559
4.199IleVal: 4.199 ± 0.563
0.6IleTrp: 0.6 ± 0.267
0.943IleTyr: 0.943 ± 0.266
0.0IleXaa: 0.0 ± 0.0
Lys
7.541LysAla: 7.541 ± 0.796
0.257LysCys: 0.257 ± 0.137
3.428LysAsp: 3.428 ± 0.511
2.999LysGlu: 2.999 ± 0.585
2.4LysPhe: 2.4 ± 0.501
3.942LysGly: 3.942 ± 0.687
1.2LysHis: 1.2 ± 0.361
2.057LysIle: 2.057 ± 0.346
2.999LysLys: 2.999 ± 0.64
4.113LysLeu: 4.113 ± 0.552
0.771LysMet: 0.771 ± 0.218
0.771LysAsn: 0.771 ± 0.291
2.742LysPro: 2.742 ± 0.632
2.314LysGln: 2.314 ± 0.538
2.228LysArg: 2.228 ± 0.494
2.485LysSer: 2.485 ± 0.482
2.142LysThr: 2.142 ± 0.49
5.313LysVal: 5.313 ± 0.91
1.028LysTrp: 1.028 ± 0.301
2.657LysTyr: 2.657 ± 0.505
0.0LysXaa: 0.0 ± 0.0
Leu
8.141LeuAla: 8.141 ± 1.027
0.343LeuCys: 0.343 ± 0.15
4.199LeuAsp: 4.199 ± 0.559
3.856LeuGlu: 3.856 ± 0.741
2.228LeuPhe: 2.228 ± 0.352
5.313LeuGly: 5.313 ± 1.036
1.628LeuHis: 1.628 ± 0.27
3.771LeuIle: 3.771 ± 0.527
5.313LeuLys: 5.313 ± 0.704
5.142LeuLeu: 5.142 ± 0.784
1.628LeuMet: 1.628 ± 0.271
3.856LeuAsn: 3.856 ± 0.635
4.113LeuPro: 4.113 ± 0.737
2.657LeuGln: 2.657 ± 0.507
6.427LeuArg: 6.427 ± 0.778
4.628LeuSer: 4.628 ± 0.632
4.456LeuThr: 4.456 ± 0.616
5.656LeuVal: 5.656 ± 0.629
0.943LeuTrp: 0.943 ± 0.275
2.4LeuTyr: 2.4 ± 0.48
0.0LeuXaa: 0.0 ± 0.0
Met
2.657MetAla: 2.657 ± 0.447
0.343MetCys: 0.343 ± 0.18
1.028MetAsp: 1.028 ± 0.218
1.285MetGlu: 1.285 ± 0.409
0.6MetPhe: 0.6 ± 0.209
2.657MetGly: 2.657 ± 0.412
0.343MetHis: 0.343 ± 0.149
1.028MetIle: 1.028 ± 0.306
0.6MetLys: 0.6 ± 0.243
2.057MetLeu: 2.057 ± 0.329
0.6MetMet: 0.6 ± 0.191
0.943MetAsn: 0.943 ± 0.356
1.114MetPro: 1.114 ± 0.32
0.943MetGln: 0.943 ± 0.229
1.457MetArg: 1.457 ± 0.346
2.142MetSer: 2.142 ± 0.483
1.628MetThr: 1.628 ± 0.388
0.857MetVal: 0.857 ± 0.301
0.086MetTrp: 0.086 ± 0.08
0.343MetTyr: 0.343 ± 0.207
0.0MetXaa: 0.0 ± 0.0
Asn
4.028AsnAla: 4.028 ± 0.583
0.171AsnCys: 0.171 ± 0.102
2.142AsnAsp: 2.142 ± 0.501
1.971AsnGlu: 1.971 ± 0.326
1.028AsnPhe: 1.028 ± 0.396
3.171AsnGly: 3.171 ± 0.537
0.514AsnHis: 0.514 ± 0.186
1.971AsnIle: 1.971 ± 0.35
1.8AsnLys: 1.8 ± 0.417
3.599AsnLeu: 3.599 ± 0.597
0.686AsnMet: 0.686 ± 0.235
1.114AsnAsn: 1.114 ± 0.32
2.742AsnPro: 2.742 ± 0.573
1.2AsnGln: 1.2 ± 0.406
2.657AsnArg: 2.657 ± 0.473
2.142AsnSer: 2.142 ± 0.471
1.971AsnThr: 1.971 ± 0.435
1.8AsnVal: 1.8 ± 0.368
0.257AsnTrp: 0.257 ± 0.155
1.028AsnTyr: 1.028 ± 0.205
0.0AsnXaa: 0.0 ± 0.0
Pro
4.97ProAla: 4.97 ± 1.055
0.686ProCys: 0.686 ± 0.275
2.571ProAsp: 2.571 ± 0.518
4.199ProGlu: 4.199 ± 0.689
1.8ProPhe: 1.8 ± 0.333
4.285ProGly: 4.285 ± 0.626
0.686ProHis: 0.686 ± 0.291
1.8ProIle: 1.8 ± 0.424
2.4ProLys: 2.4 ± 0.625
2.999ProLeu: 2.999 ± 0.408
0.771ProMet: 0.771 ± 0.264
1.457ProAsn: 1.457 ± 0.287
2.057ProPro: 2.057 ± 0.512
1.714ProGln: 1.714 ± 0.361
2.314ProArg: 2.314 ± 0.597
3.342ProSer: 3.342 ± 0.624
3.085ProThr: 3.085 ± 0.504
3.771ProVal: 3.771 ± 0.501
0.514ProTrp: 0.514 ± 0.225
1.457ProTyr: 1.457 ± 0.368
0.0ProXaa: 0.0 ± 0.0
Gln
5.57GlnAla: 5.57 ± 0.867
0.514GlnCys: 0.514 ± 0.213
2.4GlnAsp: 2.4 ± 0.398
2.314GlnGlu: 2.314 ± 0.467
1.971GlnPhe: 1.971 ± 0.451
2.999GlnGly: 2.999 ± 0.476
0.514GlnHis: 0.514 ± 0.183
2.314GlnIle: 2.314 ± 0.336
1.543GlnLys: 1.543 ± 0.338
3.171GlnLeu: 3.171 ± 0.451
0.686GlnMet: 0.686 ± 0.27
1.714GlnAsn: 1.714 ± 0.371
1.114GlnPro: 1.114 ± 0.259
2.228GlnGln: 2.228 ± 0.442
3.256GlnArg: 3.256 ± 0.473
1.371GlnSer: 1.371 ± 0.274
1.885GlnThr: 1.885 ± 0.46
2.742GlnVal: 2.742 ± 0.372
0.943GlnTrp: 0.943 ± 0.35
1.2GlnTyr: 1.2 ± 0.288
0.0GlnXaa: 0.0 ± 0.0
Arg
7.798ArgAla: 7.798 ± 0.752
0.686ArgCys: 0.686 ± 0.299
4.199ArgAsp: 4.199 ± 0.717
3.599ArgGlu: 3.599 ± 0.613
1.971ArgPhe: 1.971 ± 0.41
6.256ArgGly: 6.256 ± 0.629
1.714ArgHis: 1.714 ± 0.434
2.999ArgIle: 2.999 ± 0.468
2.742ArgLys: 2.742 ± 0.47
4.456ArgLeu: 4.456 ± 0.696
1.971ArgMet: 1.971 ± 0.406
2.657ArgAsn: 2.657 ± 0.576
2.314ArgPro: 2.314 ± 0.628
2.228ArgGln: 2.228 ± 0.434
4.799ArgArg: 4.799 ± 0.741
5.056ArgSer: 5.056 ± 1.194
4.628ArgThr: 4.628 ± 0.73
4.371ArgVal: 4.371 ± 0.511
1.2ArgTrp: 1.2 ± 0.334
1.457ArgTyr: 1.457 ± 0.444
0.0ArgXaa: 0.0 ± 0.0
Ser
5.485SerAla: 5.485 ± 0.765
1.2SerCys: 1.2 ± 0.653
2.742SerAsp: 2.742 ± 0.462
3.085SerGlu: 3.085 ± 0.424
2.314SerPhe: 2.314 ± 0.475
5.313SerGly: 5.313 ± 0.562
0.943SerHis: 0.943 ± 0.241
1.885SerIle: 1.885 ± 0.303
2.571SerLys: 2.571 ± 0.274
4.885SerLeu: 4.885 ± 0.659
1.2SerMet: 1.2 ± 0.28
1.885SerAsn: 1.885 ± 0.36
2.571SerPro: 2.571 ± 0.534
2.142SerGln: 2.142 ± 0.337
4.028SerArg: 4.028 ± 1.021
4.542SerSer: 4.542 ± 0.963
2.485SerThr: 2.485 ± 0.493
4.028SerVal: 4.028 ± 0.436
0.686SerTrp: 0.686 ± 0.196
1.543SerTyr: 1.543 ± 0.247
0.0SerXaa: 0.0 ± 0.0
Thr
5.399ThrAla: 5.399 ± 0.797
0.943ThrCys: 0.943 ± 0.332
3.514ThrAsp: 3.514 ± 0.394
3.856ThrGlu: 3.856 ± 0.418
2.142ThrPhe: 2.142 ± 0.455
5.228ThrGly: 5.228 ± 0.661
1.457ThrHis: 1.457 ± 0.4
2.742ThrIle: 2.742 ± 0.542
2.314ThrLys: 2.314 ± 0.387
4.371ThrLeu: 4.371 ± 0.569
0.943ThrMet: 0.943 ± 0.294
1.714ThrAsn: 1.714 ± 0.465
3.771ThrPro: 3.771 ± 0.565
1.885ThrGln: 1.885 ± 0.484
3.428ThrArg: 3.428 ± 0.506
3.771ThrSer: 3.771 ± 0.683
2.485ThrThr: 2.485 ± 0.564
4.628ThrVal: 4.628 ± 1.007
0.514ThrTrp: 0.514 ± 0.234
1.885ThrTyr: 1.885 ± 0.417
0.0ThrXaa: 0.0 ± 0.0
Val
8.056ValAla: 8.056 ± 0.825
0.257ValCys: 0.257 ± 0.161
3.514ValAsp: 3.514 ± 0.519
5.313ValGlu: 5.313 ± 0.536
1.457ValPhe: 1.457 ± 0.368
5.913ValGly: 5.913 ± 0.699
1.714ValHis: 1.714 ± 0.41
3.685ValIle: 3.685 ± 0.627
4.113ValLys: 4.113 ± 0.596
5.913ValLeu: 5.913 ± 0.69
2.057ValMet: 2.057 ± 0.409
2.999ValAsn: 2.999 ± 0.562
3.942ValPro: 3.942 ± 0.513
2.742ValGln: 2.742 ± 0.564
5.142ValArg: 5.142 ± 0.601
4.028ValSer: 4.028 ± 0.414
4.371ValThr: 4.371 ± 0.484
4.799ValVal: 4.799 ± 0.74
1.114ValTrp: 1.114 ± 0.304
1.2ValTyr: 1.2 ± 0.455
0.0ValXaa: 0.0 ± 0.0
Trp
2.057TrpAla: 2.057 ± 0.419
0.171TrpCys: 0.171 ± 0.14
0.6TrpAsp: 0.6 ± 0.225
1.114TrpGlu: 1.114 ± 0.336
0.6TrpPhe: 0.6 ± 0.278
1.2TrpGly: 1.2 ± 0.371
0.428TrpHis: 0.428 ± 0.173
0.943TrpIle: 0.943 ± 0.253
0.943TrpLys: 0.943 ± 0.258
1.371TrpLeu: 1.371 ± 0.36
0.514TrpMet: 0.514 ± 0.19
0.686TrpAsn: 0.686 ± 0.25
0.428TrpPro: 0.428 ± 0.167
0.514TrpGln: 0.514 ± 0.228
0.943TrpArg: 0.943 ± 0.353
0.6TrpSer: 0.6 ± 0.21
0.943TrpThr: 0.943 ± 0.267
1.028TrpVal: 1.028 ± 0.352
0.428TrpTrp: 0.428 ± 0.178
0.171TrpTyr: 0.171 ± 0.16
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.999TyrAla: 2.999 ± 0.457
0.343TyrCys: 0.343 ± 0.189
1.543TyrAsp: 1.543 ± 0.405
1.371TyrGlu: 1.371 ± 0.309
0.514TyrPhe: 0.514 ± 0.201
3.514TyrGly: 3.514 ± 0.395
0.686TyrHis: 0.686 ± 0.223
0.857TyrIle: 0.857 ± 0.311
1.114TyrLys: 1.114 ± 0.318
1.714TyrLeu: 1.714 ± 0.509
0.6TyrMet: 0.6 ± 0.229
1.114TyrAsn: 1.114 ± 0.252
1.285TyrPro: 1.285 ± 0.333
1.2TyrGln: 1.2 ± 0.287
2.742TyrArg: 2.742 ± 0.468
1.028TyrSer: 1.028 ± 0.249
1.8TyrThr: 1.8 ± 0.363
1.714TyrVal: 1.714 ± 0.512
1.114TyrTrp: 1.114 ± 0.279
0.428TyrTyr: 0.428 ± 0.158
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 54 proteins (11670 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski