Amino acid dipeptide frequency for Pseudomonas virus LPB1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
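As a rough illustration of what such a calculation involves, the sketch below counts overlapping residue pairs in a list of one-letter protein sequences and reports them in per mille. The function name `dipeptide_permille`, the toy sequences, the mapping of non-standard letters to `X` (Xaa), and the choice to normalise by the total number of pairs are all assumptions made for this example; the exact counting convention used by Proteome-pI is not stated here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code.
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Assumptions of this sketch: pairs never span two proteins, and the
    frequencies are normalised by the total number of pairs counted.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard letters to 'X' (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows of length 2 within a single protein.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not taken from the LPB1 proteome.
toy = ["MAAL", "MALG"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

With real data, the input would be the protein sequences of the proteome, for example parsed from a FASTA file.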

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all overrepresented dipeptides are marked in red, and those that are underrepresented are marked in blue in the table.
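The arithmetic behind this baseline can be sketched in a few lines. The helper names below are hypothetical, and treating the uniform expectation as the exact cut-off between "over" and "under" is an assumption of this example; the page does not say whether the colouring uses the 400- or the 441-dipeptide baseline, or any tolerance around it.

```python
def uniform_expectation_permille(alphabet_size=20):
    """Expected per-mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def permille_to_fraction(value_permille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3


def classify(freq_permille, alphabet_size=20):
    """Label a dipeptide relative to the uniform expectation (assumed cut-off)."""
    expected = uniform_expectation_permille(alphabet_size)
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "as expected"


print(uniform_expectation_permille(20))  # 2.5 (i.e. 0.25%)
print(classify(18.309))  # AlaAla value from the table
print(classify(0.084))   # CysCys value from the table
```

With 21 symbols (including Xaa) the same function gives 1000/441, roughly 2.27‰.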

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Entries are grouped by first residue and listed as dipeptide: frequency ± error (‰).

Ala
AlaAla: 18.309 ± 2.098
AlaCys: 1.344 ± 0.437
AlaAsp: 6.719 ± 0.724
AlaGlu: 8.23 ± 0.997
AlaPhe: 3.611 ± 0.555
AlaGly: 10.498 ± 1.134
AlaHis: 1.344 ± 0.359
AlaIle: 5.963 ± 0.642
AlaLys: 4.115 ± 0.618
AlaLeu: 12.094 ± 1.126
AlaMet: 4.031 ± 0.601
AlaAsn: 3.611 ± 0.588
AlaPro: 4.703 ± 0.726
AlaGln: 6.047 ± 0.761
AlaArg: 9.406 ± 0.86
AlaSer: 7.139 ± 1.097
AlaThr: 6.719 ± 0.664
AlaVal: 6.635 ± 0.784
AlaTrp: 1.932 ± 0.313
AlaTyr: 3.443 ± 0.602
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.008 ± 0.334
CysCys: 0.084 ± 0.083
CysAsp: 0.924 ± 0.305
CysGlu: 0.504 ± 0.278
CysPhe: 0.672 ± 0.245
CysGly: 0.756 ± 0.27
CysHis: 0.336 ± 0.157
CysIle: 0.168 ± 0.11
CysLys: 0.252 ± 0.195
CysLeu: 0.588 ± 0.2
CysMet: 0.252 ± 0.164
CysAsn: 0.252 ± 0.149
CysPro: 0.84 ± 0.328
CysGln: 0.168 ± 0.122
CysArg: 0.924 ± 0.358
CysSer: 0.504 ± 0.249
CysThr: 0.588 ± 0.228
CysVal: 0.504 ± 0.211
CysTrp: 0.252 ± 0.143
CysTyr: 0.084 ± 0.094
CysXaa: 0.0 ± 0.0

Asp
AspAla: 6.131 ± 0.676
AspCys: 0.168 ± 0.117
AspAsp: 3.695 ± 0.538
AspGlu: 3.779 ± 0.624
AspPhe: 1.68 ± 0.285
AspGly: 6.803 ± 0.99
AspHis: 1.344 ± 0.359
AspIle: 2.52 ± 0.317
AspLys: 1.176 ± 0.381
AspLeu: 5.375 ± 0.709
AspMet: 1.512 ± 0.486
AspAsn: 1.512 ± 0.334
AspPro: 2.939 ± 0.54
AspGln: 3.779 ± 0.801
AspArg: 3.527 ± 0.51
AspSer: 3.527 ± 0.549
AspThr: 3.023 ± 0.523
AspVal: 4.367 ± 0.603
AspTrp: 1.344 ± 0.275
AspTyr: 1.596 ± 0.318
AspXaa: 0.0 ± 0.0

Glu
GluAla: 7.475 ± 0.769
GluCys: 1.008 ± 0.41
GluAsp: 2.771 ± 0.506
GluGlu: 2.52 ± 0.431
GluPhe: 2.436 ± 0.494
GluGly: 3.359 ± 0.488
GluHis: 1.26 ± 0.392
GluIle: 2.436 ± 0.45
GluLys: 2.1 ± 0.543
GluLeu: 6.551 ± 0.66
GluMet: 1.512 ± 0.462
GluAsn: 1.344 ± 0.437
GluPro: 2.855 ± 0.535
GluGln: 3.191 ± 0.502
GluArg: 4.871 ± 0.779
GluSer: 2.52 ± 0.37
GluThr: 2.855 ± 0.46
GluVal: 4.115 ± 0.622
GluTrp: 1.008 ± 0.266
GluTyr: 1.26 ± 0.333
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.527 ± 0.549
PheCys: 0.42 ± 0.167
PheAsp: 2.436 ± 0.491
PheGlu: 1.512 ± 0.341
PhePhe: 0.924 ± 0.272
PheGly: 2.604 ± 0.488
PheHis: 0.336 ± 0.159
PheIle: 1.008 ± 0.311
PheLys: 1.008 ± 0.274
PheLeu: 2.436 ± 0.477
PheMet: 0.924 ± 0.352
PheAsn: 1.092 ± 0.273
PhePro: 1.512 ± 0.404
PheGln: 1.26 ± 0.337
PheArg: 1.932 ± 0.528
PheSer: 1.512 ± 0.335
PheThr: 1.596 ± 0.33
PheVal: 1.68 ± 0.301
PheTrp: 0.42 ± 0.176
PheTyr: 1.008 ± 0.261
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 7.307 ± 1.316
GlyCys: 0.588 ± 0.26
GlyAsp: 4.871 ± 0.848
GlyGlu: 4.955 ± 0.619
GlyPhe: 3.359 ± 0.427
GlyGly: 6.971 ± 0.655
GlyHis: 0.756 ± 0.237
GlyIle: 3.779 ± 0.588
GlyLys: 3.107 ± 0.634
GlyLeu: 6.719 ± 0.864
GlyMet: 1.092 ± 0.343
GlyAsn: 2.52 ± 0.434
GlyPro: 2.268 ± 0.341
GlyGln: 5.039 ± 0.686
GlyArg: 6.719 ± 0.663
GlySer: 6.047 ± 0.905
GlyThr: 4.619 ± 0.502
GlyVal: 5.039 ± 0.741
GlyTrp: 1.932 ± 0.353
GlyTyr: 2.352 ± 0.577
GlyXaa: 0.0 ± 0.0

His
HisAla: 2.016 ± 0.346
HisCys: 0.0 ± 0.0
HisAsp: 0.924 ± 0.226
HisGlu: 0.756 ± 0.263
HisPhe: 0.168 ± 0.105
HisGly: 1.092 ± 0.361
HisHis: 0.252 ± 0.144
HisIle: 0.672 ± 0.254
HisLys: 0.252 ± 0.137
HisLeu: 1.68 ± 0.396
HisMet: 0.672 ± 0.238
HisAsn: 0.924 ± 0.262
HisPro: 1.344 ± 0.345
HisGln: 0.84 ± 0.281
HisArg: 0.924 ± 0.308
HisSer: 0.588 ± 0.204
HisThr: 0.84 ± 0.255
HisVal: 0.504 ± 0.221
HisTrp: 0.42 ± 0.208
HisTyr: 0.84 ± 0.304
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.299 ± 0.924
IleCys: 0.672 ± 0.266
IleAsp: 3.191 ± 0.493
IleGlu: 2.855 ± 0.488
IlePhe: 0.84 ± 0.22
IleGly: 2.939 ± 0.446
IleHis: 0.672 ± 0.182
IleIle: 1.596 ± 0.601
IleLys: 1.68 ± 0.406
IleLeu: 2.268 ± 0.412
IleMet: 0.504 ± 0.174
IleAsn: 1.008 ± 0.339
IlePro: 2.268 ± 0.376
IleGln: 2.268 ± 0.513
IleArg: 3.527 ± 0.463
IleSer: 1.848 ± 0.346
IleThr: 2.604 ± 0.56
IleVal: 2.771 ± 0.417
IleTrp: 0.672 ± 0.214
IleTyr: 1.092 ± 0.383
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.795 ± 0.875
LysCys: 0.084 ± 0.086
LysAsp: 1.26 ± 0.337
LysGlu: 1.176 ± 0.347
LysPhe: 0.588 ± 0.229
LysGly: 2.771 ± 0.367
LysHis: 0.756 ± 0.245
LysIle: 0.756 ± 0.247
LysLys: 1.932 ± 0.458
LysLeu: 3.695 ± 0.6
LysMet: 0.756 ± 0.323
LysAsn: 0.924 ± 0.29
LysPro: 3.107 ± 0.667
LysGln: 1.008 ± 0.279
LysArg: 3.191 ± 0.7
LysSer: 1.68 ± 0.47
LysThr: 1.932 ± 0.373
LysVal: 2.268 ± 0.446
LysTrp: 0.336 ± 0.156
LysTyr: 1.008 ± 0.209
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 12.766 ± 1.09
LeuCys: 1.008 ± 0.261
LeuAsp: 6.635 ± 0.787
LeuGlu: 6.131 ± 0.701
LeuPhe: 2.52 ± 0.595
LeuGly: 7.643 ± 0.758
LeuHis: 2.184 ± 0.476
LeuIle: 3.611 ± 0.669
LeuLys: 3.863 ± 0.77
LeuLeu: 8.062 ± 0.844
LeuMet: 2.1 ± 0.526
LeuAsn: 2.771 ± 0.499
LeuPro: 4.619 ± 0.825
LeuGln: 4.115 ± 0.736
LeuArg: 6.383 ± 0.638
LeuSer: 4.535 ± 0.714
LeuThr: 5.627 ± 0.85
LeuVal: 7.055 ± 1.021
LeuTrp: 1.428 ± 0.295
LeuTyr: 2.268 ± 0.466
LeuXaa: 0.0 ± 0.0

Met
MetAla: 3.527 ± 0.467
MetCys: 0.168 ± 0.119
MetAsp: 2.184 ± 0.4
MetGlu: 1.428 ± 0.425
MetPhe: 0.42 ± 0.189
MetGly: 1.428 ± 0.324
MetHis: 0.252 ± 0.127
MetIle: 0.336 ± 0.151
MetLys: 0.924 ± 0.253
MetLeu: 1.848 ± 0.402
MetMet: 0.42 ± 0.21
MetAsn: 0.672 ± 0.229
MetPro: 1.176 ± 0.403
MetGln: 1.092 ± 0.287
MetArg: 1.344 ± 0.248
MetSer: 1.764 ± 0.394
MetThr: 1.596 ± 0.383
MetVal: 1.008 ± 0.313
MetTrp: 0.252 ± 0.185
MetTyr: 0.252 ± 0.15
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.275 ± 0.664
AsnCys: 0.168 ± 0.113
AsnAsp: 1.512 ± 0.367
AsnGlu: 1.176 ± 0.307
AsnPhe: 0.42 ± 0.199
AsnGly: 3.275 ± 0.58
AsnHis: 0.588 ± 0.215
AsnIle: 0.924 ± 0.336
AsnLys: 0.924 ± 0.326
AsnLeu: 2.939 ± 0.606
AsnMet: 0.588 ± 0.197
AsnAsn: 1.344 ± 0.44
AsnPro: 2.436 ± 0.458
AsnGln: 1.176 ± 0.291
AsnArg: 2.771 ± 0.421
AsnSer: 1.344 ± 0.33
AsnThr: 1.512 ± 0.356
AsnVal: 1.008 ± 0.329
AsnTrp: 0.756 ± 0.198
AsnTyr: 0.924 ± 0.208
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.719 ± 0.885
ProCys: 0.504 ± 0.191
ProAsp: 4.283 ± 0.658
ProGlu: 2.436 ± 0.43
ProPhe: 1.344 ± 0.349
ProGly: 3.527 ± 0.539
ProHis: 0.84 ± 0.447
ProIle: 1.68 ± 0.402
ProLys: 1.932 ± 0.447
ProLeu: 4.619 ± 0.477
ProMet: 0.84 ± 0.285
ProAsn: 1.68 ± 0.36
ProPro: 2.268 ± 0.555
ProGln: 2.352 ± 0.493
ProArg: 3.527 ± 0.499
ProSer: 3.443 ± 0.387
ProThr: 2.771 ± 0.481
ProVal: 3.779 ± 0.683
ProTrp: 0.504 ± 0.204
ProTyr: 1.344 ± 0.384
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 6.047 ± 0.985
GlnCys: 0.336 ± 0.169
GlnAsp: 2.016 ± 0.389
GlnGlu: 2.268 ± 0.414
GlnPhe: 1.428 ± 0.339
GlnGly: 4.115 ± 0.631
GlnHis: 0.84 ± 0.25
GlnIle: 2.352 ± 0.475
GlnLys: 1.092 ± 0.36
GlnLeu: 7.391 ± 0.726
GlnMet: 0.672 ± 0.223
GlnAsn: 0.84 ± 0.233
GlnPro: 1.932 ± 0.339
GlnGln: 3.947 ± 0.992
GlnArg: 4.535 ± 0.799
GlnSer: 2.184 ± 0.385
GlnThr: 1.932 ± 0.406
GlnVal: 4.871 ± 0.539
GlnTrp: 0.672 ± 0.276
GlnTyr: 1.092 ± 0.286
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 7.391 ± 0.704
ArgCys: 0.84 ± 0.247
ArgAsp: 4.619 ± 0.622
ArgGlu: 4.955 ± 0.677
ArgPhe: 2.184 ± 0.485
ArgGly: 4.451 ± 0.672
ArgHis: 1.26 ± 0.269
ArgIle: 3.779 ± 0.586
ArgLys: 2.687 ± 0.495
ArgLeu: 7.223 ± 0.846
ArgMet: 1.344 ± 0.342
ArgAsn: 1.764 ± 0.373
ArgPro: 4.031 ± 0.783
ArgGln: 4.115 ± 0.64
ArgArg: 5.795 ± 0.779
ArgSer: 4.535 ± 0.622
ArgThr: 3.359 ± 0.407
ArgVal: 4.535 ± 0.74
ArgTrp: 1.68 ± 0.482
ArgTyr: 2.604 ± 0.441
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 8.398 ± 0.83
SerCys: 0.672 ± 0.226
SerAsp: 3.359 ± 0.579
SerGlu: 2.687 ± 0.345
SerPhe: 1.176 ± 0.241
SerGly: 4.619 ± 0.757
SerHis: 0.84 ± 0.255
SerIle: 2.855 ± 0.418
SerLys: 1.932 ± 0.507
SerLeu: 5.963 ± 0.77
SerMet: 1.092 ± 0.291
SerAsn: 2.1 ± 0.497
SerPro: 3.275 ± 0.689
SerGln: 2.855 ± 0.443
SerArg: 3.023 ± 0.554
SerSer: 3.779 ± 0.582
SerThr: 2.855 ± 0.473
SerVal: 3.107 ± 0.479
SerTrp: 1.092 ± 0.319
SerTyr: 1.932 ± 0.378
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 6.971 ± 0.933
ThrCys: 0.336 ± 0.24
ThrAsp: 3.359 ± 0.567
ThrGlu: 2.604 ± 0.481
ThrPhe: 1.596 ± 0.364
ThrGly: 4.619 ± 0.799
ThrHis: 0.672 ± 0.226
ThrIle: 2.352 ± 0.431
ThrLys: 2.1 ± 0.551
ThrLeu: 5.207 ± 0.888
ThrMet: 1.26 ± 0.311
ThrAsn: 1.512 ± 0.298
ThrPro: 2.939 ± 0.424
ThrGln: 1.848 ± 0.333
ThrArg: 3.023 ± 0.458
ThrSer: 3.275 ± 0.607
ThrThr: 3.695 ± 0.579
ThrVal: 5.375 ± 0.836
ThrTrp: 1.008 ± 0.352
ThrTyr: 1.344 ± 0.323
ThrXaa: 0.0 ± 0.0

Val
ValAla: 8.65 ± 0.886
ValCys: 0.588 ± 0.256
ValAsp: 3.275 ± 0.559
ValGlu: 4.871 ± 0.681
ValPhe: 2.016 ± 0.34
ValGly: 5.207 ± 0.717
ValHis: 0.504 ± 0.149
ValIle: 2.52 ± 0.361
ValLys: 2.268 ± 0.476
ValLeu: 6.383 ± 0.815
ValMet: 1.092 ± 0.412
ValAsn: 1.764 ± 0.291
ValPro: 3.191 ± 0.548
ValGln: 3.107 ± 0.428
ValArg: 4.283 ± 0.628
ValSer: 4.031 ± 0.504
ValThr: 4.703 ± 0.678
ValVal: 4.451 ± 0.577
ValTrp: 1.428 ± 0.289
ValTyr: 2.436 ± 0.535
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 1.344 ± 0.31
TrpCys: 0.42 ± 0.146
TrpAsp: 0.588 ± 0.191
TrpGlu: 0.924 ± 0.263
TrpPhe: 0.756 ± 0.243
TrpGly: 0.924 ± 0.31
TrpHis: 0.084 ± 0.068
TrpIle: 1.008 ± 0.239
TrpLys: 0.924 ± 0.23
TrpLeu: 1.932 ± 0.343
TrpMet: 0.84 ± 0.315
TrpAsn: 0.42 ± 0.181
TrpPro: 1.008 ± 0.29
TrpGln: 0.924 ± 0.248
TrpArg: 1.428 ± 0.341
TrpSer: 1.26 ± 0.338
TrpThr: 1.008 ± 0.251
TrpVal: 1.596 ± 0.335
TrpTrp: 0.504 ± 0.216
TrpTyr: 0.168 ± 0.157
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.939 ± 0.409
TyrCys: 0.336 ± 0.166
TyrAsp: 1.176 ± 0.316
TyrGlu: 1.68 ± 0.393
TyrPhe: 1.008 ± 0.26
TyrGly: 2.436 ± 0.566
TyrHis: 0.588 ± 0.281
TyrIle: 1.26 ± 0.32
TyrLys: 0.756 ± 0.28
TyrLeu: 2.352 ± 0.38
TyrMet: 0.504 ± 0.186
TyrAsn: 1.008 ± 0.261
TyrPro: 1.68 ± 0.495
TyrGln: 1.428 ± 0.281
TyrArg: 2.016 ± 0.361
TyrSer: 2.1 ± 0.445
TyrThr: 1.26 ± 0.367
TyrVal: 2.1 ± 0.409
TyrTrp: 0.42 ± 0.224
TyrTyr: 0.588 ± 0.225
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 54 proteins (11908 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
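One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those replicate values is reported as the error. This is only an illustration under stated assumptions. The function names, the toy sequences, the fixed seed, and the use of the sample standard deviation as the error measure are choices made for this example and are not confirmed by the source.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide over overlapping residue pairs."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the frequency across protein-level bootstrap resamples.

    Assumption: the error is the sample standard deviation of the
    replicate frequencies; the source does not name the statistic.
    """
    rng = random.Random(seed)
    replicates = []
    for _ in range(n_boot):
        # Resample whole proteins (not residues) with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        replicates.append(pair_permille(sample, pair))
    return statistics.stdev(replicates)


# Hypothetical toy proteome, not the LPB1 sequences.
toy = ["MAAL", "MALG", "GGAA", "MKVL"]
print(pair_permille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Under this scheme a dipeptide that never occurs in any protein gets a frequency of 0.0 with an error of 0.0 in every resample.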

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski