Amino acid dipeptide frequency for Freshwater phage uvFW-CGR-AMD-COM-C429

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
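The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: it assumes overlapping adjacent pairs are counted within each protein, that frequencies are normalised by the total number of pairs, and that "expected" means the uniform value of 2.5‰. The function names and the toy sequences are hypothetical.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (the table uses three-letter codes).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation: 1 of 400 dipeptides, i.e. 0.25 % or 2.5 per mille.
UNIFORM = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 400 standard pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping adjacent pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(STANDARD, repeat=2)}


def classify(freq):
    """Label a per mille frequency relative to the uniform expectation."""
    if freq > UNIFORM:
        return "over"    # assumed to correspond to red in the table
    if freq < UNIFORM:
        return "under"   # assumed to correspond to blue in the table
    return "expected"


toy = ["MKAAAL", "AAGK"]  # hypothetical toy sequences, not this proteome
freqs = dipeptide_per_mille(toy)
print(freqs["AA"], classify(freqs["AA"]))  # 375.0 over
```

In the toy input, AA occurs 3 times among 8 pairs, hence 375‰. Real proteomes give far smaller values, as in the table below.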

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
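As a quick worked example of the unit conversion, the sketch below converts the AlaAla value from the table (12.882‰) to a fraction (about 0.0129) and a percentage (about 1.29%), and compares it with the uniform expectation of 2.5‰ (roughly a five-fold excess). The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


ala_ala = 12.882   # AlaAla frequency from the table, in per mille
uniform = 2.5      # uniform expectation in per mille (0.25 %)

print(per_mille_to_fraction(ala_ala))   # fraction of all dipeptides
print(per_mille_to_percent(ala_ala))    # the same value in percent
print(ala_ala / uniform)                # fold difference from the uniform expectation
```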

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 12.882 ± 1.591
AlaCys: 0.888 ± 0.342
AlaAsp: 7.551 ± 1.408
AlaGlu: 6.774 ± 0.776
AlaPhe: 2.887 ± 0.66
AlaGly: 8.773 ± 0.945
AlaHis: 1.888 ± 0.484
AlaIle: 6.219 ± 1.0
AlaLys: 7.329 ± 1.098
AlaLeu: 8.329 ± 1.207
AlaMet: 2.221 ± 0.502
AlaAsn: 4.886 ± 0.658
AlaPro: 4.775 ± 1.213
AlaGln: 4.442 ± 0.769
AlaArg: 4.331 ± 0.805
AlaSer: 7.551 ± 0.994
AlaThr: 7.44 ± 1.242
AlaVal: 7.44 ± 0.872
AlaTrp: 1.777 ± 0.419
AlaTyr: 3.22 ± 0.524
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.444 ± 0.215
CysCys: 0.0 ± 0.0
CysAsp: 0.444 ± 0.225
CysGlu: 0.555 ± 0.312
CysPhe: 0.111 ± 0.101
CysGly: 0.555 ± 0.373
CysHis: 0.0 ± 0.0
CysIle: 0.333 ± 0.174
CysLys: 0.222 ± 0.165
CysLeu: 1.11 ± 0.416
CysMet: 0.222 ± 0.198
CysAsn: 0.222 ± 0.161
CysPro: 0.333 ± 0.208
CysGln: 0.333 ± 0.174
CysArg: 0.555 ± 0.249
CysSer: 0.444 ± 0.216
CysThr: 0.333 ± 0.208
CysVal: 0.777 ± 0.315
CysTrp: 0.333 ± 0.196
CysTyr: 0.0 ± 0.0
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.552 ± 0.699
AspCys: 0.444 ± 0.297
AspAsp: 3.554 ± 0.785
AspGlu: 3.331 ± 0.63
AspPhe: 1.333 ± 0.37
AspGly: 3.22 ± 0.547
AspHis: 1.11 ± 0.476
AspIle: 2.332 ± 0.45
AspLys: 3.776 ± 0.639
AspLeu: 5.33 ± 0.717
AspMet: 1.666 ± 0.444
AspAsn: 2.554 ± 0.421
AspPro: 3.331 ± 0.604
AspGln: 1.666 ± 0.394
AspArg: 3.22 ± 0.451
AspSer: 2.998 ± 0.399
AspThr: 3.331 ± 0.507
AspVal: 3.109 ± 0.531
AspTrp: 1.555 ± 0.446
AspTyr: 0.999 ± 0.275
AspXaa: 0.0 ± 0.0
Glu
GluAla: 6.774 ± 1.083
GluCys: 0.444 ± 0.199
GluAsp: 3.22 ± 0.534
GluGlu: 3.665 ± 0.703
GluPhe: 2.665 ± 0.726
GluGly: 2.887 ± 0.618
GluHis: 1.333 ± 0.471
GluIle: 3.887 ± 0.817
GluLys: 2.887 ± 0.513
GluLeu: 5.664 ± 0.612
GluMet: 2.221 ± 0.67
GluAsn: 2.776 ± 0.481
GluPro: 2.665 ± 0.425
GluGln: 2.443 ± 0.607
GluArg: 3.331 ± 0.546
GluSer: 3.109 ± 0.516
GluThr: 3.443 ± 0.67
GluVal: 3.331 ± 0.663
GluTrp: 1.444 ± 0.407
GluTyr: 2.221 ± 0.436
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.554 ± 0.551
PheCys: 0.111 ± 0.093
PheAsp: 1.555 ± 0.326
PheGlu: 2.554 ± 0.432
PhePhe: 0.999 ± 0.317
PheGly: 2.998 ± 0.722
PheHis: 0.444 ± 0.208
PheIle: 1.333 ± 0.383
PheLys: 1.11 ± 0.377
PheLeu: 1.999 ± 0.72
PheMet: 0.444 ± 0.211
PheAsn: 0.999 ± 0.255
PhePro: 0.888 ± 0.409
PheGln: 0.888 ± 0.293
PheArg: 1.11 ± 0.411
PheSer: 1.888 ± 0.494
PheThr: 1.999 ± 0.393
PheVal: 1.666 ± 0.452
PheTrp: 0.444 ± 0.224
PheTyr: 0.999 ± 0.329
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.775 ± 0.869
GlyCys: 0.444 ± 0.196
GlyAsp: 3.887 ± 0.65
GlyGlu: 3.554 ± 0.613
GlyPhe: 1.999 ± 0.624
GlyGly: 4.997 ± 0.908
GlyHis: 1.222 ± 0.402
GlyIle: 4.886 ± 0.712
GlyLys: 4.886 ± 0.878
GlyLeu: 5.886 ± 0.724
GlyMet: 2.11 ± 0.359
GlyAsn: 4.109 ± 0.621
GlyPro: 2.11 ± 0.362
GlyGln: 2.221 ± 0.428
GlyArg: 3.554 ± 0.606
GlySer: 6.996 ± 1.195
GlyThr: 5.664 ± 1.461
GlyVal: 3.998 ± 0.533
GlyTrp: 1.11 ± 0.37
GlyTyr: 4.553 ± 0.937
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.888 ± 0.372
HisCys: 0.222 ± 0.16
HisAsp: 1.333 ± 0.33
HisGlu: 0.333 ± 0.203
HisPhe: 0.888 ± 0.362
HisGly: 1.777 ± 0.47
HisHis: 0.333 ± 0.196
HisIle: 1.777 ± 0.484
HisLys: 0.555 ± 0.239
HisLeu: 1.666 ± 0.562
HisMet: 0.333 ± 0.198
HisAsn: 0.555 ± 0.25
HisPro: 1.333 ± 0.325
HisGln: 0.444 ± 0.182
HisArg: 0.777 ± 0.249
HisSer: 0.777 ± 0.288
HisThr: 0.777 ± 0.233
HisVal: 0.888 ± 0.352
HisTrp: 0.444 ± 0.242
HisTyr: 0.444 ± 0.195
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.885 ± 0.85
IleCys: 0.666 ± 0.335
IleAsp: 3.776 ± 0.684
IleGlu: 3.22 ± 0.578
IlePhe: 0.888 ± 0.299
IleGly: 3.22 ± 0.795
IleHis: 0.333 ± 0.199
IleIle: 2.332 ± 0.544
IleLys: 3.443 ± 0.684
IleLeu: 1.888 ± 0.437
IleMet: 0.888 ± 0.325
IleAsn: 2.443 ± 0.594
IlePro: 2.332 ± 0.453
IleGln: 1.888 ± 0.37
IleArg: 2.887 ± 0.542
IleSer: 3.665 ± 0.59
IleThr: 4.109 ± 0.747
IleVal: 2.998 ± 0.551
IleTrp: 0.888 ± 0.422
IleTyr: 1.222 ± 0.283
IleXaa: 0.0 ± 0.0
Lys
LysAla: 7.218 ± 1.0
LysCys: 0.444 ± 0.244
LysAsp: 3.554 ± 0.63
LysGlu: 3.998 ± 0.881
LysPhe: 1.333 ± 0.39
LysGly: 3.109 ± 0.605
LysHis: 0.777 ± 0.232
LysIle: 2.221 ± 0.483
LysLys: 4.886 ± 0.925
LysLeu: 4.886 ± 0.623
LysMet: 2.332 ± 0.458
LysAsn: 2.776 ± 0.487
LysPro: 3.776 ± 0.663
LysGln: 3.109 ± 0.573
LysArg: 2.554 ± 0.682
LysSer: 3.776 ± 0.696
LysThr: 5.441 ± 0.944
LysVal: 2.887 ± 0.499
LysTrp: 0.444 ± 0.185
LysTyr: 1.777 ± 0.391
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 8.884 ± 1.154
LeuCys: 0.888 ± 0.336
LeuAsp: 4.109 ± 0.667
LeuGlu: 4.331 ± 0.67
LeuPhe: 1.777 ± 0.44
LeuGly: 6.108 ± 0.879
LeuHis: 2.11 ± 0.558
LeuIle: 3.22 ± 0.829
LeuLys: 4.442 ± 0.617
LeuLeu: 5.775 ± 1.166
LeuMet: 1.666 ± 0.605
LeuAsn: 3.887 ± 0.67
LeuPro: 2.776 ± 0.625
LeuGln: 2.11 ± 0.367
LeuArg: 5.33 ± 0.773
LeuSer: 6.774 ± 0.667
LeuThr: 5.108 ± 0.84
LeuVal: 4.109 ± 0.736
LeuTrp: 0.777 ± 0.415
LeuTyr: 2.11 ± 0.572
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.22 ± 0.71
MetCys: 0.222 ± 0.166
MetAsp: 1.11 ± 0.307
MetGlu: 1.888 ± 0.373
MetPhe: 0.888 ± 0.287
MetGly: 1.222 ± 0.342
MetHis: 0.333 ± 0.216
MetIle: 0.999 ± 0.275
MetLys: 1.666 ± 0.585
MetLeu: 1.777 ± 0.431
MetMet: 0.444 ± 0.314
MetAsn: 0.555 ± 0.271
MetPro: 1.999 ± 0.544
MetGln: 1.11 ± 0.338
MetArg: 1.333 ± 0.454
MetSer: 1.888 ± 0.485
MetThr: 2.332 ± 0.653
MetVal: 1.11 ± 0.354
MetTrp: 0.333 ± 0.219
MetTyr: 1.11 ± 0.366
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.887 ± 0.661
AsnCys: 0.444 ± 0.311
AsnAsp: 2.776 ± 0.592
AsnGlu: 1.777 ± 0.42
AsnPhe: 0.999 ± 0.241
AsnGly: 2.332 ± 0.553
AsnHis: 0.555 ± 0.282
AsnIle: 2.332 ± 0.481
AsnLys: 3.331 ± 0.522
AsnLeu: 3.22 ± 0.561
AsnMet: 1.444 ± 0.404
AsnAsn: 1.888 ± 0.584
AsnPro: 3.109 ± 0.796
AsnGln: 1.888 ± 0.362
AsnArg: 1.777 ± 0.421
AsnSer: 3.443 ± 0.653
AsnThr: 3.998 ± 1.525
AsnVal: 3.443 ± 0.718
AsnTrp: 0.222 ± 0.14
AsnTyr: 1.444 ± 0.466
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.775 ± 0.762
ProCys: 0.111 ± 0.113
ProAsp: 2.554 ± 0.39
ProGlu: 3.331 ± 0.561
ProPhe: 1.555 ± 0.455
ProGly: 3.443 ± 0.716
ProHis: 0.777 ± 0.226
ProIle: 1.777 ± 0.415
ProLys: 3.331 ± 0.747
ProLeu: 2.776 ± 0.58
ProMet: 1.11 ± 0.333
ProAsn: 2.554 ± 0.673
ProPro: 3.109 ± 0.562
ProGln: 1.555 ± 0.39
ProArg: 2.332 ± 0.495
ProSer: 2.776 ± 0.472
ProThr: 3.665 ± 0.753
ProVal: 3.998 ± 0.89
ProTrp: 0.666 ± 0.176
ProTyr: 0.777 ± 0.298
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 5.219 ± 0.832
GlnCys: 0.111 ± 0.126
GlnAsp: 1.333 ± 0.348
GlnGlu: 3.109 ± 0.679
GlnPhe: 0.888 ± 0.232
GlnGly: 1.777 ± 0.4
GlnHis: 0.333 ± 0.215
GlnIle: 2.11 ± 0.457
GlnLys: 1.777 ± 0.465
GlnLeu: 3.554 ± 0.684
GlnMet: 1.11 ± 0.412
GlnAsn: 1.333 ± 0.347
GlnPro: 0.999 ± 0.427
GlnGln: 1.999 ± 0.517
GlnArg: 2.776 ± 0.629
GlnSer: 1.666 ± 0.493
GlnThr: 1.888 ± 0.386
GlnVal: 3.109 ± 0.677
GlnTrp: 0.333 ± 0.178
GlnTyr: 1.333 ± 0.276
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 5.219 ± 0.922
ArgCys: 0.444 ± 0.225
ArgAsp: 2.332 ± 0.466
ArgGlu: 2.998 ± 0.595
ArgPhe: 1.888 ± 0.417
ArgGly: 2.332 ± 0.397
ArgHis: 0.777 ± 0.289
ArgIle: 2.665 ± 0.441
ArgLys: 3.554 ± 0.641
ArgLeu: 4.553 ± 0.724
ArgMet: 1.888 ± 0.436
ArgAsn: 1.444 ± 0.381
ArgPro: 1.777 ± 0.349
ArgGln: 1.222 ± 0.383
ArgArg: 3.331 ± 0.816
ArgSer: 3.665 ± 0.601
ArgThr: 4.442 ± 0.99
ArgVal: 3.776 ± 0.758
ArgTrp: 0.555 ± 0.246
ArgTyr: 1.888 ± 0.679
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 7.773 ± 1.344
SerCys: 0.222 ± 0.18
SerAsp: 2.887 ± 0.646
SerGlu: 3.776 ± 0.566
SerPhe: 1.999 ± 0.49
SerGly: 8.107 ± 1.365
SerHis: 1.222 ± 0.367
SerIle: 3.776 ± 0.713
SerLys: 3.331 ± 0.672
SerLeu: 4.775 ± 0.569
SerMet: 1.555 ± 0.453
SerAsn: 3.331 ± 0.993
SerPro: 3.554 ± 0.665
SerGln: 2.554 ± 0.443
SerArg: 2.887 ± 0.495
SerSer: 5.33 ± 0.882
SerThr: 5.997 ± 0.957
SerVal: 4.442 ± 0.532
SerTrp: 0.999 ± 0.484
SerTyr: 2.554 ± 0.669
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 9.994 ± 1.801
ThrCys: 0.333 ± 0.271
ThrAsp: 2.998 ± 0.507
ThrGlu: 3.887 ± 0.554
ThrPhe: 1.333 ± 0.312
ThrGly: 8.551 ± 1.601
ThrHis: 1.333 ± 0.372
ThrIle: 3.109 ± 0.627
ThrLys: 3.443 ± 1.079
ThrLeu: 6.219 ± 1.173
ThrMet: 1.11 ± 0.361
ThrAsn: 2.221 ± 0.468
ThrPro: 4.553 ± 1.502
ThrGln: 1.666 ± 0.358
ThrArg: 2.332 ± 0.476
ThrSer: 6.219 ± 1.035
ThrThr: 6.552 ± 1.588
ThrVal: 5.219 ± 0.918
ThrTrp: 0.666 ± 0.258
ThrTyr: 2.221 ± 0.595
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.885 ± 1.174
ValCys: 0.666 ± 0.332
ValAsp: 4.109 ± 0.596
ValGlu: 4.553 ± 0.619
ValPhe: 1.333 ± 0.332
ValGly: 4.997 ± 0.845
ValHis: 1.444 ± 0.401
ValIle: 3.109 ± 0.645
ValLys: 4.331 ± 0.827
ValLeu: 3.887 ± 0.524
ValMet: 1.666 ± 0.56
ValAsn: 3.22 ± 0.724
ValPro: 1.888 ± 0.498
ValGln: 2.11 ± 0.465
ValArg: 2.776 ± 0.599
ValSer: 4.997 ± 0.651
ValThr: 4.553 ± 0.703
ValVal: 5.108 ± 0.878
ValTrp: 0.555 ± 0.284
ValTyr: 1.777 ± 0.415
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.222 ± 0.297
TrpCys: 0.111 ± 0.115
TrpAsp: 0.444 ± 0.173
TrpGlu: 1.222 ± 0.358
TrpPhe: 0.333 ± 0.145
TrpGly: 0.555 ± 0.227
TrpHis: 0.666 ± 0.257
TrpIle: 0.555 ± 0.225
TrpLys: 1.11 ± 0.385
TrpLeu: 0.777 ± 0.248
TrpMet: 0.222 ± 0.191
TrpAsn: 0.444 ± 0.212
TrpPro: 0.333 ± 0.173
TrpGln: 1.333 ± 0.531
TrpArg: 1.222 ± 0.381
TrpSer: 1.333 ± 0.282
TrpThr: 0.999 ± 0.503
TrpVal: 0.444 ± 0.171
TrpTrp: 0.111 ± 0.093
TrpTyr: 0.444 ± 0.247
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.554 ± 0.818
TyrCys: 0.111 ± 0.117
TyrAsp: 1.888 ± 0.609
TyrGlu: 1.555 ± 0.375
TyrPhe: 0.888 ± 0.261
TyrGly: 2.998 ± 0.482
TyrHis: 0.555 ± 0.182
TyrIle: 0.999 ± 0.409
TyrLys: 1.888 ± 0.465
TyrLeu: 2.332 ± 0.614
TyrMet: 0.777 ± 0.271
TyrAsn: 2.11 ± 0.622
TyrPro: 1.333 ± 0.438
TyrGln: 1.777 ± 0.48
TyrArg: 2.332 ± 0.458
TyrSer: 1.777 ± 0.369
TyrThr: 1.888 ± 0.37
TyrVal: 2.11 ± 0.339
TyrTrp: 0.222 ± 0.132
TyrTyr: 0.888 ± 0.295
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 40 proteins (9006 amino acids)

Note: The error has been estimated by bootstrapping (100 resamples) at the protein level.
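A protein-level bootstrap of this kind can be sketched as follows. This is an illustrative assumption about the procedure, not the database's actual implementation: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each of 100 resamples, and the sample standard deviation of those estimates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide (per mille) over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


toy = ["MKAAAL", "AAGK", "GGSL"]  # hypothetical toy sequences, not this proteome
print(pair_per_mille(toy, "AA"), "±", bootstrap_error(toy, "AA"))
```

Resampling proteins rather than individual residues keeps each sequence intact, so the spread reflects how much a frequency depends on which proteins happen to be in the proteome.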

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski