Amino acid dipepetide frequency for Thermococcus prieurii virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.816AlaAla: 2.816 ± 0.639
0.156AlaCys: 0.156 ± 0.135
2.816AlaAsp: 2.816 ± 0.66
5.164AlaGlu: 5.164 ± 0.831
4.538AlaPhe: 4.538 ± 1.177
4.694AlaGly: 4.694 ± 0.887
1.252AlaHis: 1.252 ± 0.441
4.851AlaIle: 4.851 ± 0.813
4.851AlaLys: 4.851 ± 0.81
8.449AlaLeu: 8.449 ± 1.118
2.66AlaMet: 2.66 ± 0.648
2.66AlaAsn: 2.66 ± 0.635
2.66AlaPro: 2.66 ± 0.641
2.66AlaGln: 2.66 ± 0.574
5.007AlaArg: 5.007 ± 0.691
4.851AlaSer: 4.851 ± 0.672
2.816AlaThr: 2.816 ± 0.594
4.851AlaVal: 4.851 ± 0.837
1.408AlaTrp: 1.408 ± 0.483
2.347AlaTyr: 2.347 ± 0.975
0.0AlaXaa: 0.0 ± 0.0
Cys
0.313CysAla: 0.313 ± 0.338
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.156CysGlu: 0.156 ± 0.135
0.313CysPhe: 0.313 ± 0.246
0.939CysGly: 0.939 ± 0.381
0.156CysHis: 0.156 ± 0.161
0.156CysIle: 0.156 ± 0.156
0.156CysLys: 0.156 ± 0.169
0.469CysLeu: 0.469 ± 0.308
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.313CysPro: 0.313 ± 0.242
0.156CysGln: 0.156 ± 0.135
0.313CysArg: 0.313 ± 0.248
0.156CysSer: 0.156 ± 0.157
0.156CysThr: 0.156 ± 0.147
0.156CysVal: 0.156 ± 0.153
0.0CysTrp: 0.0 ± 0.0
0.156CysTyr: 0.156 ± 0.176
0.0CysXaa: 0.0 ± 0.0
Asp
2.504AspAla: 2.504 ± 0.682
0.313AspCys: 0.313 ± 0.191
1.408AspAsp: 1.408 ± 0.555
4.068AspGlu: 4.068 ± 0.898
1.721AspPhe: 1.721 ± 0.597
2.034AspGly: 2.034 ± 0.501
0.469AspHis: 0.469 ± 0.243
3.755AspIle: 3.755 ± 0.828
2.816AspLys: 2.816 ± 0.762
4.068AspLeu: 4.068 ± 0.765
1.408AspMet: 1.408 ± 0.322
1.252AspAsn: 1.252 ± 0.527
2.973AspPro: 2.973 ± 0.794
0.782AspGln: 0.782 ± 0.312
2.347AspArg: 2.347 ± 0.727
2.973AspSer: 2.973 ± 0.657
1.878AspThr: 1.878 ± 0.418
5.007AspVal: 5.007 ± 0.791
0.782AspTrp: 0.782 ± 0.275
1.878AspTyr: 1.878 ± 0.483
0.0AspXaa: 0.0 ± 0.0
Glu
5.32GluAla: 5.32 ± 1.033
0.156GluCys: 0.156 ± 0.194
4.225GluAsp: 4.225 ± 0.824
7.824GluGlu: 7.824 ± 2.118
2.504GluPhe: 2.504 ± 0.529
4.851GluGly: 4.851 ± 1.356
1.095GluHis: 1.095 ± 0.451
4.538GluIle: 4.538 ± 0.818
7.198GluLys: 7.198 ± 1.36
10.014GluLeu: 10.014 ± 1.491
1.721GluMet: 1.721 ± 0.696
1.721GluAsn: 1.721 ± 0.453
1.408GluPro: 1.408 ± 0.426
3.755GluGln: 3.755 ± 0.773
5.007GluArg: 5.007 ± 1.139
3.599GluSer: 3.599 ± 0.659
3.912GluThr: 3.912 ± 1.035
4.381GluVal: 4.381 ± 0.993
0.939GluTrp: 0.939 ± 0.357
2.347GluTyr: 2.347 ± 0.589
0.0GluXaa: 0.0 ± 0.0
Phe
3.442PheAla: 3.442 ± 0.685
0.156PheCys: 0.156 ± 0.147
2.034PheAsp: 2.034 ± 0.477
4.381PheGlu: 4.381 ± 0.891
2.034PhePhe: 2.034 ± 0.615
2.347PheGly: 2.347 ± 0.521
0.782PheHis: 0.782 ± 0.287
2.816PheIle: 2.816 ± 0.874
3.129PheLys: 3.129 ± 0.679
3.755PheLeu: 3.755 ± 0.624
1.252PheMet: 1.252 ± 0.531
1.878PheAsn: 1.878 ± 0.419
1.878PhePro: 1.878 ± 0.591
0.782PheGln: 0.782 ± 0.38
2.347PheArg: 2.347 ± 0.752
3.912PheSer: 3.912 ± 0.799
3.912PheThr: 3.912 ± 0.933
2.034PheVal: 2.034 ± 0.668
0.469PheTrp: 0.469 ± 0.266
1.721PheTyr: 1.721 ± 0.379
0.0PheXaa: 0.0 ± 0.0
Gly
5.633GlyAla: 5.633 ± 0.947
0.313GlyCys: 0.313 ± 0.257
3.912GlyAsp: 3.912 ± 0.798
4.694GlyGlu: 4.694 ± 0.683
3.599GlyPhe: 3.599 ± 0.999
5.164GlyGly: 5.164 ± 0.708
1.565GlyHis: 1.565 ± 0.466
4.381GlyIle: 4.381 ± 0.625
5.32GlyLys: 5.32 ± 0.956
6.259GlyLeu: 6.259 ± 1.154
2.816GlyMet: 2.816 ± 0.906
3.755GlyAsn: 3.755 ± 1.109
2.191GlyPro: 2.191 ± 0.67
2.034GlyGln: 2.034 ± 0.672
5.476GlyArg: 5.476 ± 1.431
3.755GlySer: 3.755 ± 1.002
4.694GlyThr: 4.694 ± 1.433
3.755GlyVal: 3.755 ± 0.83
1.721GlyTrp: 1.721 ± 0.496
1.721GlyTyr: 1.721 ± 0.415
0.0GlyXaa: 0.0 ± 0.0
His
1.408HisAla: 1.408 ± 0.483
0.0HisCys: 0.0 ± 0.0
0.626HisAsp: 0.626 ± 0.306
2.034HisGlu: 2.034 ± 0.56
0.469HisPhe: 0.469 ± 0.247
1.252HisGly: 1.252 ± 0.353
0.156HisHis: 0.156 ± 0.147
0.939HisIle: 0.939 ± 0.437
1.095HisLys: 1.095 ± 0.441
1.721HisLeu: 1.721 ± 0.477
0.313HisMet: 0.313 ± 0.242
0.0HisAsn: 0.0 ± 0.0
0.939HisPro: 0.939 ± 0.375
0.469HisGln: 0.469 ± 0.234
1.095HisArg: 1.095 ± 0.408
0.156HisSer: 0.156 ± 0.174
0.313HisThr: 0.313 ± 0.179
1.408HisVal: 1.408 ± 0.562
0.313HisTrp: 0.313 ± 0.243
0.156HisTyr: 0.156 ± 0.192
0.0HisXaa: 0.0 ± 0.0
Ile
5.164IleAla: 5.164 ± 0.897
0.156IleCys: 0.156 ± 0.176
2.347IleAsp: 2.347 ± 0.551
3.286IleGlu: 3.286 ± 0.637
1.721IlePhe: 1.721 ± 0.469
4.694IleGly: 4.694 ± 1.033
0.626IleHis: 0.626 ± 0.331
4.694IleIle: 4.694 ± 1.074
4.068IleLys: 4.068 ± 0.569
5.633IleLeu: 5.633 ± 1.045
0.626IleMet: 0.626 ± 0.39
1.878IleAsn: 1.878 ± 0.485
3.912IlePro: 3.912 ± 1.203
0.939IleGln: 0.939 ± 0.297
2.973IleArg: 2.973 ± 0.873
3.755IleSer: 3.755 ± 0.716
4.851IleThr: 4.851 ± 0.804
5.164IleVal: 5.164 ± 0.924
0.782IleTrp: 0.782 ± 0.423
3.286IleTyr: 3.286 ± 1.056
0.0IleXaa: 0.0 ± 0.0
Lys
5.32LysAla: 5.32 ± 1.122
0.313LysCys: 0.313 ± 0.231
2.347LysAsp: 2.347 ± 0.718
6.572LysGlu: 6.572 ± 1.147
3.599LysPhe: 3.599 ± 0.779
4.538LysGly: 4.538 ± 0.885
1.721LysHis: 1.721 ± 0.589
4.851LysIle: 4.851 ± 1.101
4.225LysLys: 4.225 ± 1.1
7.667LysLeu: 7.667 ± 1.289
0.469LysMet: 0.469 ± 0.259
3.442LysAsn: 3.442 ± 0.715
3.129LysPro: 3.129 ± 0.953
2.034LysGln: 2.034 ± 0.539
2.973LysArg: 2.973 ± 0.682
4.851LysSer: 4.851 ± 1.015
3.755LysThr: 3.755 ± 0.841
5.633LysVal: 5.633 ± 1.201
1.252LysTrp: 1.252 ± 0.495
1.878LysTyr: 1.878 ± 0.488
0.0LysXaa: 0.0 ± 0.0
Leu
10.327LeuAla: 10.327 ± 1.378
0.313LeuCys: 0.313 ± 0.246
4.068LeuAsp: 4.068 ± 1.08
7.511LeuGlu: 7.511 ± 1.407
3.755LeuPhe: 3.755 ± 0.633
6.572LeuGly: 6.572 ± 0.919
1.721LeuHis: 1.721 ± 0.483
5.007LeuIle: 5.007 ± 0.906
6.572LeuLys: 6.572 ± 1.004
9.075LeuLeu: 9.075 ± 1.432
2.034LeuMet: 2.034 ± 0.39
5.007LeuAsn: 5.007 ± 0.954
5.164LeuPro: 5.164 ± 0.77
2.034LeuGln: 2.034 ± 0.521
5.946LeuArg: 5.946 ± 1.161
7.198LeuSer: 7.198 ± 1.025
3.442LeuThr: 3.442 ± 0.771
7.511LeuVal: 7.511 ± 1.125
2.191LeuTrp: 2.191 ± 0.553
3.599LeuTyr: 3.599 ± 0.57
0.0LeuXaa: 0.0 ± 0.0
Met
1.252MetAla: 1.252 ± 0.505
0.0MetCys: 0.0 ± 0.0
0.469MetAsp: 0.469 ± 0.262
1.565MetGlu: 1.565 ± 0.485
1.252MetPhe: 1.252 ± 0.45
1.408MetGly: 1.408 ± 0.406
0.313MetHis: 0.313 ± 0.167
1.095MetIle: 1.095 ± 0.372
1.252MetLys: 1.252 ± 0.307
1.878MetLeu: 1.878 ± 0.695
0.313MetMet: 0.313 ± 0.215
0.939MetAsn: 0.939 ± 0.432
1.252MetPro: 1.252 ± 0.476
0.469MetGln: 0.469 ± 0.276
1.252MetArg: 1.252 ± 0.505
1.565MetSer: 1.565 ± 0.402
1.565MetThr: 1.565 ± 0.385
2.191MetVal: 2.191 ± 0.776
0.939MetTrp: 0.939 ± 0.41
0.626MetTyr: 0.626 ± 0.391
0.0MetXaa: 0.0 ± 0.0
Asn
1.878AsnAla: 1.878 ± 0.528
0.0AsnCys: 0.0 ± 0.0
2.504AsnAsp: 2.504 ± 0.504
2.034AsnGlu: 2.034 ± 0.797
1.408AsnPhe: 1.408 ± 0.372
4.538AsnGly: 4.538 ± 1.379
0.782AsnHis: 0.782 ± 0.459
1.721AsnIle: 1.721 ± 0.391
2.66AsnLys: 2.66 ± 0.649
3.286AsnLeu: 3.286 ± 0.61
0.782AsnMet: 0.782 ± 0.388
1.095AsnAsn: 1.095 ± 0.326
1.878AsnPro: 1.878 ± 0.523
1.252AsnGln: 1.252 ± 0.561
1.252AsnArg: 1.252 ± 0.441
3.442AsnSer: 3.442 ± 0.849
2.034AsnThr: 2.034 ± 0.703
4.694AsnVal: 4.694 ± 0.843
0.626AsnTrp: 0.626 ± 0.329
0.939AsnTyr: 0.939 ± 0.429
0.0AsnXaa: 0.0 ± 0.0
Pro
3.912ProAla: 3.912 ± 0.767
0.313ProCys: 0.313 ± 0.204
1.878ProAsp: 1.878 ± 0.646
5.007ProGlu: 5.007 ± 0.877
1.408ProPhe: 1.408 ± 0.497
3.599ProGly: 3.599 ± 0.863
0.939ProHis: 0.939 ± 0.469
3.599ProIle: 3.599 ± 0.795
3.755ProLys: 3.755 ± 0.996
2.66ProLeu: 2.66 ± 0.723
0.313ProMet: 0.313 ± 0.169
2.504ProAsn: 2.504 ± 0.848
3.912ProPro: 3.912 ± 1.165
2.66ProGln: 2.66 ± 1.019
3.755ProArg: 3.755 ± 1.006
2.66ProSer: 2.66 ± 0.576
1.565ProThr: 1.565 ± 0.393
2.347ProVal: 2.347 ± 0.537
1.408ProTrp: 1.408 ± 0.508
0.626ProTyr: 0.626 ± 0.37
0.0ProXaa: 0.0 ± 0.0
Gln
2.973GlnAla: 2.973 ± 0.958
0.156GlnCys: 0.156 ± 0.169
1.095GlnAsp: 1.095 ± 0.494
2.347GlnGlu: 2.347 ± 0.908
1.408GlnPhe: 1.408 ± 0.385
1.878GlnGly: 1.878 ± 0.39
0.626GlnHis: 0.626 ± 0.317
2.816GlnIle: 2.816 ± 0.557
2.504GlnLys: 2.504 ± 0.845
2.816GlnLeu: 2.816 ± 0.655
0.156GlnMet: 0.156 ± 0.139
1.408GlnAsn: 1.408 ± 0.36
1.408GlnPro: 1.408 ± 0.368
0.782GlnGln: 0.782 ± 0.415
0.469GlnArg: 0.469 ± 0.275
1.721GlnSer: 1.721 ± 0.453
2.347GlnThr: 2.347 ± 0.621
3.442GlnVal: 3.442 ± 0.898
0.0GlnTrp: 0.0 ± 0.0
1.252GlnTyr: 1.252 ± 0.473
0.0GlnXaa: 0.0 ± 0.0
Arg
4.538ArgAla: 4.538 ± 0.86
0.156ArgCys: 0.156 ± 0.153
2.973ArgAsp: 2.973 ± 0.787
4.694ArgGlu: 4.694 ± 0.961
2.66ArgPhe: 2.66 ± 0.769
5.633ArgGly: 5.633 ± 1.365
0.313ArgHis: 0.313 ± 0.21
2.034ArgIle: 2.034 ± 0.564
5.007ArgLys: 5.007 ± 1.136
3.755ArgLeu: 3.755 ± 0.827
0.939ArgMet: 0.939 ± 0.399
1.878ArgAsn: 1.878 ± 0.619
3.286ArgPro: 3.286 ± 0.716
1.565ArgGln: 1.565 ± 0.363
5.476ArgArg: 5.476 ± 1.6
3.129ArgSer: 3.129 ± 0.779
1.721ArgThr: 1.721 ± 0.463
4.225ArgVal: 4.225 ± 0.79
1.095ArgTrp: 1.095 ± 0.324
1.878ArgTyr: 1.878 ± 0.452
0.0ArgXaa: 0.0 ± 0.0
Ser
3.129SerAla: 3.129 ± 0.653
0.469SerCys: 0.469 ± 0.275
1.408SerAsp: 1.408 ± 0.47
4.068SerGlu: 4.068 ± 0.745
3.755SerPhe: 3.755 ± 0.981
5.007SerGly: 5.007 ± 0.92
0.156SerHis: 0.156 ± 0.139
4.225SerIle: 4.225 ± 1.073
4.381SerLys: 4.381 ± 0.995
5.789SerLeu: 5.789 ± 0.953
2.191SerMet: 2.191 ± 0.599
1.565SerAsn: 1.565 ± 0.782
3.442SerPro: 3.442 ± 0.621
3.442SerGln: 3.442 ± 1.128
4.538SerArg: 4.538 ± 0.825
5.164SerSer: 5.164 ± 1.817
3.599SerThr: 3.599 ± 0.767
2.816SerVal: 2.816 ± 0.725
1.408SerTrp: 1.408 ± 0.444
3.286SerTyr: 3.286 ± 0.878
0.0SerXaa: 0.0 ± 0.0
Thr
3.599ThrAla: 3.599 ± 0.516
0.156ThrCys: 0.156 ± 0.153
1.721ThrAsp: 1.721 ± 0.444
2.66ThrGlu: 2.66 ± 0.576
2.816ThrPhe: 2.816 ± 0.789
5.164ThrGly: 5.164 ± 1.3
0.782ThrHis: 0.782 ± 0.46
3.129ThrIle: 3.129 ± 0.672
3.129ThrLys: 3.129 ± 0.895
6.259ThrLeu: 6.259 ± 0.97
1.095ThrMet: 1.095 ± 0.366
1.721ThrAsn: 1.721 ± 0.59
3.599ThrPro: 3.599 ± 0.934
1.095ThrGln: 1.095 ± 0.368
1.408ThrArg: 1.408 ± 0.317
2.973ThrSer: 2.973 ± 0.745
3.599ThrThr: 3.599 ± 0.726
4.381ThrVal: 4.381 ± 0.942
0.156ThrTrp: 0.156 ± 0.156
1.721ThrTyr: 1.721 ± 0.61
0.0ThrXaa: 0.0 ± 0.0
Val
4.694ValAla: 4.694 ± 0.956
0.469ValCys: 0.469 ± 0.33
5.32ValAsp: 5.32 ± 0.864
5.164ValGlu: 5.164 ± 0.921
3.286ValPhe: 3.286 ± 0.813
4.225ValGly: 4.225 ± 0.831
0.939ValHis: 0.939 ± 0.398
3.286ValIle: 3.286 ± 0.921
5.007ValLys: 5.007 ± 0.921
8.449ValLeu: 8.449 ± 1.135
0.939ValMet: 0.939 ± 0.494
3.599ValAsn: 3.599 ± 0.554
4.225ValPro: 4.225 ± 0.824
2.504ValGln: 2.504 ± 0.638
2.816ValArg: 2.816 ± 0.715
5.164ValSer: 5.164 ± 1.121
2.816ValThr: 2.816 ± 0.697
6.728ValVal: 6.728 ± 1.779
0.939ValTrp: 0.939 ± 0.29
4.068ValTyr: 4.068 ± 0.724
0.0ValXaa: 0.0 ± 0.0
Trp
0.782TrpAla: 0.782 ± 0.465
0.156TrpCys: 0.156 ± 0.152
1.252TrpAsp: 1.252 ± 0.434
0.939TrpGlu: 0.939 ± 0.377
1.252TrpPhe: 1.252 ± 0.301
1.721TrpGly: 1.721 ± 0.433
0.156TrpHis: 0.156 ± 0.156
1.408TrpIle: 1.408 ± 0.436
1.565TrpLys: 1.565 ± 0.499
2.504TrpLeu: 2.504 ± 0.687
0.469TrpMet: 0.469 ± 0.375
0.626TrpAsn: 0.626 ± 0.351
0.469TrpPro: 0.469 ± 0.257
0.469TrpGln: 0.469 ± 0.236
0.626TrpArg: 0.626 ± 0.315
0.469TrpSer: 0.469 ± 0.234
0.313TrpThr: 0.313 ± 0.181
1.095TrpVal: 1.095 ± 0.363
0.626TrpTrp: 0.626 ± 0.349
0.626TrpTyr: 0.626 ± 0.296
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.504TyrAla: 2.504 ± 0.573
0.313TyrCys: 0.313 ± 0.264
2.034TyrAsp: 2.034 ± 0.556
2.347TyrGlu: 2.347 ± 0.672
1.565TyrPhe: 1.565 ± 0.442
2.504TyrGly: 2.504 ± 0.52
0.313TyrHis: 0.313 ± 0.179
1.252TyrIle: 1.252 ± 0.45
1.878TyrLys: 1.878 ± 0.555
4.538TyrLeu: 4.538 ± 0.884
0.939TyrMet: 0.939 ± 0.379
1.878TyrAsn: 1.878 ± 0.686
0.782TyrPro: 0.782 ± 0.272
1.721TyrGln: 1.721 ± 0.546
1.878TyrArg: 1.878 ± 0.477
2.504TyrSer: 2.504 ± 0.619
1.878TyrThr: 1.878 ± 0.624
2.816TyrVal: 2.816 ± 0.551
0.469TyrTrp: 0.469 ± 0.281
1.565TyrTyr: 1.565 ± 0.498
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 28 proteins (6392 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski