Amino acid dipepetide frequency for Teviot virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.686AlaAla: 4.686 ± 2.215
1.222AlaCys: 1.222 ± 0.417
3.26AlaAsp: 3.26 ± 0.496
2.852AlaGlu: 2.852 ± 0.718
2.037AlaPhe: 2.037 ± 0.738
2.445AlaGly: 2.445 ± 1.09
1.426AlaHis: 1.426 ± 0.47
5.297AlaIle: 5.297 ± 1.293
3.871AlaLys: 3.871 ± 0.996
5.705AlaLeu: 5.705 ± 1.478
1.834AlaMet: 1.834 ± 0.604
2.241AlaAsn: 2.241 ± 0.947
1.834AlaPro: 1.834 ± 0.446
3.871AlaGln: 3.871 ± 1.656
4.075AlaArg: 4.075 ± 1.219
5.705AlaSer: 5.705 ± 1.005
4.89AlaThr: 4.89 ± 1.964
2.649AlaVal: 2.649 ± 0.951
1.019AlaTrp: 1.019 ± 0.474
1.426AlaTyr: 1.426 ± 0.311
0.0AlaXaa: 0.0 ± 0.0
Cys
1.426CysAla: 1.426 ± 0.457
0.407CysCys: 0.407 ± 0.274
1.63CysAsp: 1.63 ± 0.454
1.019CysGlu: 1.019 ± 0.282
1.426CysPhe: 1.426 ± 0.445
1.222CysGly: 1.222 ± 0.391
0.815CysHis: 0.815 ± 0.373
1.63CysIle: 1.63 ± 0.544
1.63CysLys: 1.63 ± 0.356
2.037CysLeu: 2.037 ± 0.722
0.407CysMet: 0.407 ± 0.389
1.222CysAsn: 1.222 ± 0.576
1.222CysPro: 1.222 ± 0.576
0.815CysGln: 0.815 ± 0.231
0.815CysArg: 0.815 ± 0.373
2.852CysSer: 2.852 ± 0.808
0.611CysThr: 0.611 ± 0.491
0.815CysVal: 0.815 ± 0.441
0.0CysTrp: 0.0 ± 0.0
0.815CysTyr: 0.815 ± 0.493
0.0CysXaa: 0.0 ± 0.0
Asp
2.852AspAla: 2.852 ± 0.997
1.019AspCys: 1.019 ± 0.305
2.852AspAsp: 2.852 ± 0.847
3.464AspGlu: 3.464 ± 1.064
1.222AspPhe: 1.222 ± 0.425
1.63AspGly: 1.63 ± 0.454
1.019AspHis: 1.019 ± 0.269
4.075AspIle: 4.075 ± 0.719
3.26AspLys: 3.26 ± 0.619
6.112AspLeu: 6.112 ± 2.68
1.834AspMet: 1.834 ± 0.959
2.852AspAsn: 2.852 ± 0.6
4.075AspPro: 4.075 ± 1.265
1.426AspGln: 1.426 ± 0.405
1.834AspArg: 1.834 ± 0.457
3.667AspSer: 3.667 ± 1.069
2.852AspThr: 2.852 ± 0.719
2.852AspVal: 2.852 ± 0.653
0.815AspTrp: 0.815 ± 0.28
1.834AspTyr: 1.834 ± 0.616
0.0AspXaa: 0.0 ± 0.0
Glu
1.63GluAla: 1.63 ± 0.675
1.019GluCys: 1.019 ± 0.572
2.445GluAsp: 2.445 ± 0.887
3.871GluGlu: 3.871 ± 1.513
2.037GluPhe: 2.037 ± 0.678
2.445GluGly: 2.445 ± 0.718
0.407GluHis: 0.407 ± 0.414
4.075GluIle: 4.075 ± 0.737
4.482GluLys: 4.482 ± 1.24
6.927GluLeu: 6.927 ± 0.965
0.815GluMet: 0.815 ± 0.478
2.445GluAsn: 2.445 ± 0.688
0.815GluPro: 0.815 ± 0.258
4.075GluGln: 4.075 ± 1.037
2.037GluArg: 2.037 ± 0.638
4.279GluSer: 4.279 ± 0.461
3.26GluThr: 3.26 ± 0.871
3.26GluVal: 3.26 ± 0.912
0.815GluTrp: 0.815 ± 0.306
0.815GluTyr: 0.815 ± 0.414
0.0GluXaa: 0.0 ± 0.0
Phe
2.852PheAla: 2.852 ± 0.735
1.426PheCys: 1.426 ± 0.478
1.834PheAsp: 1.834 ± 0.541
1.834PheGlu: 1.834 ± 0.681
1.834PhePhe: 1.834 ± 0.56
1.834PheGly: 1.834 ± 1.102
0.611PheHis: 0.611 ± 0.336
3.056PheIle: 3.056 ± 0.726
1.63PheLys: 1.63 ± 0.987
4.482PheLeu: 4.482 ± 1.14
1.019PheMet: 1.019 ± 0.306
1.426PheAsn: 1.426 ± 0.562
2.037PhePro: 2.037 ± 0.759
1.834PheGln: 1.834 ± 0.538
1.426PheArg: 1.426 ± 0.705
3.056PheSer: 3.056 ± 0.956
1.222PheThr: 1.222 ± 0.627
0.815PheVal: 0.815 ± 0.389
0.0PheTrp: 0.0 ± 0.0
0.815PheTyr: 0.815 ± 0.379
0.0PheXaa: 0.0 ± 0.0
Gly
3.667GlyAla: 3.667 ± 1.394
0.815GlyCys: 0.815 ± 0.414
2.852GlyAsp: 2.852 ± 0.949
3.26GlyGlu: 3.26 ± 0.942
1.834GlyPhe: 1.834 ± 0.438
2.649GlyGly: 2.649 ± 0.73
1.019GlyHis: 1.019 ± 0.367
4.482GlyIle: 4.482 ± 1.096
2.037GlyLys: 2.037 ± 0.707
5.297GlyLeu: 5.297 ± 0.913
1.019GlyMet: 1.019 ± 0.448
2.037GlyAsn: 2.037 ± 0.739
2.445GlyPro: 2.445 ± 1.638
2.037GlyGln: 2.037 ± 0.721
3.056GlyArg: 3.056 ± 1.085
4.279GlySer: 4.279 ± 1.015
2.037GlyThr: 2.037 ± 0.904
2.649GlyVal: 2.649 ± 0.56
0.407GlyTrp: 0.407 ± 0.409
1.222GlyTyr: 1.222 ± 0.425
0.0GlyXaa: 0.0 ± 0.0
His
1.426HisAla: 1.426 ± 0.484
0.815HisCys: 0.815 ± 0.467
1.222HisAsp: 1.222 ± 0.547
1.222HisGlu: 1.222 ± 0.408
0.407HisPhe: 0.407 ± 0.414
0.407HisGly: 0.407 ± 0.424
1.222HisHis: 1.222 ± 0.639
1.426HisIle: 1.426 ± 0.758
0.407HisLys: 0.407 ± 0.364
2.649HisLeu: 2.649 ± 0.967
0.407HisMet: 0.407 ± 0.207
0.407HisAsn: 0.407 ± 0.207
1.222HisPro: 1.222 ± 0.458
1.834HisGln: 1.834 ± 0.799
1.019HisArg: 1.019 ± 0.553
0.204HisSer: 0.204 ± 0.137
0.815HisThr: 0.815 ± 0.389
0.815HisVal: 0.815 ± 0.414
0.204HisTrp: 0.204 ± 0.137
0.204HisTyr: 0.204 ± 0.312
0.0HisXaa: 0.0 ± 0.0
Ile
3.871IleAla: 3.871 ± 1.357
1.426IleCys: 1.426 ± 0.303
2.037IleAsp: 2.037 ± 0.547
3.464IleGlu: 3.464 ± 0.95
2.037IlePhe: 2.037 ± 0.672
3.056IleGly: 3.056 ± 1.106
1.222IleHis: 1.222 ± 0.504
3.871IleIle: 3.871 ± 1.158
3.667IleLys: 3.667 ± 0.962
7.946IleLeu: 7.946 ± 1.563
2.241IleMet: 2.241 ± 0.373
5.501IleAsn: 5.501 ± 2.203
4.075IlePro: 4.075 ± 1.098
2.852IleGln: 2.852 ± 1.271
3.871IleArg: 3.871 ± 0.386
7.131IleSer: 7.131 ± 1.353
4.075IleThr: 4.075 ± 1.529
5.094IleVal: 5.094 ± 1.414
1.019IleTrp: 1.019 ± 0.463
3.056IleTyr: 3.056 ± 0.462
0.0IleXaa: 0.0 ± 0.0
Lys
1.63LysAla: 1.63 ± 0.592
1.019LysCys: 1.019 ± 0.359
1.426LysAsp: 1.426 ± 0.45
4.482LysGlu: 4.482 ± 1.481
1.426LysPhe: 1.426 ± 0.547
3.464LysGly: 3.464 ± 1.453
1.222LysHis: 1.222 ± 0.572
2.649LysIle: 2.649 ± 0.99
1.834LysLys: 1.834 ± 0.644
5.705LysLeu: 5.705 ± 1.333
0.407LysMet: 0.407 ± 0.623
2.445LysAsn: 2.445 ± 0.397
2.649LysPro: 2.649 ± 1.822
1.834LysGln: 1.834 ± 0.552
2.649LysArg: 2.649 ± 0.497
5.094LysSer: 5.094 ± 0.836
2.852LysThr: 2.852 ± 0.797
3.056LysVal: 3.056 ± 0.779
0.407LysTrp: 0.407 ± 0.29
1.834LysTyr: 1.834 ± 0.614
0.0LysXaa: 0.0 ± 0.0
Leu
5.705LeuAla: 5.705 ± 1.196
3.056LeuCys: 3.056 ± 0.924
8.761LeuAsp: 8.761 ± 2.362
4.075LeuGlu: 4.075 ± 1.315
4.279LeuPhe: 4.279 ± 1.335
4.686LeuGly: 4.686 ± 1.001
1.426LeuHis: 1.426 ± 0.625
7.742LeuIle: 7.742 ± 0.939
5.501LeuLys: 5.501 ± 1.313
8.965LeuLeu: 8.965 ± 2.589
2.241LeuMet: 2.241 ± 0.759
6.316LeuAsn: 6.316 ± 1.193
4.686LeuPro: 4.686 ± 1.161
4.686LeuGln: 4.686 ± 0.937
4.482LeuArg: 4.482 ± 1.104
13.651LeuSer: 13.651 ± 2.594
9.372LeuThr: 9.372 ± 2.512
5.094LeuVal: 5.094 ± 1.01
1.63LeuTrp: 1.63 ± 0.454
3.26LeuTyr: 3.26 ± 0.627
0.0LeuXaa: 0.0 ± 0.0
Met
2.037MetAla: 2.037 ± 0.839
0.407MetCys: 0.407 ± 0.21
0.815MetAsp: 0.815 ± 0.705
1.222MetGlu: 1.222 ± 0.365
0.815MetPhe: 0.815 ± 0.231
1.222MetGly: 1.222 ± 0.625
0.407MetHis: 0.407 ± 0.279
1.426MetIle: 1.426 ± 0.517
0.611MetLys: 0.611 ± 0.374
2.037MetLeu: 2.037 ± 0.706
1.834MetMet: 1.834 ± 1.085
1.834MetAsn: 1.834 ± 0.669
1.019MetPro: 1.019 ± 0.353
0.611MetGln: 0.611 ± 0.555
2.037MetArg: 2.037 ± 0.619
2.852MetSer: 2.852 ± 0.688
1.426MetThr: 1.426 ± 0.693
1.834MetVal: 1.834 ± 0.676
0.611MetTrp: 0.611 ± 0.336
0.407MetTyr: 0.407 ± 0.24
0.0MetXaa: 0.0 ± 0.0
Asn
2.852AsnAla: 2.852 ± 0.962
1.426AsnCys: 1.426 ± 1.048
2.445AsnAsp: 2.445 ± 0.957
1.63AsnGlu: 1.63 ± 0.425
1.426AsnPhe: 1.426 ± 0.562
3.26AsnGly: 3.26 ± 0.589
1.426AsnHis: 1.426 ± 0.646
2.852AsnIle: 2.852 ± 0.763
2.037AsnLys: 2.037 ± 0.319
7.946AsnLeu: 7.946 ± 1.747
1.834AsnMet: 1.834 ± 0.301
1.834AsnAsn: 1.834 ± 0.517
4.075AsnPro: 4.075 ± 1.125
3.26AsnGln: 3.26 ± 0.692
2.241AsnArg: 2.241 ± 0.606
4.482AsnSer: 4.482 ± 0.745
2.445AsnThr: 2.445 ± 0.559
1.63AsnVal: 1.63 ± 0.462
0.815AsnTrp: 0.815 ± 0.549
1.834AsnTyr: 1.834 ± 0.641
0.0AsnXaa: 0.0 ± 0.0
Pro
3.26ProAla: 3.26 ± 1.367
0.815ProCys: 0.815 ± 0.309
2.241ProAsp: 2.241 ± 0.541
3.26ProGlu: 3.26 ± 0.743
2.241ProPhe: 2.241 ± 1.479
3.056ProGly: 3.056 ± 1.471
0.815ProHis: 0.815 ± 0.339
3.667ProIle: 3.667 ± 1.247
3.26ProLys: 3.26 ± 1.073
6.52ProLeu: 6.52 ± 1.383
0.611ProMet: 0.611 ± 0.273
2.241ProAsn: 2.241 ± 0.525
4.279ProPro: 4.279 ± 1.416
1.834ProGln: 1.834 ± 0.653
2.037ProArg: 2.037 ± 0.586
4.89ProSer: 4.89 ± 1.761
3.464ProThr: 3.464 ± 1.053
2.445ProVal: 2.445 ± 1.056
0.407ProTrp: 0.407 ± 0.207
2.445ProTyr: 2.445 ± 0.522
0.0ProXaa: 0.0 ± 0.0
Gln
2.445GlnAla: 2.445 ± 1.297
1.019GlnCys: 1.019 ± 0.397
2.852GlnAsp: 2.852 ± 0.862
1.426GlnGlu: 1.426 ± 0.45
1.834GlnPhe: 1.834 ± 0.454
2.649GlnGly: 2.649 ± 1.223
0.407GlnHis: 0.407 ± 0.363
3.871GlnIle: 3.871 ± 1.398
1.834GlnLys: 1.834 ± 0.565
6.316GlnLeu: 6.316 ± 1.243
1.019GlnMet: 1.019 ± 0.516
3.667GlnAsn: 3.667 ± 1.015
1.834GlnPro: 1.834 ± 0.702
2.649GlnGln: 2.649 ± 0.678
1.426GlnArg: 1.426 ± 0.311
3.464GlnSer: 3.464 ± 1.455
2.852GlnThr: 2.852 ± 0.69
3.26GlnVal: 3.26 ± 0.516
0.204GlnTrp: 0.204 ± 0.269
1.63GlnTyr: 1.63 ± 0.638
0.0GlnXaa: 0.0 ± 0.0
Arg
2.445ArgAla: 2.445 ± 1.113
0.815ArgCys: 0.815 ± 0.437
2.445ArgAsp: 2.445 ± 0.748
2.241ArgGlu: 2.241 ± 0.536
1.63ArgPhe: 1.63 ± 0.614
2.649ArgGly: 2.649 ± 0.631
1.222ArgHis: 1.222 ± 0.407
3.464ArgIle: 3.464 ± 0.332
2.445ArgLys: 2.445 ± 0.929
6.724ArgLeu: 6.724 ± 1.055
0.815ArgMet: 0.815 ± 0.419
2.649ArgAsn: 2.649 ± 0.722
3.056ArgPro: 3.056 ± 1.118
2.241ArgGln: 2.241 ± 1.418
4.075ArgArg: 4.075 ± 0.868
5.094ArgSer: 5.094 ± 0.685
2.649ArgThr: 2.649 ± 0.8
2.649ArgVal: 2.649 ± 0.634
0.204ArgTrp: 0.204 ± 0.225
1.222ArgTyr: 1.222 ± 0.458
0.0ArgXaa: 0.0 ± 0.0
Ser
8.15SerAla: 8.15 ± 1.98
3.26SerCys: 3.26 ± 0.644
5.297SerAsp: 5.297 ± 0.467
4.89SerGlu: 4.89 ± 1.037
1.222SerPhe: 1.222 ± 0.367
4.279SerGly: 4.279 ± 1.049
1.222SerHis: 1.222 ± 0.425
5.094SerIle: 5.094 ± 2.0
3.26SerLys: 3.26 ± 0.853
9.576SerLeu: 9.576 ± 1.132
2.241SerMet: 2.241 ± 0.756
2.852SerAsn: 2.852 ± 0.53
5.501SerPro: 5.501 ± 1.623
3.056SerGln: 3.056 ± 0.373
4.482SerArg: 4.482 ± 1.54
9.576SerSer: 9.576 ± 2.313
5.094SerThr: 5.094 ± 1.568
8.354SerVal: 8.354 ± 1.109
0.815SerTrp: 0.815 ± 0.437
3.26SerTyr: 3.26 ± 0.797
0.0SerXaa: 0.0 ± 0.0
Thr
5.297ThrAla: 5.297 ± 1.341
1.019ThrCys: 1.019 ± 0.437
2.445ThrAsp: 2.445 ± 0.759
3.871ThrGlu: 3.871 ± 0.866
3.26ThrPhe: 3.26 ± 0.633
3.667ThrGly: 3.667 ± 0.882
1.019ThrHis: 1.019 ± 0.426
3.667ThrIle: 3.667 ± 1.199
1.834ThrLys: 1.834 ± 0.862
4.89ThrLeu: 4.89 ± 0.715
2.037ThrMet: 2.037 ± 0.548
3.871ThrAsn: 3.871 ± 1.264
3.667ThrPro: 3.667 ± 0.447
3.056ThrGln: 3.056 ± 0.869
2.649ThrArg: 2.649 ± 0.415
3.056ThrSer: 3.056 ± 1.37
4.482ThrThr: 4.482 ± 1.215
2.649ThrVal: 2.649 ± 0.808
1.019ThrTrp: 1.019 ± 0.367
2.852ThrTyr: 2.852 ± 1.161
0.0ThrXaa: 0.0 ± 0.0
Val
2.649ValAla: 2.649 ± 1.266
1.019ValCys: 1.019 ± 0.531
3.667ValAsp: 3.667 ± 0.847
2.037ValGlu: 2.037 ± 0.392
2.241ValPhe: 2.241 ± 0.463
2.649ValGly: 2.649 ± 0.763
1.222ValHis: 1.222 ± 0.72
5.297ValIle: 5.297 ± 1.006
3.056ValLys: 3.056 ± 1.114
3.871ValLeu: 3.871 ± 1.163
1.834ValMet: 1.834 ± 0.454
3.26ValAsn: 3.26 ± 0.882
2.852ValPro: 2.852 ± 0.623
2.649ValGln: 2.649 ± 0.814
3.871ValArg: 3.871 ± 1.363
3.871ValSer: 3.871 ± 0.731
4.075ValThr: 4.075 ± 0.666
4.482ValVal: 4.482 ± 1.117
0.407ValTrp: 0.407 ± 0.29
1.426ValTyr: 1.426 ± 0.626
0.0ValXaa: 0.0 ± 0.0
Trp
0.611TrpAla: 0.611 ± 0.33
0.204TrpCys: 0.204 ± 0.238
0.204TrpAsp: 0.204 ± 0.137
0.611TrpGlu: 0.611 ± 0.273
0.815TrpPhe: 0.815 ± 0.309
1.019TrpGly: 1.019 ± 0.463
0.204TrpHis: 0.204 ± 0.257
1.019TrpIle: 1.019 ± 0.411
0.611TrpLys: 0.611 ± 0.31
0.815TrpLeu: 0.815 ± 0.48
0.0TrpMet: 0.0 ± 0.0
0.815TrpAsn: 0.815 ± 0.309
1.222TrpPro: 1.222 ± 0.407
0.204TrpGln: 0.204 ± 0.137
0.407TrpArg: 0.407 ± 0.274
1.222TrpSer: 1.222 ± 0.916
0.204TrpThr: 0.204 ± 0.137
0.407TrpVal: 0.407 ± 0.21
0.0TrpTrp: 0.0 ± 0.0
0.204TrpTyr: 0.204 ± 0.137
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.852TyrAla: 2.852 ± 1.027
0.611TyrCys: 0.611 ± 0.268
0.815TyrAsp: 0.815 ± 0.309
1.222TyrGlu: 1.222 ± 0.365
1.019TyrPhe: 1.019 ± 0.865
0.815TyrGly: 0.815 ± 0.309
0.0TyrHis: 0.0 ± 0.0
2.852TyrIle: 2.852 ± 0.552
0.815TyrLys: 0.815 ± 0.231
5.094TyrLeu: 5.094 ± 1.439
0.815TyrMet: 0.815 ± 0.381
1.834TyrAsn: 1.834 ± 1.235
1.222TyrPro: 1.222 ± 0.432
1.834TyrGln: 1.834 ± 0.56
2.241TyrArg: 2.241 ± 0.548
3.056TyrSer: 3.056 ± 0.786
1.63TyrThr: 1.63 ± 0.538
1.63TyrVal: 1.63 ± 0.478
0.0TyrTrp: 0.0 ± 0.0
1.834TyrTyr: 1.834 ± 0.759
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 7 proteins (4909 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski