Amino acid dipepetide frequency for Tobacco virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.36AlaAla: 7.36 ± 1.139
0.491AlaCys: 0.491 ± 0.499
3.925AlaAsp: 3.925 ± 1.367
3.925AlaGlu: 3.925 ± 1.428
1.963AlaPhe: 1.963 ± 0.746
3.435AlaGly: 3.435 ± 0.694
0.981AlaHis: 0.981 ± 0.587
4.416AlaIle: 4.416 ± 0.863
4.416AlaLys: 4.416 ± 1.448
2.944AlaLeu: 2.944 ± 1.107
1.963AlaMet: 1.963 ± 0.978
3.435AlaAsn: 3.435 ± 0.982
5.397AlaPro: 5.397 ± 1.455
3.435AlaGln: 3.435 ± 1.139
3.435AlaArg: 3.435 ± 1.635
4.907AlaSer: 4.907 ± 2.114
6.379AlaThr: 6.379 ± 1.909
6.379AlaVal: 6.379 ± 1.833
0.491AlaTrp: 0.491 ± 0.557
3.925AlaTyr: 3.925 ± 1.059
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.491CysAsp: 0.491 ± 0.557
1.963CysGlu: 1.963 ± 0.881
1.472CysPhe: 1.472 ± 0.718
1.472CysGly: 1.472 ± 1.189
0.491CysHis: 0.491 ± 0.393
1.472CysIle: 1.472 ± 0.717
0.981CysLys: 0.981 ± 0.758
2.453CysLeu: 2.453 ± 1.108
0.0CysMet: 0.0 ± 0.0
1.472CysAsn: 1.472 ± 1.063
0.491CysPro: 0.491 ± 0.66
0.491CysGln: 0.491 ± 0.499
0.491CysArg: 0.491 ± 0.66
1.963CysSer: 1.963 ± 1.071
0.0CysThr: 0.0 ± 0.0
0.491CysVal: 0.491 ± 0.354
0.0CysTrp: 0.0 ± 0.0
0.491CysTyr: 0.491 ± 0.499
0.0CysXaa: 0.0 ± 0.0
Asp
5.888AspAla: 5.888 ± 1.558
0.981AspCys: 0.981 ± 0.851
2.453AspAsp: 2.453 ± 0.968
4.416AspGlu: 4.416 ± 0.863
1.963AspPhe: 1.963 ± 0.507
4.416AspGly: 4.416 ± 1.948
0.0AspHis: 0.0 ± 0.0
0.981AspIle: 0.981 ± 0.463
1.963AspLys: 1.963 ± 1.417
0.491AspLeu: 0.491 ± 0.393
1.472AspMet: 1.472 ± 0.68
1.472AspAsn: 1.472 ± 0.593
2.453AspPro: 2.453 ± 1.139
1.963AspGln: 1.963 ± 0.834
0.981AspArg: 0.981 ± 0.785
2.453AspSer: 2.453 ± 1.263
0.491AspThr: 0.491 ± 0.499
1.472AspVal: 1.472 ± 0.865
1.963AspTrp: 1.963 ± 1.113
1.472AspTyr: 1.472 ± 0.536
0.0AspXaa: 0.0 ± 0.0
Glu
6.379GluAla: 6.379 ± 0.792
1.472GluCys: 1.472 ± 1.189
2.944GluAsp: 2.944 ± 1.399
5.888GluGlu: 5.888 ± 1.878
2.453GluPhe: 2.453 ± 1.434
6.379GluGly: 6.379 ± 1.725
2.453GluHis: 2.453 ± 0.75
1.963GluIle: 1.963 ± 1.056
3.925GluLys: 3.925 ± 1.537
6.869GluLeu: 6.869 ± 2.338
0.491GluMet: 0.491 ± 0.354
3.925GluAsn: 3.925 ± 1.717
1.963GluPro: 1.963 ± 0.844
1.472GluGln: 1.472 ± 0.536
2.944GluArg: 2.944 ± 1.39
2.453GluSer: 2.453 ± 0.929
1.963GluThr: 1.963 ± 0.56
3.435GluVal: 3.435 ± 1.473
0.981GluTrp: 0.981 ± 0.657
3.435GluTyr: 3.435 ± 0.939
0.0GluXaa: 0.0 ± 0.0
Phe
0.491PheAla: 0.491 ± 0.598
0.981PheCys: 0.981 ± 0.536
1.472PheAsp: 1.472 ± 0.68
1.963PheGlu: 1.963 ± 0.62
0.981PhePhe: 0.981 ± 0.463
1.963PheGly: 1.963 ± 0.582
0.981PheHis: 0.981 ± 0.587
0.981PheIle: 0.981 ± 0.709
2.944PheLys: 2.944 ± 0.81
2.944PheLeu: 2.944 ± 0.847
0.0PheMet: 0.0 ± 0.0
0.981PheAsn: 0.981 ± 0.587
0.981PhePro: 0.981 ± 0.837
2.944PheGln: 2.944 ± 0.912
1.963PheArg: 1.963 ± 0.774
3.925PheSer: 3.925 ± 1.428
2.944PheThr: 2.944 ± 1.647
4.416PheVal: 4.416 ± 1.926
0.981PheTrp: 0.981 ± 0.536
0.981PheTyr: 0.981 ± 0.999
0.0PheXaa: 0.0 ± 0.0
Gly
4.907GlyAla: 4.907 ± 1.5
1.472GlyCys: 1.472 ± 0.718
1.963GlyAsp: 1.963 ± 0.62
2.453GlyGlu: 2.453 ± 0.549
3.435GlyPhe: 3.435 ± 1.538
4.416GlyGly: 4.416 ± 1.778
1.963GlyHis: 1.963 ± 1.174
0.491GlyIle: 0.491 ± 0.66
1.963GlyLys: 1.963 ± 0.73
4.907GlyLeu: 4.907 ± 2.291
0.981GlyMet: 0.981 ± 0.652
5.397GlyAsn: 5.397 ± 2.686
2.944GlyPro: 2.944 ± 1.032
1.963GlyGln: 1.963 ± 0.858
5.888GlyArg: 5.888 ± 1.306
7.851GlySer: 7.851 ± 2.438
5.397GlyThr: 5.397 ± 1.625
4.907GlyVal: 4.907 ± 1.582
0.491GlyTrp: 0.491 ± 0.499
3.435GlyTyr: 3.435 ± 1.229
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
1.472HisCys: 1.472 ± 0.756
0.981HisAsp: 0.981 ± 0.657
1.963HisGlu: 1.963 ± 0.582
0.981HisPhe: 0.981 ± 0.785
0.981HisGly: 0.981 ± 0.557
0.491HisHis: 0.491 ± 0.354
0.981HisIle: 0.981 ± 0.722
1.472HisLys: 1.472 ± 1.033
1.963HisLeu: 1.963 ± 0.693
0.0HisMet: 0.0 ± 0.489
0.491HisAsn: 0.491 ± 0.499
2.453HisPro: 2.453 ± 0.907
0.0HisGln: 0.0 ± 0.0
0.0HisArg: 0.0 ± 0.0
1.963HisSer: 1.963 ± 1.398
0.981HisThr: 0.981 ± 0.587
0.491HisVal: 0.491 ± 0.393
0.491HisTrp: 0.491 ± 0.499
0.491HisTyr: 0.491 ± 0.499
0.0HisXaa: 0.0 ± 0.0
Ile
5.397IleAla: 5.397 ± 1.448
0.0IleCys: 0.0 ± 0.0
1.472IleAsp: 1.472 ± 0.617
1.963IleGlu: 1.963 ± 0.927
1.472IlePhe: 1.472 ± 0.717
2.944IleGly: 2.944 ± 1.35
0.491IleHis: 0.491 ± 0.557
2.453IleIle: 2.453 ± 1.165
1.963IleLys: 1.963 ± 0.916
3.925IleLeu: 3.925 ± 1.705
0.981IleMet: 0.981 ± 0.536
1.472IleAsn: 1.472 ± 0.593
3.925IlePro: 3.925 ± 1.553
3.435IleGln: 3.435 ± 1.492
3.435IleArg: 3.435 ± 0.797
4.907IleSer: 4.907 ± 1.814
3.435IleThr: 3.435 ± 1.506
0.491IleVal: 0.491 ± 0.354
0.981IleTrp: 0.981 ± 0.709
1.963IleTyr: 1.963 ± 1.014
0.0IleXaa: 0.0 ± 0.0
Lys
4.907LysAla: 4.907 ± 1.971
0.491LysCys: 0.491 ± 0.557
2.944LysAsp: 2.944 ± 1.309
3.435LysGlu: 3.435 ± 0.797
1.963LysPhe: 1.963 ± 0.73
4.907LysGly: 4.907 ± 1.615
0.0LysHis: 0.0 ± 0.0
2.944LysIle: 2.944 ± 0.832
3.435LysLys: 3.435 ± 1.07
6.379LysLeu: 6.379 ± 1.238
2.944LysMet: 2.944 ± 0.817
2.453LysAsn: 2.453 ± 0.549
5.397LysPro: 5.397 ± 2.056
1.963LysGln: 1.963 ± 1.071
3.925LysArg: 3.925 ± 0.901
3.925LysSer: 3.925 ± 1.406
3.925LysThr: 3.925 ± 1.423
2.453LysVal: 2.453 ± 1.302
0.981LysTrp: 0.981 ± 0.722
1.963LysTyr: 1.963 ± 0.845
0.0LysXaa: 0.0 ± 0.0
Leu
4.416LeuAla: 4.416 ± 2.072
3.925LeuCys: 3.925 ± 1.637
2.944LeuAsp: 2.944 ± 1.477
8.342LeuGlu: 8.342 ± 2.602
3.435LeuPhe: 3.435 ± 1.529
4.416LeuGly: 4.416 ± 1.157
1.472LeuHis: 1.472 ± 1.189
3.925LeuIle: 3.925 ± 1.273
4.416LeuLys: 4.416 ± 0.589
7.851LeuLeu: 7.851 ± 2.325
0.491LeuMet: 0.491 ± 0.598
1.472LeuAsn: 1.472 ± 0.536
2.944LeuPro: 2.944 ± 1.213
4.907LeuGln: 4.907 ± 1.774
3.925LeuArg: 3.925 ± 1.643
4.907LeuSer: 4.907 ± 1.755
6.379LeuThr: 6.379 ± 1.686
5.397LeuVal: 5.397 ± 1.494
2.944LeuTrp: 2.944 ± 0.601
2.453LeuTyr: 2.453 ± 0.923
0.0LeuXaa: 0.0 ± 0.0
Met
1.963MetAla: 1.963 ± 0.507
0.491MetCys: 0.491 ± 0.354
0.491MetAsp: 0.491 ± 0.354
2.944MetGlu: 2.944 ± 0.813
0.491MetPhe: 0.491 ± 0.499
0.491MetGly: 0.491 ± 0.557
0.0MetHis: 0.0 ± 0.0
0.981MetIle: 0.981 ± 0.649
0.981MetLys: 0.981 ± 0.463
2.453MetLeu: 2.453 ± 0.968
0.0MetMet: 0.0 ± 0.0
0.491MetAsn: 0.491 ± 0.354
0.0MetPro: 0.0 ± 0.0
0.491MetGln: 0.491 ± 0.393
0.981MetArg: 0.981 ± 0.657
3.435MetSer: 3.435 ± 1.95
0.491MetThr: 0.491 ± 0.557
1.963MetVal: 1.963 ± 0.897
0.0MetTrp: 0.0 ± 0.0
0.491MetTyr: 0.491 ± 0.393
0.0MetXaa: 0.0 ± 0.0
Asn
2.944AsnAla: 2.944 ± 1.373
0.491AsnCys: 0.491 ± 0.354
0.981AsnAsp: 0.981 ± 0.785
0.0AsnGlu: 0.0 ± 0.0
1.472AsnPhe: 1.472 ± 0.68
3.435AsnGly: 3.435 ± 2.09
0.0AsnHis: 0.0 ± 0.0
0.981AsnIle: 0.981 ± 0.587
3.925AsnLys: 3.925 ± 0.921
3.435AsnLeu: 3.435 ± 1.633
0.0AsnMet: 0.0 ± 0.0
1.472AsnAsn: 1.472 ± 0.622
3.925AsnPro: 3.925 ± 1.158
1.472AsnGln: 1.472 ± 0.867
2.944AsnArg: 2.944 ± 1.109
5.397AsnSer: 5.397 ± 1.264
2.944AsnThr: 2.944 ± 1.474
2.944AsnVal: 2.944 ± 1.081
1.472AsnTrp: 1.472 ± 0.725
3.925AsnTyr: 3.925 ± 0.889
0.0AsnXaa: 0.0 ± 0.0
Pro
4.416ProAla: 4.416 ± 1.231
0.0ProCys: 0.0 ± 0.0
0.981ProAsp: 0.981 ± 0.463
4.907ProGlu: 4.907 ± 1.715
0.981ProPhe: 0.981 ± 0.683
2.944ProGly: 2.944 ± 1.004
1.963ProHis: 1.963 ± 1.145
3.435ProIle: 3.435 ± 0.647
3.925ProLys: 3.925 ± 1.197
3.435ProLeu: 3.435 ± 1.211
1.472ProMet: 1.472 ± 0.725
1.963ProAsn: 1.963 ± 1.043
7.851ProPro: 7.851 ± 3.173
1.963ProGln: 1.963 ± 1.043
5.397ProArg: 5.397 ± 2.056
8.832ProSer: 8.832 ± 1.384
4.907ProThr: 4.907 ± 1.559
2.944ProVal: 2.944 ± 0.601
0.0ProTrp: 0.0 ± 0.0
0.0ProTyr: 0.0 ± 0.0
0.0ProXaa: 0.0 ± 0.0
Gln
4.907GlnAla: 4.907 ± 1.246
0.981GlnCys: 0.981 ± 0.758
1.472GlnAsp: 1.472 ± 1.017
2.944GlnGlu: 2.944 ± 1.221
0.491GlnPhe: 0.491 ± 0.66
2.453GlnGly: 2.453 ± 0.695
0.981GlnHis: 0.981 ± 0.709
2.944GlnIle: 2.944 ± 0.849
1.963GlnLys: 1.963 ± 0.507
2.453GlnLeu: 2.453 ± 0.549
0.0GlnMet: 0.0 ± 0.0
2.453GlnAsn: 2.453 ± 0.513
3.435GlnPro: 3.435 ± 2.088
1.472GlnGln: 1.472 ± 0.756
1.963GlnArg: 1.963 ± 1.053
3.435GlnSer: 3.435 ± 1.291
4.416GlnThr: 4.416 ± 1.157
0.981GlnVal: 0.981 ± 0.557
1.472GlnTrp: 1.472 ± 0.94
0.491GlnTyr: 0.491 ± 0.499
0.0GlnXaa: 0.0 ± 0.0
Arg
2.453ArgAla: 2.453 ± 0.84
0.981ArgCys: 0.981 ± 0.999
1.963ArgAsp: 1.963 ± 1.148
4.416ArgGlu: 4.416 ± 1.477
0.491ArgPhe: 0.491 ± 0.393
6.379ArgGly: 6.379 ± 1.993
1.963ArgHis: 1.963 ± 1.368
3.435ArgIle: 3.435 ± 0.587
3.925ArgLys: 3.925 ± 1.489
4.416ArgLeu: 4.416 ± 2.451
2.453ArgMet: 2.453 ± 1.478
1.963ArgAsn: 1.963 ± 0.682
2.453ArgPro: 2.453 ± 1.765
1.472ArgGln: 1.472 ± 0.845
7.851ArgArg: 7.851 ± 7.713
4.416ArgSer: 4.416 ± 1.077
1.963ArgThr: 1.963 ± 0.936
4.907ArgVal: 4.907 ± 1.04
0.0ArgTrp: 0.0 ± 0.0
0.981ArgTyr: 0.981 ± 1.114
0.0ArgXaa: 0.0 ± 0.0
Ser
6.869SerAla: 6.869 ± 2.889
1.472SerCys: 1.472 ± 0.94
4.907SerAsp: 4.907 ± 1.984
3.925SerGlu: 3.925 ± 1.595
4.416SerPhe: 4.416 ± 1.566
5.397SerGly: 5.397 ± 0.352
1.472SerHis: 1.472 ± 0.717
6.869SerIle: 6.869 ± 0.829
7.36SerLys: 7.36 ± 2.448
7.851SerLeu: 7.851 ± 1.224
2.944SerMet: 2.944 ± 0.925
2.944SerAsn: 2.944 ± 0.862
4.416SerPro: 4.416 ± 1.123
3.925SerGln: 3.925 ± 1.329
5.397SerArg: 5.397 ± 2.877
11.286SerSer: 11.286 ± 3.767
4.907SerThr: 4.907 ± 0.808
3.925SerVal: 3.925 ± 1.899
2.453SerTrp: 2.453 ± 1.219
2.944SerTyr: 2.944 ± 0.715
0.0SerXaa: 0.0 ± 0.0
Thr
3.925ThrAla: 3.925 ± 1.441
0.0ThrCys: 0.0 ± 0.0
3.925ThrAsp: 3.925 ± 1.747
2.944ThrGlu: 2.944 ± 0.925
2.453ThrPhe: 2.453 ± 1.232
5.397ThrGly: 5.397 ± 1.531
0.491ThrHis: 0.491 ± 0.499
4.416ThrIle: 4.416 ± 1.627
3.925ThrLys: 3.925 ± 1.402
6.869ThrLeu: 6.869 ± 1.802
0.491ThrMet: 0.491 ± 0.598
1.472ThrAsn: 1.472 ± 0.471
4.907ThrPro: 4.907 ± 0.933
1.963ThrGln: 1.963 ± 0.762
0.981ThrArg: 0.981 ± 0.709
7.36ThrSer: 7.36 ± 1.488
4.416ThrThr: 4.416 ± 1.31
3.435ThrVal: 3.435 ± 1.101
0.491ThrTrp: 0.491 ± 0.354
2.944ThrTyr: 2.944 ± 0.892
0.0ThrXaa: 0.0 ± 0.0
Val
3.435ValAla: 3.435 ± 2.005
0.491ValCys: 0.491 ± 0.499
1.963ValAsp: 1.963 ± 1.208
3.435ValGlu: 3.435 ± 1.091
3.435ValPhe: 3.435 ± 1.424
3.435ValGly: 3.435 ± 1.213
0.0ValHis: 0.0 ± 0.0
2.453ValIle: 2.453 ± 0.923
3.925ValLys: 3.925 ± 0.479
4.416ValLeu: 4.416 ± 1.946
0.981ValMet: 0.981 ± 0.851
2.944ValAsn: 2.944 ± 1.232
4.416ValPro: 4.416 ± 0.323
3.435ValGln: 3.435 ± 1.625
2.453ValArg: 2.453 ± 0.637
6.869ValSer: 6.869 ± 1.097
2.944ValThr: 2.944 ± 1.309
3.435ValVal: 3.435 ± 1.855
0.0ValTrp: 0.0 ± 0.0
1.963ValTyr: 1.963 ± 1.129
0.0ValXaa: 0.0 ± 0.0
Trp
1.472TrpAla: 1.472 ± 0.622
0.0TrpCys: 0.0 ± 0.0
0.491TrpAsp: 0.491 ± 0.393
0.491TrpGlu: 0.491 ± 0.354
0.491TrpPhe: 0.491 ± 0.354
0.981TrpGly: 0.981 ± 0.999
0.981TrpHis: 0.981 ± 0.758
0.491TrpIle: 0.491 ± 0.499
0.981TrpLys: 0.981 ± 0.649
1.963TrpLeu: 1.963 ± 1.101
0.981TrpMet: 0.981 ± 0.536
0.491TrpAsn: 0.491 ± 0.393
0.491TrpPro: 0.491 ± 0.393
0.491TrpGln: 0.491 ± 0.354
1.472TrpArg: 1.472 ± 0.717
1.963TrpSer: 1.963 ± 0.526
2.453TrpThr: 2.453 ± 0.953
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
0.491TrpTyr: 0.491 ± 0.393
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.472TyrAla: 1.472 ± 1.063
0.491TyrCys: 0.491 ± 0.354
0.981TyrAsp: 0.981 ± 0.557
1.472TyrGlu: 1.472 ± 0.708
0.981TyrPhe: 0.981 ± 0.683
0.981TyrGly: 0.981 ± 0.463
1.963TyrHis: 1.963 ± 0.879
0.491TyrIle: 0.491 ± 0.393
2.944TyrLys: 2.944 ± 1.218
2.944TyrLeu: 2.944 ± 1.234
0.491TyrMet: 0.491 ± 0.393
5.397TyrAsn: 5.397 ± 2.688
1.472TyrPro: 1.472 ± 0.718
2.453TyrGln: 2.453 ± 0.884
2.453TyrArg: 2.453 ± 0.92
3.435TyrSer: 3.435 ± 1.289
1.472TyrThr: 1.472 ± 0.865
1.963TyrVal: 1.963 ± 0.986
0.981TyrTrp: 0.981 ± 0.463
1.472TyrTyr: 1.472 ± 0.782
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (2039 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski