Amino acid dipeptide frequency for Sanxia tombus-like virus 7

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each dipeptide would be present at a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
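The counting described above can be sketched in Python as follows. This is a minimal illustration, not the Proteome-pI implementation: the function names, the one-letter residue codes, and the toy sequences are assumptions, and the sketch counts overlapping residue pairs within each protein only, since the page does not say how protein boundaries are handled.

```python
from collections import Counter

# Uniform expectation from the text: 1 of 400 dipeptides = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return dipeptide frequencies in per mille for a list of sequences.

    Assumption: overlapping pairs are counted within each protein only.
    """
    counts = Counter()
    for seq in proteins:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq_per_mille):
    """Label a frequency relative to the uniform expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "expected"


# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_per_mille(["MAAG", "AAK"])
for pair, value in sorted(freqs.items()):
    print(pair, round(value, 3), classify(value))
```

With such short toy sequences every observed pair is far above 2.5‰; on a real proteome most of the 400 pairs would be present and the labels become informative.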

All values are given in per mille (‰); to obtain fractions, multiply them by 10^-3.
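As a small worked example of the unit conversion, the snippet below converts two values from the table into fractions and compares them with the uniform expectation of 0.25% stated above, which equals 2.5‰. The helper name is hypothetical and only serves the illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


# Values copied from the table (in per mille).
ala_ala = 11.621
cys_cys = 0.0

# Uniform expectation: 1/400 as a fraction, i.e. 0.25% or 2.5 per mille.
uniform_fraction = 1 / 400

ala_ala_fraction = per_mille_to_fraction(ala_ala)
cys_cys_fraction = per_mille_to_fraction(cys_cys)

print(ala_ala_fraction > uniform_fraction)  # AlaAla is above the expectation
print(cys_cys_fraction < uniform_fraction)  # CysCys is below the expectation
```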

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
11.621AlaAla: 11.621 ± 3.022
0.0AlaCys: 0.0 ± 0.0
7.554AlaAsp: 7.554 ± 1.482
4.648AlaGlu: 4.648 ± 1.84
1.743AlaPhe: 1.743 ± 0.945
11.04AlaGly: 11.04 ± 1.82
2.324AlaHis: 2.324 ± 1.272
4.648AlaIle: 4.648 ± 1.32
8.716AlaLys: 8.716 ± 2.431
7.554AlaLeu: 7.554 ± 1.509
2.905AlaMet: 2.905 ± 0.323
4.067AlaAsn: 4.067 ± 0.522
6.392AlaPro: 6.392 ± 2.283
5.23AlaGln: 5.23 ± 1.551
9.878AlaArg: 9.878 ± 0.66
5.811AlaSer: 5.811 ± 0.878
5.811AlaThr: 5.811 ± 1.58
6.973AlaVal: 6.973 ± 1.43
4.648AlaTrp: 4.648 ± 1.46
1.162AlaTyr: 1.162 ± 0.551
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.581CysAsp: 0.581 ± 0.463
1.743CysGlu: 1.743 ± 0.802
0.581CysPhe: 0.581 ± 0.463
0.581CysGly: 0.581 ± 0.549
0.581CysHis: 0.581 ± 0.426
1.162CysIle: 1.162 ± 0.498
0.0CysLys: 0.0 ± 0.0
1.162CysLeu: 1.162 ± 0.925
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
1.162CysPro: 1.162 ± 0.619
2.324CysGln: 2.324 ± 0.995
1.162CysArg: 1.162 ± 0.498
1.162CysSer: 1.162 ± 0.7
1.743CysThr: 1.743 ± 1.076
1.743CysVal: 1.743 ± 0.487
0.581CysTrp: 0.581 ± 0.426
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.648AspAla: 4.648 ± 1.188
1.743AspCys: 1.743 ± 0.861
5.23AspAsp: 5.23 ± 3.227
1.743AspGlu: 1.743 ± 0.891
0.581AspPhe: 0.581 ± 0.426
3.486AspGly: 3.486 ± 2.151
2.324AspHis: 2.324 ± 1.29
1.162AspIle: 1.162 ± 0.925
1.743AspLys: 1.743 ± 1.197
2.324AspLeu: 2.324 ± 1.239
0.0AspMet: 0.0 ± 0.0
0.581AspAsn: 0.581 ± 0.426
4.067AspPro: 4.067 ± 2.013
3.486AspGln: 3.486 ± 1.615
1.162AspArg: 1.162 ± 0.619
2.905AspSer: 2.905 ± 1.53
4.067AspThr: 4.067 ± 1.628
1.743AspVal: 1.743 ± 0.487
0.581AspTrp: 0.581 ± 0.426
0.581AspTyr: 0.581 ± 0.549
0.0AspXaa: 0.0 ± 0.0
Glu
4.648GluAla: 4.648 ± 1.5
0.0GluCys: 0.0 ± 0.0
1.743GluAsp: 1.743 ± 0.891
2.324GluGlu: 2.324 ± 0.208
1.162GluPhe: 1.162 ± 0.619
4.067GluGly: 4.067 ± 1.424
1.743GluHis: 1.743 ± 0.861
0.581GluIle: 0.581 ± 0.426
1.743GluLys: 1.743 ± 0.884
5.23GluLeu: 5.23 ± 1.039
1.743GluMet: 1.743 ± 0.861
2.905GluAsn: 2.905 ± 1.788
6.973GluPro: 6.973 ± 2.148
0.0GluGln: 0.0 ± 0.0
5.811GluArg: 5.811 ± 1.226
4.067GluSer: 4.067 ± 1.531
1.162GluThr: 1.162 ± 0.852
5.23GluVal: 5.23 ± 1.424
0.581GluTrp: 0.581 ± 0.426
1.162GluTyr: 1.162 ± 0.619
0.0GluXaa: 0.0 ± 0.0
Phe
2.324PheAla: 2.324 ± 1.29
1.162PheCys: 1.162 ± 0.498
1.743PheAsp: 1.743 ± 0.945
0.581PheGlu: 0.581 ± 0.463
0.0PhePhe: 0.0 ± 0.0
1.162PheGly: 1.162 ± 0.498
1.162PheHis: 1.162 ± 0.925
1.162PheIle: 1.162 ± 0.619
1.743PheLys: 1.743 ± 1.014
0.581PheLeu: 0.581 ± 0.426
0.0PheMet: 0.0 ± 0.0
0.581PheAsn: 0.581 ± 0.463
1.162PhePro: 1.162 ± 1.099
1.162PheGln: 1.162 ± 1.099
1.162PheArg: 1.162 ± 0.925
1.162PheSer: 1.162 ± 0.551
2.324PheThr: 2.324 ± 1.354
1.743PheVal: 1.743 ± 0.494
0.0PheTrp: 0.0 ± 0.0
2.905PheTyr: 2.905 ± 1.53
0.0PheXaa: 0.0 ± 0.0
Gly
9.297GlyAla: 9.297 ± 1.055
1.743GlyCys: 1.743 ± 0.802
1.743GlyAsp: 1.743 ± 0.884
3.486GlyGlu: 3.486 ± 0.974
0.581GlyPhe: 0.581 ± 0.426
5.23GlyGly: 5.23 ± 1.608
2.324GlyHis: 2.324 ± 0.933
2.324GlyIle: 2.324 ± 0.933
4.648GlyLys: 4.648 ± 1.393
2.324GlyLeu: 2.324 ± 1.771
2.905GlyMet: 2.905 ± 0.668
4.648GlyAsn: 4.648 ± 1.1
2.905GlyPro: 2.905 ± 0.778
2.324GlyGln: 2.324 ± 0.738
7.554GlyArg: 7.554 ± 1.27
6.392GlySer: 6.392 ± 1.041
5.811GlyThr: 5.811 ± 2.848
8.135GlyVal: 8.135 ± 0.796
0.0GlyTrp: 0.0 ± 0.0
0.581GlyTyr: 0.581 ± 0.549
0.0GlyXaa: 0.0 ± 0.0
His
1.162HisAla: 1.162 ± 0.619
0.0HisCys: 0.0 ± 0.0
0.581HisAsp: 0.581 ± 0.463
0.0HisGlu: 0.0 ± 0.0
0.581HisPhe: 0.581 ± 0.463
1.162HisGly: 1.162 ± 0.852
0.581HisHis: 0.581 ± 0.426
0.0HisIle: 0.0 ± 0.0
2.324HisLys: 2.324 ± 1.851
3.486HisLeu: 3.486 ± 1.815
0.0HisMet: 0.0 ± 0.0
0.581HisAsn: 0.581 ± 0.615
2.324HisPro: 2.324 ± 1.185
1.743HisGln: 1.743 ± 0.494
1.743HisArg: 1.743 ± 0.891
1.162HisSer: 1.162 ± 0.498
2.324HisThr: 2.324 ± 0.716
3.486HisVal: 3.486 ± 0.974
0.0HisTrp: 0.0 ± 0.0
1.162HisTyr: 1.162 ± 0.619
0.0HisXaa: 0.0 ± 0.0
Ile
3.486IleAla: 3.486 ± 1.575
1.162IleCys: 1.162 ± 0.7
1.743IleAsp: 1.743 ± 0.861
1.162IleGlu: 1.162 ± 0.925
1.162IlePhe: 1.162 ± 0.619
1.162IleGly: 1.162 ± 0.551
0.581IleHis: 0.581 ± 0.463
0.0IleIle: 0.0 ± 0.0
3.486IleLys: 3.486 ± 1.524
2.324IleLeu: 2.324 ± 1.06
0.581IleMet: 0.581 ± 0.549
0.581IleAsn: 0.581 ± 0.549
0.581IlePro: 0.581 ± 0.426
1.743IleGln: 1.743 ± 0.587
4.067IleArg: 4.067 ± 1.947
1.162IleSer: 1.162 ± 0.925
2.324IleThr: 2.324 ± 0.738
2.324IleVal: 2.324 ± 0.738
0.581IleTrp: 0.581 ± 0.549
1.743IleTyr: 1.743 ± 0.861
0.0IleXaa: 0.0 ± 0.0
Lys
6.392LysAla: 6.392 ± 1.424
1.162LysCys: 1.162 ± 0.498
0.581LysAsp: 0.581 ± 0.549
3.486LysGlu: 3.486 ± 1.03
2.324LysPhe: 2.324 ± 0.793
4.067LysGly: 4.067 ± 0.924
0.581LysHis: 0.581 ± 0.615
1.743LysIle: 1.743 ± 1.176
2.324LysLys: 2.324 ± 0.999
5.23LysLeu: 5.23 ± 2.197
0.581LysMet: 0.581 ± 0.537
2.324LysAsn: 2.324 ± 0.738
2.324LysPro: 2.324 ± 1.275
1.743LysGln: 1.743 ± 1.176
5.23LysArg: 5.23 ± 1.995
2.905LysSer: 2.905 ± 0.323
2.324LysThr: 2.324 ± 1.21
2.324LysVal: 2.324 ± 0.999
0.581LysTrp: 0.581 ± 0.615
2.905LysTyr: 2.905 ± 0.423
0.0LysXaa: 0.0 ± 0.0
Leu
6.392LeuAla: 6.392 ± 1.454
0.581LeuCys: 0.581 ± 0.463
5.23LeuAsp: 5.23 ± 1.085
3.486LeuGlu: 3.486 ± 1.159
3.486LeuPhe: 3.486 ± 1.615
4.067LeuGly: 4.067 ± 1.294
1.743LeuHis: 1.743 ± 1.197
3.486LeuIle: 3.486 ± 1.238
1.743LeuLys: 1.743 ± 0.861
4.648LeuLeu: 4.648 ± 0.68
2.905LeuMet: 2.905 ± 2.314
4.067LeuAsn: 4.067 ± 2.133
4.648LeuPro: 4.648 ± 2.196
1.743LeuGln: 1.743 ± 1.197
8.135LeuArg: 8.135 ± 2.129
3.486LeuSer: 3.486 ± 1.527
1.743LeuThr: 1.743 ± 0.587
6.973LeuVal: 6.973 ± 1.841
0.581LeuTrp: 0.581 ± 0.463
1.743LeuTyr: 1.743 ± 1.014
0.0LeuXaa: 0.0 ± 0.0
Met
2.324MetAla: 2.324 ± 0.716
0.581MetCys: 0.581 ± 0.463
0.581MetAsp: 0.581 ± 0.463
1.743MetGlu: 1.743 ± 1.388
0.0MetPhe: 0.0 ± 0.0
1.162MetGly: 1.162 ± 0.605
0.581MetHis: 0.581 ± 0.463
0.0MetIle: 0.0 ± 0.0
0.581MetLys: 0.581 ± 0.615
1.162MetLeu: 1.162 ± 0.498
0.581MetMet: 0.581 ± 0.463
0.581MetAsn: 0.581 ± 0.615
2.324MetPro: 2.324 ± 1.102
0.581MetGln: 0.581 ± 0.549
1.162MetArg: 1.162 ± 0.925
2.905MetSer: 2.905 ± 1.099
2.324MetThr: 2.324 ± 1.536
1.743MetVal: 1.743 ± 1.014
0.581MetTrp: 0.581 ± 0.463
0.581MetTyr: 0.581 ± 0.426
0.0MetXaa: 0.0 ± 0.0
Asn
8.716AsnAla: 8.716 ± 3.069
0.581AsnCys: 0.581 ± 0.463
0.0AsnAsp: 0.0 ± 0.0
0.581AsnGlu: 0.581 ± 0.463
0.581AsnPhe: 0.581 ± 0.463
5.811AsnGly: 5.811 ± 1.321
0.581AsnHis: 0.581 ± 0.426
0.0AsnIle: 0.0 ± 0.0
4.067AsnLys: 4.067 ± 2.792
3.486AsnLeu: 3.486 ± 1.524
0.581AsnMet: 0.581 ± 0.463
1.162AsnAsn: 1.162 ± 0.636
4.067AsnPro: 4.067 ± 0.883
0.0AsnGln: 0.0 ± 0.0
2.905AsnArg: 2.905 ± 0.931
2.324AsnSer: 2.324 ± 0.208
1.162AsnThr: 1.162 ± 0.551
1.162AsnVal: 1.162 ± 0.551
2.324AsnTrp: 2.324 ± 0.774
1.743AsnTyr: 1.743 ± 0.861
0.0AsnXaa: 0.0 ± 0.0
Pro
7.554ProAla: 7.554 ± 1.248
0.0ProCys: 0.0 ± 0.0
2.324ProAsp: 2.324 ± 1.592
5.811ProGlu: 5.811 ± 1.968
0.0ProPhe: 0.0 ± 0.0
4.648ProGly: 4.648 ± 2.364
0.581ProHis: 0.581 ± 0.426
3.486ProIle: 3.486 ± 1.527
2.905ProLys: 2.905 ± 2.369
5.23ProLeu: 5.23 ± 2.197
2.324ProMet: 2.324 ± 0.793
1.743ProAsn: 1.743 ± 0.579
2.905ProPro: 2.905 ± 0.931
1.162ProGln: 1.162 ± 0.852
5.23ProArg: 5.23 ± 1.905
3.486ProSer: 3.486 ± 0.67
4.067ProThr: 4.067 ± 1.45
6.392ProVal: 6.392 ± 2.026
1.162ProTrp: 1.162 ± 0.551
1.162ProTyr: 1.162 ± 0.551
0.0ProXaa: 0.0 ± 0.0
Gln
9.878GlnAla: 9.878 ± 2.172
0.581GlnCys: 0.581 ± 0.426
1.162GlnAsp: 1.162 ± 1.099
1.743GlnGlu: 1.743 ± 1.278
0.581GlnPhe: 0.581 ± 0.549
2.324GlnGly: 2.324 ± 1.766
1.162GlnHis: 1.162 ± 0.925
1.162GlnIle: 1.162 ± 0.619
0.581GlnLys: 0.581 ± 0.463
1.743GlnLeu: 1.743 ± 1.076
0.0GlnMet: 0.0 ± 0.356
1.743GlnAsn: 1.743 ± 0.579
1.743GlnPro: 1.743 ± 0.587
4.648GlnGln: 4.648 ± 2.253
4.648GlnArg: 4.648 ± 2.165
1.743GlnSer: 1.743 ± 0.802
2.905GlnThr: 2.905 ± 1.488
2.905GlnVal: 2.905 ± 0.423
0.581GlnTrp: 0.581 ± 0.549
0.581GlnTyr: 0.581 ± 0.549
0.0GlnXaa: 0.0 ± 0.0
Arg
9.297ArgAla: 9.297 ± 1.254
2.324ArgCys: 2.324 ± 0.911
2.324ArgAsp: 2.324 ± 0.673
6.392ArgGlu: 6.392 ± 1.481
1.743ArgPhe: 1.743 ± 0.945
11.04ArgGly: 11.04 ± 0.894
1.162ArgHis: 1.162 ± 0.925
2.324ArgIle: 2.324 ± 0.793
4.067ArgLys: 4.067 ± 1.294
4.648ArgLeu: 4.648 ± 1.393
1.743ArgMet: 1.743 ± 0.861
4.067ArgAsn: 4.067 ± 1.543
3.486ArgPro: 3.486 ± 1.175
5.23ArgGln: 5.23 ± 0.337
11.621ArgArg: 11.621 ± 2.352
5.811ArgSer: 5.811 ± 2.783
4.067ArgThr: 4.067 ± 1.453
4.067ArgVal: 4.067 ± 0.756
1.743ArgTrp: 1.743 ± 0.487
2.905ArgTyr: 2.905 ± 1.328
0.0ArgXaa: 0.0 ± 0.0
Ser
8.716SerAla: 8.716 ± 2.104
1.743SerCys: 1.743 ± 0.487
1.162SerAsp: 1.162 ± 0.619
1.743SerGlu: 1.743 ± 1.278
1.162SerPhe: 1.162 ± 0.551
5.23SerGly: 5.23 ± 1.402
1.743SerHis: 1.743 ± 0.884
1.743SerIle: 1.743 ± 1.098
3.486SerLys: 3.486 ± 0.988
2.905SerLeu: 2.905 ± 1.53
0.581SerMet: 0.581 ± 0.615
3.486SerAsn: 3.486 ± 1.03
4.648SerPro: 4.648 ± 1.32
2.905SerGln: 2.905 ± 1.488
5.23SerArg: 5.23 ± 1.482
6.973SerSer: 6.973 ± 1.802
2.905SerThr: 2.905 ± 0.931
5.23SerVal: 5.23 ± 1.041
0.0SerTrp: 0.0 ± 0.0
1.743SerTyr: 1.743 ± 0.587
0.0SerXaa: 0.0 ± 0.0
Thr
5.811ThrAla: 5.811 ± 1.336
1.162ThrCys: 1.162 ± 0.925
4.067ThrAsp: 4.067 ± 2.497
2.905ThrGlu: 2.905 ± 0.423
2.324ThrPhe: 2.324 ± 1.354
4.648ThrGly: 4.648 ± 0.416
1.162ThrHis: 1.162 ± 0.852
1.162ThrIle: 1.162 ± 0.636
2.905ThrLys: 2.905 ± 0.778
4.648ThrLeu: 4.648 ± 2.078
1.743ThrMet: 1.743 ± 0.508
3.486ThrAsn: 3.486 ± 1.976
2.324ThrPro: 2.324 ± 1.181
1.743ThrGln: 1.743 ± 0.579
3.486ThrArg: 3.486 ± 1.125
2.905ThrSer: 2.905 ± 1.327
2.905ThrThr: 2.905 ± 1.085
6.392ThrVal: 6.392 ± 0.965
0.0ThrTrp: 0.0 ± 0.0
1.162ThrTyr: 1.162 ± 0.551
0.0ThrXaa: 0.0 ± 0.0
Val
8.716ValAla: 8.716 ± 2.857
1.162ValCys: 1.162 ± 0.551
4.067ValAsp: 4.067 ± 1.675
5.23ValGlu: 5.23 ± 0.859
4.648ValPhe: 4.648 ± 1.38
2.905ValGly: 2.905 ± 0.931
1.743ValHis: 1.743 ± 0.802
4.067ValIle: 4.067 ± 1.947
2.905ValLys: 2.905 ± 1.328
8.716ValLeu: 8.716 ± 0.798
0.581ValMet: 0.581 ± 0.426
2.324ValAsn: 2.324 ± 1.275
5.23ValPro: 5.23 ± 0.48
3.486ValGln: 3.486 ± 0.904
5.23ValArg: 5.23 ± 2.685
4.067ValSer: 4.067 ± 0.737
3.486ValThr: 3.486 ± 0.715
4.648ValVal: 4.648 ± 1.38
0.581ValTrp: 0.581 ± 0.549
2.324ValTyr: 2.324 ± 0.738
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
1.162TrpAsp: 1.162 ± 1.099
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.581TrpHis: 0.581 ± 0.549
0.0TrpIle: 0.0 ± 0.0
0.581TrpLys: 0.581 ± 0.463
1.162TrpLeu: 1.162 ± 0.636
0.581TrpMet: 0.581 ± 0.463
1.743TrpAsn: 1.743 ± 0.861
1.162TrpPro: 1.162 ± 0.551
0.581TrpGln: 0.581 ± 0.615
2.905TrpArg: 2.905 ± 0.817
2.324TrpSer: 2.324 ± 1.185
1.162TrpThr: 1.162 ± 0.498
1.162TrpVal: 1.162 ± 0.551
0.581TrpTrp: 0.581 ± 0.426
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.743TyrAla: 1.743 ± 1.014
0.581TyrCys: 0.581 ± 0.549
0.581TyrAsp: 0.581 ± 0.426
4.067TyrGlu: 4.067 ± 0.606
0.581TyrPhe: 0.581 ± 0.463
1.162TyrGly: 1.162 ± 0.619
1.162TyrHis: 1.162 ± 0.925
1.743TyrIle: 1.743 ± 1.076
0.581TyrLys: 0.581 ± 0.426
2.324TyrLeu: 2.324 ± 1.275
1.162TyrMet: 1.162 ± 0.959
1.162TyrAsn: 1.162 ± 0.636
1.743TyrPro: 1.743 ± 0.802
1.162TyrGln: 1.162 ± 0.551
1.743TyrArg: 1.743 ± 0.945
0.581TyrSer: 0.581 ± 0.549
2.324TyrThr: 2.324 ± 0.911
1.743TyrVal: 1.743 ± 0.487
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (1722 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
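Protein-level bootstrapping can be sketched as below. The page only states that the error comes from bootstrapping (x100) at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the error is taken as the sample standard deviation of the replicate estimates. Function names, the seed, and the toy sequences are hypothetical.

```python
import random
import statistics


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over a list of sequences."""
    hits = 0
    total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Assumed procedure: resample proteins with replacement, then take
    the standard deviation of the per-replicate frequencies."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.stdev(estimates)


# Toy input with four hypothetical sequences (one-letter codes).
toy_proteins = ["MAAG", "AAK", "MKKL", "GAAA"]
print(pair_per_mille(toy_proteins, "AA"), bootstrap_error(toy_proteins, "AA"))
print(pair_per_mille(toy_proteins, "WW"), bootstrap_error(toy_proteins, "WW"))
```

A dipeptide that occurs in none of the proteins has zero frequency in every replicate, which is consistent with the 0.0 ± 0.0 entries in the table.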

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski