Amino acid dipeptide frequency for Changjiang tombus-like virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, that is, how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be expected at a frequency of 0.25% (2.5‰). Since this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and underrepresented ones are marked in blue, for better visibility.
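The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by Proteome-pI: the function names, the one-letter alphabet with X standing in for Xaa, and the choice to count pairs within each protein separately (never across protein boundaries) are all assumptions made for the example.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (assumption)

# Uniform expectation for the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return the frequency (in per mille) of all 441 ordered dipeptides."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the standard alphabet to X (assumption).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Count overlapping pairs inside this protein only.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


def classify(freq_per_mille):
    """Label a dipeptide relative to the uniform 2.5 per mille expectation."""
    if freq_per_mille > UNIFORM_PER_MILLE:
        return "over"
    if freq_per_mille < UNIFORM_PER_MILLE:
        return "under"
    return "as expected"


# Hypothetical toy input, not the real proteome sequences.
toy_proteins = ["AAAC", "CA"]
freqs = dipeptide_per_mille(toy_proteins)
print(freqs["AA"], classify(freqs["AA"]))
```

Whether the red and blue marking on the page uses exactly this uniform threshold is not stated beyond the text above, so the `classify` helper should be read as one plausible interpretation.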

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
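As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and then into an approximate number of occurrences. The total of 1074 dipeptides is an assumption inferred from the table itself (the smallest non-zero value, 0.931‰, matches 1/1074); the page does not state the denominator, and the function names are hypothetical.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def approx_count(value_per_mille, total_dipeptides):
    """Estimate how many times a dipeptide occurred, given the total pair count."""
    return round(per_mille_to_fraction(value_per_mille) * total_dipeptides)


# Assumed total number of counted dipeptides (inferred, not given by the source).
ASSUMED_TOTAL = 1074

print(per_mille_to_fraction(14.898))        # AlaAla as a fraction
print(approx_count(14.898, ASSUMED_TOTAL))  # AlaAla as an approximate count
print(approx_count(0.931, ASSUMED_TOTAL))   # smallest non-zero table value
```

Under this assumption, every non-zero value in the table is a whole multiple of about 0.931‰, which is what one would expect from integer counts in such a small proteome.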

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
14.898AlaAla: 14.898 ± 4.024
0.931AlaCys: 0.931 ± 0.814
4.655AlaAsp: 4.655 ± 1.444
3.724AlaGlu: 3.724 ± 1.299
0.931AlaPhe: 0.931 ± 0.65
10.242AlaGly: 10.242 ± 4.429
1.862AlaHis: 1.862 ± 1.628
6.518AlaIle: 6.518 ± 0.208
7.449AlaLys: 7.449 ± 1.176
12.104AlaLeu: 12.104 ± 2.421
1.862AlaMet: 1.862 ± 1.752
3.724AlaAsn: 3.724 ± 0.27
9.311AlaPro: 9.311 ± 1.681
1.862AlaGln: 1.862 ± 1.936
4.655AlaArg: 4.655 ± 1.7
10.242AlaSer: 10.242 ± 4.133
3.724AlaThr: 3.724 ± 2.32
3.724AlaVal: 3.724 ± 0.27
1.862AlaTrp: 1.862 ± 1.301
5.587AlaTyr: 5.587 ± 1.561
0.0AlaXaa: 0.0 ± 0.0
Cys
1.862CysAla: 1.862 ± 1.301
0.0CysCys: 0.0 ± 0.0
0.931CysAsp: 0.931 ± 0.65
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
2.793CysLeu: 2.793 ± 1.385
0.0CysMet: 0.0 ± 0.0
1.862CysAsn: 1.862 ± 1.046
0.0CysPro: 0.0 ± 0.0
0.931CysGln: 0.931 ± 0.65
1.862CysArg: 1.862 ± 1.301
0.0CysSer: 0.0 ± 0.0
0.931CysThr: 0.931 ± 0.65
0.931CysVal: 0.931 ± 0.65
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.724AspAla: 3.724 ± 1.299
0.931AspCys: 0.931 ± 0.65
0.931AspAsp: 0.931 ± 0.65
0.931AspGlu: 0.931 ± 0.968
1.862AspPhe: 1.862 ± 0.712
6.518AspGly: 6.518 ± 1.336
2.793AspHis: 2.793 ± 1.036
1.862AspIle: 1.862 ± 0.712
0.931AspLys: 0.931 ± 0.968
0.931AspLeu: 0.931 ± 0.968
3.724AspMet: 3.724 ± 1.553
0.931AspAsn: 0.931 ± 0.968
3.724AspPro: 3.724 ± 1.654
0.931AspGln: 0.931 ± 0.65
0.0AspArg: 0.0 ± 0.0
0.931AspSer: 0.931 ± 0.65
0.931AspThr: 0.931 ± 0.65
1.862AspVal: 1.862 ± 1.046
0.0AspTrp: 0.0 ± 0.0
0.931AspTyr: 0.931 ± 0.814
0.0AspXaa: 0.0 ± 0.0
Glu
3.724GluAla: 3.724 ± 1.553
0.0GluCys: 0.0 ± 0.0
1.862GluAsp: 1.862 ± 1.046
3.724GluGlu: 3.724 ± 1.527
2.793GluPhe: 2.793 ± 1.095
0.0GluGly: 0.0 ± 0.0
2.793GluHis: 2.793 ± 1.951
0.931GluIle: 0.931 ± 0.65
0.0GluLys: 0.0 ± 0.0
6.518GluLeu: 6.518 ± 2.407
3.724GluMet: 3.724 ± 1.076
0.0GluAsn: 0.0 ± 0.0
0.0GluPro: 0.0 ± 0.0
2.793GluGln: 2.793 ± 2.905
4.655GluArg: 4.655 ± 0.912
2.793GluSer: 2.793 ± 1.618
2.793GluThr: 2.793 ± 0.4
3.724GluVal: 3.724 ± 2.555
2.793GluTrp: 2.793 ± 1.844
1.862GluTyr: 1.862 ± 0.712
0.0GluXaa: 0.0 ± 0.0
Phe
1.862PheAla: 1.862 ± 0.763
0.931PheCys: 0.931 ± 0.65
0.931PheAsp: 0.931 ± 0.65
2.793PheGlu: 2.793 ± 0.4
0.0PhePhe: 0.0 ± 0.0
5.587PheGly: 5.587 ± 0.897
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
0.931PheLys: 0.931 ± 0.65
2.793PheLeu: 2.793 ± 0.4
0.0PheMet: 0.0 ± 0.0
1.862PheAsn: 1.862 ± 1.301
0.0PhePro: 0.0 ± 0.0
2.793PheGln: 2.793 ± 1.095
2.793PheArg: 2.793 ± 1.095
2.793PheSer: 2.793 ± 1.605
3.724PheThr: 3.724 ± 1.527
4.655PheVal: 4.655 ± 0.912
0.931PheTrp: 0.931 ± 0.65
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
5.587GlyAla: 5.587 ± 0.8
0.931GlyCys: 0.931 ± 0.65
8.38GlyAsp: 8.38 ± 1.201
2.793GlyGlu: 2.793 ± 1.618
2.793GlyPhe: 2.793 ± 1.951
8.38GlyGly: 8.38 ± 2.376
0.931GlyHis: 0.931 ± 0.65
3.724GlyIle: 3.724 ± 2.157
3.724GlyLys: 3.724 ± 1.425
6.518GlyLeu: 6.518 ± 1.628
0.931GlyMet: 0.931 ± 0.62
4.655GlyAsn: 4.655 ± 2.046
4.655GlyPro: 4.655 ± 0.912
2.793GlyGln: 2.793 ± 0.4
1.862GlyArg: 1.862 ± 0.712
7.449GlySer: 7.449 ± 0.541
4.655GlyThr: 4.655 ± 1.444
10.242GlyVal: 10.242 ± 3.801
2.793GlyTrp: 2.793 ± 0.4
4.655GlyTyr: 4.655 ± 2.337
0.0GlyXaa: 0.0 ± 0.0
His
1.862HisAla: 1.862 ± 0.712
0.0HisCys: 0.0 ± 0.0
0.0HisAsp: 0.0 ± 0.0
0.931HisGlu: 0.931 ± 0.968
0.0HisPhe: 0.0 ± 0.0
1.862HisGly: 1.862 ± 1.301
0.0HisHis: 0.0 ± 0.0
1.862HisIle: 1.862 ± 0.763
0.931HisLys: 0.931 ± 0.65
1.862HisLeu: 1.862 ± 0.712
1.862HisMet: 1.862 ± 0.763
1.862HisAsn: 1.862 ± 0.763
0.931HisPro: 0.931 ± 0.65
0.931HisGln: 0.931 ± 0.814
2.793HisArg: 2.793 ± 1.036
1.862HisSer: 1.862 ± 1.301
0.0HisThr: 0.0 ± 0.0
1.862HisVal: 1.862 ± 1.301
0.0HisTrp: 0.0 ± 0.0
0.931HisTyr: 0.931 ± 0.968
0.0HisXaa: 0.0 ± 0.0
Ile
4.655IleAla: 4.655 ± 2.256
0.0IleCys: 0.0 ± 0.0
0.0IleAsp: 0.0 ± 0.0
3.724IleGlu: 3.724 ± 2.602
0.931IlePhe: 0.931 ± 0.814
3.724IleGly: 3.724 ± 0.27
0.0IleHis: 0.0 ± 0.0
1.862IleIle: 1.862 ± 1.301
1.862IleLys: 1.862 ± 0.763
0.0IleLeu: 0.0 ± 0.0
1.862IleMet: 1.862 ± 0.712
1.862IleAsn: 1.862 ± 1.301
1.862IlePro: 1.862 ± 1.046
1.862IleGln: 1.862 ± 0.763
3.724IleArg: 3.724 ± 1.425
3.724IleSer: 3.724 ± 1.425
0.931IleThr: 0.931 ± 0.65
2.793IleVal: 2.793 ± 1.385
0.931IleTrp: 0.931 ± 0.968
1.862IleTyr: 1.862 ± 0.712
0.0IleXaa: 0.0 ± 0.0
Lys
3.724LysAla: 3.724 ± 1.299
0.931LysCys: 0.931 ± 0.65
1.862LysAsp: 1.862 ± 0.763
0.0LysGlu: 0.0 ± 0.0
1.862LysPhe: 1.862 ± 0.763
3.724LysGly: 3.724 ± 1.425
0.0LysHis: 0.0 ± 0.0
0.0LysIle: 0.0 ± 0.0
0.0LysLys: 0.0 ± 0.0
1.862LysLeu: 1.862 ± 1.936
0.931LysMet: 0.931 ± 0.65
0.0LysAsn: 0.0 ± 0.0
4.655LysPro: 4.655 ± 0.631
0.931LysGln: 0.931 ± 0.65
1.862LysArg: 1.862 ± 0.712
1.862LysSer: 1.862 ± 1.301
3.724LysThr: 3.724 ± 1.527
1.862LysVal: 1.862 ± 1.301
1.862LysTrp: 1.862 ± 0.712
1.862LysTyr: 1.862 ± 1.046
0.0LysXaa: 0.0 ± 0.0
Leu
7.449LeuAla: 7.449 ± 1.843
0.0LeuCys: 0.0 ± 0.0
2.793LeuAsp: 2.793 ± 1.618
3.724LeuGlu: 3.724 ± 1.654
2.793LeuPhe: 2.793 ± 1.618
10.242LeuGly: 10.242 ± 2.594
1.862LeuHis: 1.862 ± 1.301
3.724LeuIle: 3.724 ± 1.654
3.724LeuLys: 3.724 ± 1.299
6.518LeuLeu: 6.518 ± 1.267
0.0LeuMet: 0.0 ± 0.0
3.724LeuAsn: 3.724 ± 2.157
1.862LeuPro: 1.862 ± 1.046
4.655LeuGln: 4.655 ± 0.912
7.449LeuArg: 7.449 ± 2.861
6.518LeuSer: 6.518 ± 2.489
3.724LeuThr: 3.724 ± 2.32
7.449LeuVal: 7.449 ± 2.062
1.862LeuTrp: 1.862 ± 1.936
0.0LeuTyr: 0.0 ± 0.0
0.0LeuXaa: 0.0 ± 0.0
Met
2.793MetAla: 2.793 ± 0.4
1.862MetCys: 1.862 ± 1.301
0.0MetAsp: 0.0 ± 0.0
0.0MetGlu: 0.0 ± 0.0
0.0MetPhe: 0.0 ± 0.0
4.655MetGly: 4.655 ± 0.912
0.0MetHis: 0.0 ± 0.0
0.931MetIle: 0.931 ± 0.968
0.931MetLys: 0.931 ± 0.65
0.0MetLeu: 0.0 ± 0.0
0.0MetMet: 0.0 ± 0.0
2.793MetAsn: 2.793 ± 0.4
0.931MetPro: 0.931 ± 0.814
0.0MetGln: 0.0 ± 0.0
4.655MetArg: 4.655 ± 0.907
1.862MetSer: 1.862 ± 1.301
0.931MetThr: 0.931 ± 0.65
2.793MetVal: 2.793 ± 1.095
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
8.38AsnAla: 8.38 ± 1.581
0.931AsnCys: 0.931 ± 0.65
0.931AsnAsp: 0.931 ± 0.65
0.0AsnGlu: 0.0 ± 0.0
1.862AsnPhe: 1.862 ± 0.763
2.793AsnGly: 2.793 ± 1.385
0.931AsnHis: 0.931 ± 0.65
0.931AsnIle: 0.931 ± 0.65
0.931AsnLys: 0.931 ± 0.65
1.862AsnLeu: 1.862 ± 1.628
0.0AsnMet: 0.0 ± 0.0
3.724AsnAsn: 3.724 ± 0.27
0.0AsnPro: 0.0 ± 0.0
0.0AsnGln: 0.0 ± 0.0
2.793AsnArg: 2.793 ± 1.605
1.862AsnSer: 1.862 ± 0.712
4.655AsnThr: 4.655 ± 1.728
2.793AsnVal: 2.793 ± 1.618
0.931AsnTrp: 0.931 ± 0.814
0.931AsnTyr: 0.931 ± 0.65
0.0AsnXaa: 0.0 ± 0.0
Pro
4.655ProAla: 4.655 ± 2.952
0.931ProCys: 0.931 ± 0.814
2.793ProAsp: 2.793 ± 1.951
0.931ProGlu: 0.931 ± 0.814
1.862ProPhe: 1.862 ± 0.763
3.724ProGly: 3.724 ± 2.32
0.931ProHis: 0.931 ± 0.65
2.793ProIle: 2.793 ± 1.844
0.931ProLys: 0.931 ± 0.968
5.587ProLeu: 5.587 ± 1.857
0.931ProMet: 0.931 ± 0.968
0.931ProAsn: 0.931 ± 0.814
2.793ProPro: 2.793 ± 1.036
0.931ProGln: 0.931 ± 0.65
6.518ProArg: 6.518 ± 1.277
8.38ProSer: 8.38 ± 5.531
5.587ProThr: 5.587 ± 3.22
4.655ProVal: 4.655 ± 2.262
0.931ProTrp: 0.931 ± 0.65
0.931ProTyr: 0.931 ± 0.65
0.0ProXaa: 0.0 ± 0.0
Gln
8.38GlnAla: 8.38 ± 1.178
0.931GlnCys: 0.931 ± 0.968
0.0GlnAsp: 0.0 ± 0.0
2.793GlnGlu: 2.793 ± 0.4
1.862GlnPhe: 1.862 ± 0.712
2.793GlnGly: 2.793 ± 0.4
0.931GlnHis: 0.931 ± 0.65
0.0GlnIle: 0.0 ± 0.0
0.0GlnLys: 0.0 ± 0.0
2.793GlnLeu: 2.793 ± 1.618
0.0GlnMet: 0.0 ± 0.0
0.0GlnAsn: 0.0 ± 0.0
2.793GlnPro: 2.793 ± 1.036
0.931GlnGln: 0.931 ± 0.65
3.724GlnArg: 3.724 ± 1.031
0.0GlnSer: 0.0 ± 0.0
1.862GlnThr: 1.862 ± 0.712
3.724GlnVal: 3.724 ± 2.555
0.931GlnTrp: 0.931 ± 0.65
0.931GlnTyr: 0.931 ± 0.65
0.0GlnXaa: 0.0 ± 0.0
Arg
6.518ArgAla: 6.518 ± 1.284
0.931ArgCys: 0.931 ± 0.814
4.655ArgAsp: 4.655 ± 0.912
4.655ArgGlu: 4.655 ± 2.144
4.655ArgPhe: 4.655 ± 0.631
4.655ArgGly: 4.655 ± 2.144
0.931ArgHis: 0.931 ± 0.968
5.587ArgIle: 5.587 ± 2.889
2.793ArgLys: 2.793 ± 1.605
5.587ArgLeu: 5.587 ± 0.897
3.724ArgMet: 3.724 ± 1.425
0.931ArgAsn: 0.931 ± 0.65
2.793ArgPro: 2.793 ± 1.618
3.724ArgGln: 3.724 ± 1.299
7.449ArgArg: 7.449 ± 2.861
6.518ArgSer: 6.518 ± 1.336
1.862ArgThr: 1.862 ± 1.936
5.587ArgVal: 5.587 ± 0.76
3.724ArgTrp: 3.724 ± 0.27
2.793ArgTyr: 2.793 ± 1.951
0.0ArgXaa: 0.0 ± 0.0
Ser
7.449SerAla: 7.449 ± 2.956
0.931SerCys: 0.931 ± 0.65
0.931SerAsp: 0.931 ± 0.814
0.931SerGlu: 0.931 ± 0.814
3.724SerPhe: 3.724 ± 0.27
7.449SerGly: 7.449 ± 2.012
3.724SerHis: 3.724 ± 0.27
3.724SerIle: 3.724 ± 1.299
1.862SerLys: 1.862 ± 0.712
4.655SerLeu: 4.655 ± 2.046
0.931SerMet: 0.931 ± 0.814
2.793SerAsn: 2.793 ± 1.605
6.518SerPro: 6.518 ± 3.078
2.793SerGln: 2.793 ± 0.4
9.311SerArg: 9.311 ± 1.261
6.518SerSer: 6.518 ± 3.425
3.724SerThr: 3.724 ± 2.091
4.655SerVal: 4.655 ± 1.728
1.862SerTrp: 1.862 ± 0.763
0.931SerTyr: 0.931 ± 0.65
0.0SerXaa: 0.0 ± 0.0
Thr
9.311ThrAla: 9.311 ± 1.261
0.931ThrCys: 0.931 ± 0.65
0.931ThrAsp: 0.931 ± 0.814
6.518ThrGlu: 6.518 ± 3.122
0.931ThrPhe: 0.931 ± 0.968
1.862ThrGly: 1.862 ± 0.712
0.931ThrHis: 0.931 ± 0.968
0.0ThrIle: 0.0 ± 0.0
0.0ThrLys: 0.0 ± 0.0
7.449ThrLeu: 7.449 ± 0.78
0.931ThrMet: 0.931 ± 0.65
0.0ThrAsn: 0.0 ± 0.0
4.655ThrPro: 4.655 ± 1.444
1.862ThrGln: 1.862 ± 0.712
4.655ThrArg: 4.655 ± 0.912
6.518ThrSer: 6.518 ± 2.618
10.242ThrThr: 10.242 ± 3.334
2.793ThrVal: 2.793 ± 1.618
0.931ThrTrp: 0.931 ± 0.968
0.931ThrTyr: 0.931 ± 0.814
0.0ThrXaa: 0.0 ± 0.0
Val
7.449ValAla: 7.449 ± 2.012
0.0ValCys: 0.0 ± 0.0
0.931ValAsp: 0.931 ± 0.65
4.655ValGlu: 4.655 ± 1.7
4.655ValPhe: 4.655 ± 0.912
6.518ValGly: 6.518 ± 1.277
3.724ValHis: 3.724 ± 1.527
2.793ValIle: 2.793 ± 1.385
4.655ValLys: 4.655 ± 2.144
4.655ValLeu: 4.655 ± 2.144
1.862ValMet: 1.862 ± 1.301
1.862ValAsn: 1.862 ± 0.712
7.449ValPro: 7.449 ± 2.956
1.862ValGln: 1.862 ± 0.712
3.724ValArg: 3.724 ± 1.299
4.655ValSer: 4.655 ± 2.046
5.587ValThr: 5.587 ± 1.431
1.862ValVal: 1.862 ± 0.763
0.931ValTrp: 0.931 ± 0.814
1.862ValTyr: 1.862 ± 0.712
0.0ValXaa: 0.0 ± 0.0
Trp
3.724TrpAla: 3.724 ± 0.27
0.0TrpCys: 0.0 ± 0.0
0.931TrpAsp: 0.931 ± 0.65
1.862TrpGlu: 1.862 ± 0.763
0.931TrpPhe: 0.931 ± 0.65
1.862TrpGly: 1.862 ± 0.763
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
2.793TrpLeu: 2.793 ± 1.844
0.931TrpMet: 0.931 ± 0.968
0.931TrpAsn: 0.931 ± 0.814
0.931TrpPro: 0.931 ± 0.814
0.931TrpGln: 0.931 ± 0.65
3.724TrpArg: 3.724 ± 1.299
0.931TrpSer: 0.931 ± 0.968
1.862TrpThr: 1.862 ± 0.712
1.862TrpVal: 1.862 ± 0.712
0.931TrpTrp: 0.931 ± 0.968
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.793TyrAla: 2.793 ± 1.095
0.0TyrCys: 0.0 ± 0.0
0.931TyrAsp: 0.931 ± 0.968
2.793TyrGlu: 2.793 ± 1.385
0.931TyrPhe: 0.931 ± 0.65
1.862TyrGly: 1.862 ± 0.763
0.0TyrHis: 0.0 ± 0.0
0.931TyrIle: 0.931 ± 0.65
1.862TyrLys: 1.862 ± 1.301
2.793TyrLeu: 2.793 ± 1.951
0.0TyrMet: 0.0 ± 0.0
2.793TyrAsn: 2.793 ± 1.951
1.862TyrPro: 1.862 ± 1.936
1.862TyrGln: 1.862 ± 0.763
1.862TyrArg: 1.862 ± 0.763
0.0TyrSer: 0.0 ± 0.0
0.931TyrThr: 0.931 ± 0.814
1.862TyrVal: 1.862 ± 1.628
0.931TyrTrp: 0.931 ± 0.814
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1075 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
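A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the Proteome-pI implementation: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread is reported as the sample standard deviation. The page does not say which spread measure the ± values use, and all function names here are hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(proteins, pair):
    """Frequency of one dipeptide (in per mille) over a list of sequences."""
    counts = Counter(
        seq[i:i + 2] for seq in proteins for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Estimate the error of a dipeptide frequency by resampling proteins."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_replicates):
        # Draw as many proteins as in the original set, with replacement.
        resample = rng.choices(proteins, k=len(proteins))
        estimates.append(per_mille(resample, pair))
    # Sample standard deviation across replicates (assumed spread measure).
    return statistics.stdev(estimates)


# Hypothetical toy sequences, not the real proteome.
toy_proteins = ["AAAA", "ACAC", "CCCC"]
print(per_mille(toy_proteins, "AA"), "±", bootstrap_error(toy_proteins, "AA"))
```

With only 3 proteins, as in this proteome, there are very few distinct resamples, so errors estimated this way are coarse and can be large relative to the values themselves.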

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski