Amino acid dipeptide frequency for Wenzhou tombus-like virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if we treat Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely to occur in proteins, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
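The calculation described above can be sketched in Python. This is only an illustrative sketch, not the code used to build the table: it assumes that dipeptides are counted as overlapping pairs within each protein, that every non-standard letter is mapped to X (Xaa), and that frequencies are normalised by the total number of pairs. The function names dipeptide_permille and classify are hypothetical.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table columns (Ala, Cys, Asp, ...).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(sequences, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} pooled over all sequences."""
    allowed = set(alphabet)
    counts = Counter()
    for seq in sequences:
        # Assumption: anything outside the alphabet is treated as X (Xaa).
        seq = "".join(c if c in allowed else "X" for c in seq.upper())
        # Assumption: overlapping pairs, never spanning two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {a + b: 1000.0 * counts[a + b] / total
            for a, b in product(alphabet, repeat=2)}


def classify(freq_permille, n_letters=20):
    """Compare a frequency with the uniform expectation (2.5 per mille for 20 letters)."""
    expected = 1000.0 / n_letters ** 2
    if freq_permille > expected:
        return "over"
    if freq_permille < expected:
        return "under"
    return "expected"


# Toy input, not the real proteome: the pairs are AA, AC and CA.
freqs = dipeptide_permille(["AAC", "CA"])
print(len(freqs), classify(freqs["AA"]), classify(freqs["WW"]))
```

With the 21-letter alphabet the result always has 441 entries, and a value such as the 14.51‰ reported for AlaAla is classified as "over" relative to the 2.5‰ uniform expectation.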

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
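As a small worked example of the unit conversion, the following Python sketch converts per mille values to fractions and percentages and computes the uniform expectation for a given alphabet size. The helper names are hypothetical and only serve the illustration.

```python
def permille_to_fraction(value_permille):
    """Per mille to plain fraction: multiply by 10^-3."""
    return value_permille * 1e-3


def permille_to_percent(value_permille):
    """Per mille to percent: divide by 10."""
    return value_permille / 10.0


def uniform_expectation_permille(n_letters=20):
    """Expected frequency if all n_letters**2 dipeptides were equally likely."""
    return 1000.0 / n_letters ** 2


# AlaAla is listed as 14.51 per mille, i.e. roughly 1.451 percent.
print(permille_to_fraction(14.51), permille_to_percent(14.51))
# 400 dipeptides give 2.5 per mille; 441 (with Xaa) give roughly 2.27 per mille.
print(uniform_expectation_permille(20), uniform_expectation_permille(21))
```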

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.51 ± 1.294
AlaCys: 1.814 ± 0.083
AlaAsp: 6.651 ± 0.632
AlaGlu: 8.464 ± 1.245
AlaPhe: 0.605 ± 0.299
AlaGly: 7.255 ± 3.273
AlaHis: 0.605 ± 0.681
AlaIle: 3.023 ± 0.515
AlaLys: 4.837 ± 0.431
AlaLeu: 8.464 ± 0.715
AlaMet: 4.232 ± 0.985
AlaAsn: 3.628 ± 0.166
AlaPro: 3.628 ± 0.166
AlaGln: 1.814 ± 0.897
AlaArg: 7.86 ± 0.034
AlaSer: 3.628 ± 0.166
AlaThr: 6.651 ± 0.632
AlaVal: 5.441 ± 0.25
AlaTrp: 0.605 ± 0.299
AlaTyr: 4.837 ± 0.431
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.605 ± 0.681
CysGlu: 1.209 ± 0.598
CysPhe: 0.605 ± 0.299
CysGly: 0.605 ± 0.681
CysHis: 0.0 ± 0.0
CysIle: 0.0 ± 0.0
CysLys: 0.605 ± 0.681
CysLeu: 1.814 ± 1.063
CysMet: 0.605 ± 0.681
CysAsn: 1.209 ± 0.382
CysPro: 0.0 ± 0.0
CysGln: 0.605 ± 0.299
CysArg: 0.605 ± 0.299
CysSer: 0.605 ± 0.299
CysThr: 0.605 ± 0.299
CysVal: 2.418 ± 1.196
CysTrp: 0.0 ± 0.0
CysTyr: 0.605 ± 0.299
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.837 ± 0.431
AspCys: 0.605 ± 0.299
AspAsp: 2.418 ± 1.196
AspGlu: 4.232 ± 0.847
AspPhe: 1.814 ± 1.063
AspGly: 7.255 ± 0.647
AspHis: 1.209 ± 0.382
AspIle: 2.418 ± 1.744
AspLys: 0.605 ± 0.681
AspLeu: 6.046 ± 0.931
AspMet: 1.209 ± 0.598
AspAsn: 0.605 ± 0.299
AspPro: 10.278 ± 2.141
AspGln: 0.605 ± 0.299
AspArg: 1.814 ± 0.897
AspSer: 1.209 ± 0.382
AspThr: 2.418 ± 1.196
AspVal: 4.232 ± 0.847
AspTrp: 1.209 ± 0.598
AspTyr: 1.209 ± 0.382
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.441 ± 0.25
GluCys: 1.209 ± 0.382
GluAsp: 1.814 ± 0.897
GluGlu: 4.232 ± 1.112
GluPhe: 0.605 ± 0.299
GluGly: 4.837 ± 0.431
GluHis: 2.418 ± 1.196
GluIle: 0.605 ± 0.299
GluLys: 3.023 ± 1.494
GluLeu: 9.069 ± 1.544
GluMet: 1.209 ± 0.598
GluAsn: 0.605 ± 0.299
GluPro: 3.628 ± 0.813
GluGln: 1.209 ± 0.598
GluArg: 3.628 ± 0.813
GluSer: 3.023 ± 1.445
GluThr: 1.814 ± 0.083
GluVal: 3.628 ± 1.146
GluTrp: 0.605 ± 0.299
GluTyr: 1.814 ± 0.083
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.023 ± 0.515
PheCys: 1.209 ± 0.598
PheAsp: 1.209 ± 0.598
PheGlu: 2.418 ± 0.764
PhePhe: 2.418 ± 0.216
PheGly: 0.605 ± 0.299
PheHis: 1.209 ± 0.382
PheIle: 0.605 ± 0.299
PheLys: 1.209 ± 0.598
PheLeu: 3.023 ± 0.465
PheMet: 0.0 ± 0.0
PheAsn: 1.209 ± 0.598
PhePro: 1.209 ± 1.362
PheGln: 0.0 ± 0.0
PheArg: 0.605 ± 0.299
PheSer: 1.814 ± 1.063
PheThr: 1.814 ± 1.063
PheVal: 0.0 ± 0.0
PheTrp: 0.605 ± 0.299
PheTyr: 0.605 ± 0.299
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.837 ± 0.549
GlyCys: 2.418 ± 1.744
GlyAsp: 4.837 ± 0.549
GlyGlu: 1.814 ± 0.897
GlyPhe: 1.209 ± 1.362
GlyGly: 4.837 ± 1.528
GlyHis: 0.605 ± 0.299
GlyIle: 1.209 ± 1.362
GlyLys: 3.023 ± 0.515
GlyLeu: 7.86 ± 0.946
GlyMet: 3.023 ± 0.465
GlyAsn: 3.628 ± 0.166
GlyPro: 3.628 ± 0.813
GlyGln: 3.023 ± 3.405
GlyArg: 6.651 ± 0.348
GlySer: 4.837 ± 0.549
GlyThr: 4.232 ± 1.827
GlyVal: 4.837 ± 0.431
GlyTrp: 0.605 ± 0.681
GlyTyr: 4.232 ± 0.132
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.418 ± 0.216
HisCys: 0.0 ± 0.0
HisAsp: 1.814 ± 0.083
HisGlu: 1.209 ± 0.598
HisPhe: 0.0 ± 0.0
HisGly: 0.0 ± 0.0
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 1.209 ± 0.598
HisMet: 0.605 ± 0.681
HisAsn: 1.209 ± 0.382
HisPro: 1.814 ± 0.897
HisGln: 0.0 ± 0.0
HisArg: 1.814 ± 0.897
HisSer: 0.605 ± 0.299
HisThr: 2.418 ± 0.216
HisVal: 1.814 ± 0.083
HisTrp: 0.0 ± 0.0
HisTyr: 0.605 ± 0.299
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.232 ± 2.807
IleCys: 0.0 ± 0.0
IleAsp: 1.209 ± 0.382
IleGlu: 3.628 ± 0.813
IlePhe: 0.605 ± 0.681
IleGly: 3.628 ± 1.146
IleHis: 1.814 ± 0.897
IleIle: 0.605 ± 0.681
IleLys: 0.0 ± 0.0
IleLeu: 1.814 ± 0.897
IleMet: 0.605 ± 0.299
IleAsn: 2.418 ± 1.196
IlePro: 3.628 ± 0.166
IleGln: 2.418 ± 0.764
IleArg: 2.418 ± 1.196
IleSer: 2.418 ± 1.744
IleThr: 2.418 ± 1.196
IleVal: 2.418 ± 1.744
IleTrp: 0.0 ± 0.0
IleTyr: 1.814 ± 0.083
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.023 ± 0.515
LysCys: 0.605 ± 0.299
LysAsp: 3.628 ± 0.166
LysGlu: 1.814 ± 0.083
LysPhe: 1.814 ± 0.083
LysGly: 1.814 ± 1.063
LysHis: 0.0 ± 0.0
LysIle: 0.0 ± 0.0
LysLys: 1.814 ± 0.083
LysLeu: 7.86 ± 0.034
LysMet: 0.605 ± 0.299
LysAsn: 0.0 ± 0.0
LysPro: 3.628 ± 1.793
LysGln: 1.814 ± 0.083
LysArg: 3.628 ± 0.813
LysSer: 2.418 ± 0.216
LysThr: 1.209 ± 0.382
LysVal: 1.814 ± 0.083
LysTrp: 1.209 ± 0.382
LysTyr: 3.023 ± 0.465
LysXaa: 0.605 ± 0.299
Leu
LeuAla: 6.651 ± 1.328
LeuCys: 1.209 ± 0.382
LeuAsp: 7.86 ± 0.034
LeuGlu: 6.651 ± 2.308
LeuPhe: 2.418 ± 0.216
LeuGly: 6.651 ± 2.308
LeuHis: 0.605 ± 0.299
LeuIle: 7.255 ± 1.627
LeuLys: 4.837 ± 1.411
LeuLeu: 10.278 ± 1.161
LeuMet: 2.418 ± 0.216
LeuAsn: 3.628 ± 0.813
LeuPro: 7.86 ± 0.034
LeuGln: 1.814 ± 0.083
LeuArg: 10.278 ± 0.182
LeuSer: 5.441 ± 0.25
LeuThr: 3.023 ± 0.465
LeuVal: 6.651 ± 0.348
LeuTrp: 0.0 ± 0.0
LeuTyr: 1.814 ± 0.083
LeuXaa: 0.0 ± 0.0
Met
MetAla: 4.232 ± 1.112
MetCys: 0.605 ± 0.299
MetAsp: 0.605 ± 0.299
MetGlu: 3.023 ± 0.515
MetPhe: 3.023 ± 0.515
MetGly: 1.209 ± 0.382
MetHis: 0.0 ± 0.0
MetIle: 1.814 ± 0.083
MetLys: 2.418 ± 0.216
MetLeu: 2.418 ± 1.196
MetMet: 1.814 ± 0.897
MetAsn: 0.605 ± 0.681
MetPro: 3.023 ± 0.515
MetGln: 0.605 ± 0.299
MetArg: 3.023 ± 0.515
MetSer: 1.814 ± 1.063
MetThr: 1.209 ± 0.598
MetVal: 1.209 ± 0.382
MetTrp: 0.0 ± 0.0
MetTyr: 1.209 ± 0.382
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 5.441 ± 1.23
AsnCys: 0.605 ± 0.681
AsnAsp: 2.418 ± 1.196
AsnGlu: 2.418 ± 0.764
AsnPhe: 0.0 ± 0.0
AsnGly: 2.418 ± 1.744
AsnHis: 2.418 ± 1.196
AsnIle: 2.418 ± 0.216
AsnLys: 0.605 ± 0.681
AsnLeu: 2.418 ± 1.196
AsnMet: 0.0 ± 0.0
AsnAsn: 1.209 ± 0.382
AsnPro: 3.628 ± 1.793
AsnGln: 0.0 ± 0.0
AsnArg: 3.023 ± 0.465
AsnSer: 1.209 ± 0.382
AsnThr: 3.628 ± 2.126
AsnVal: 3.023 ± 0.515
AsnTrp: 0.605 ± 0.299
AsnTyr: 0.605 ± 0.681
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.441 ± 0.25
ProCys: 0.0 ± 0.0
ProAsp: 4.232 ± 1.112
ProGlu: 3.023 ± 0.515
ProPhe: 2.418 ± 0.764
ProGly: 9.674 ± 1.097
ProHis: 1.209 ± 1.362
ProIle: 4.232 ± 0.132
ProLys: 2.418 ± 0.216
ProLeu: 2.418 ± 1.196
ProMet: 2.418 ± 0.216
ProAsn: 1.814 ± 0.083
ProPro: 6.651 ± 1.328
ProGln: 2.418 ± 1.196
ProArg: 4.837 ± 1.411
ProSer: 5.441 ± 1.23
ProThr: 8.464 ± 0.265
ProVal: 6.651 ± 3.288
ProTrp: 0.0 ± 0.0
ProTyr: 0.605 ± 0.299
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.418 ± 0.764
GlnCys: 0.605 ± 0.299
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.209 ± 0.598
GlnPhe: 0.0 ± 0.0
GlnGly: 1.209 ± 1.362
GlnHis: 0.605 ± 0.299
GlnIle: 0.605 ± 0.299
GlnLys: 1.209 ± 0.382
GlnLeu: 6.651 ± 0.348
GlnMet: 0.605 ± 0.299
GlnAsn: 0.0 ± 0.0
GlnPro: 1.814 ± 1.063
GlnGln: 1.209 ± 0.382
GlnArg: 3.023 ± 0.465
GlnSer: 1.209 ± 0.598
GlnThr: 1.209 ± 0.382
GlnVal: 1.814 ± 0.083
GlnTrp: 0.0 ± 0.0
GlnTyr: 0.605 ± 0.681
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.046 ± 0.049
ArgCys: 1.209 ± 0.598
ArgAsp: 6.651 ± 1.328
ArgGlu: 1.814 ± 0.897
ArgPhe: 3.023 ± 1.494
ArgGly: 3.628 ± 1.793
ArgHis: 1.209 ± 0.598
ArgIle: 3.023 ± 0.515
ArgLys: 1.209 ± 0.598
ArgLeu: 8.464 ± 1.245
ArgMet: 3.628 ± 0.521
ArgAsn: 5.441 ± 0.25
ArgPro: 2.418 ± 0.216
ArgGln: 2.418 ± 0.764
ArgArg: 11.487 ± 1.759
ArgSer: 3.628 ± 0.166
ArgThr: 4.837 ± 2.508
ArgVal: 6.651 ± 1.328
ArgTrp: 0.0 ± 0.0
ArgTyr: 3.023 ± 0.515
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.651 ± 3.571
SerCys: 0.0 ± 0.0
SerAsp: 2.418 ± 1.744
SerGlu: 1.814 ± 0.897
SerPhe: 0.605 ± 0.299
SerGly: 3.628 ± 0.166
SerHis: 1.209 ± 0.598
SerIle: 3.023 ± 2.425
SerLys: 4.232 ± 1.827
SerLeu: 3.023 ± 1.494
SerMet: 3.023 ± 0.465
SerAsn: 1.814 ± 0.083
SerPro: 3.628 ± 0.813
SerGln: 1.209 ± 0.598
SerArg: 3.628 ± 0.166
SerSer: 3.023 ± 1.445
SerThr: 1.814 ± 1.063
SerVal: 9.069 ± 3.356
SerTrp: 0.0 ± 0.0
SerTyr: 1.209 ± 0.382
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.651 ± 0.348
ThrCys: 0.0 ± 0.0
ThrAsp: 1.209 ± 0.382
ThrGlu: 0.605 ± 0.681
ThrPhe: 0.605 ± 0.681
ThrGly: 7.255 ± 2.293
ThrHis: 0.605 ± 0.299
ThrIle: 2.418 ± 0.216
ThrLys: 3.628 ± 1.146
ThrLeu: 3.628 ± 0.813
ThrMet: 3.023 ± 0.515
ThrAsn: 3.023 ± 1.445
ThrPro: 4.837 ± 0.549
ThrGln: 1.814 ± 1.063
ThrArg: 1.814 ± 1.063
ThrSer: 5.441 ± 1.23
ThrThr: 3.023 ± 1.445
ThrVal: 5.441 ± 0.73
ThrTrp: 0.605 ± 0.681
ThrTyr: 2.418 ± 0.216
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.651 ± 2.308
ValCys: 0.0 ± 0.0
ValAsp: 3.628 ± 0.166
ValGlu: 1.814 ± 0.083
ValPhe: 1.814 ± 0.897
ValGly: 3.628 ± 1.146
ValHis: 1.814 ± 0.083
ValIle: 2.418 ± 0.764
ValLys: 4.837 ± 0.431
ValLeu: 8.464 ± 0.265
ValMet: 2.418 ± 1.196
ValAsn: 3.628 ± 0.166
ValPro: 6.651 ± 0.632
ValGln: 1.814 ± 0.083
ValArg: 5.441 ± 0.73
ValSer: 5.441 ± 0.25
ValThr: 5.441 ± 1.23
ValVal: 7.86 ± 0.034
ValTrp: 1.814 ± 2.043
ValTyr: 2.418 ± 1.196
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.605 ± 0.299
TrpCys: 0.605 ± 0.299
TrpAsp: 0.605 ± 0.681
TrpGlu: 1.209 ± 1.362
TrpPhe: 0.0 ± 0.0
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.605 ± 0.681
TrpLys: 0.0 ± 0.0
TrpLeu: 0.0 ± 0.0
TrpMet: 0.0 ± 0.0
TrpAsn: 0.605 ± 0.681
TrpPro: 0.605 ± 0.299
TrpGln: 1.209 ± 0.382
TrpArg: 1.814 ± 0.897
TrpSer: 0.605 ± 0.681
TrpThr: 0.0 ± 0.0
TrpVal: 0.605 ± 0.299
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.441 ± 0.73
TyrCys: 0.0 ± 0.0
TyrAsp: 2.418 ± 0.216
TyrGlu: 1.814 ± 0.897
TyrPhe: 1.209 ± 0.382
TyrGly: 1.209 ± 0.598
TyrHis: 0.0 ± 0.0
TyrIle: 1.814 ± 0.083
TyrLys: 1.814 ± 0.897
TyrLeu: 2.418 ± 1.196
TyrMet: 1.814 ± 0.083
TyrAsn: 1.814 ± 1.063
TyrPro: 1.814 ± 1.063
TyrGln: 0.0 ± 0.0
TyrArg: 3.023 ± 0.515
TyrSer: 1.209 ± 1.362
TyrThr: 1.814 ± 0.083
TyrVal: 2.418 ± 0.216
TyrTrp: 0.605 ± 0.681
TyrTyr: 1.209 ± 0.382
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.605 ± 0.299
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1655 amino acids)

Note: The error has been estimated by bootstrapping (x100) at the protein level.
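The page does not describe the bootstrap procedure in detail. One possible reading, sketched below in Python, is to resample whole proteins with replacement 100 times, recompute the dipeptide frequency for each resample, and report the standard deviation of those estimates. The function names, the use of the population standard deviation, and the fixed seed are assumptions made for this illustration only.

```python
import random
import statistics
from collections import Counter


def pair_permille(sequences, pair):
    """Frequency (per mille) of one dipeptide over the given sequences."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the estimate when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_rounds):
        # Protein-level resampling: draw as many proteins as the proteome has.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


# Toy proteome of two very different "proteins" (not real data).
toy = ["AAAA", "CCCC"]
print(pair_permille(toy, "AA"), bootstrap_error(toy, "AA"))
```

With only two proteins, as in this proteome, each resample can only be one of a few combinations, so error estimates obtained this way are coarse.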

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski