Amino acid dipeptide frequency for Wenling zhaovirus-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
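As a rough illustration of how such a frequency could be computed, the sketch below counts overlapping residue pairs and reports them in per mille. The function name, the use of one-letter codes, the mapping of non-standard residues to X (Xaa), and the choice to count pairs only within a protein are all assumptions made for this example; the page does not describe the exact procedure used by the database.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code (assumed input alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs within one protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input, not taken from the proteome described on this page.
freqs = dipeptide_permille(["MKKLA", "AKK"])
```

In the toy input there are six overlapping pairs in total and KK occurs twice, so its frequency is one third of all pairs.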

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred in proteins with equal probability, each would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
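The uniform baseline described above can be sketched as follows. The helper names and the simple greater-than/less-than comparison are assumptions for illustration; the page does not state which threshold the database uses for the red and blue marking.

```python
def expected_permille(n_symbols=20):
    """Uniform expectation per dipeptide, in per mille (20 -> 400 pairs, 21 -> 441)."""
    return 1000.0 / (n_symbols ** 2)

def classify(freq_permille, n_symbols=20):
    """Label a dipeptide frequency relative to the uniform expectation.

    Hypothetical rule: strictly above the baseline is "over",
    strictly below is "under", otherwise "expected".
    """
    baseline = expected_permille(n_symbols)
    if freq_permille > baseline:
        return "over"
    if freq_permille < baseline:
        return "under"
    return "expected"

# Two values taken from the table on this page (in per mille).
leu_gln = classify(10.477)  # LeuGln
cys_cys = classify(0.0)     # CysCys
```

With 20 amino acids the baseline is 2.5‰, so LeuGln (10.477‰) falls above it and CysCys (0.0‰) below it.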

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.191 ± 0.906
AlaCys: 0.524 ± 0.52
AlaAsp: 1.048 ± 0.227
AlaGlu: 1.048 ± 0.227
AlaPhe: 3.143 ± 0.133
AlaGly: 1.572 ± 0.746
AlaHis: 0.524 ± 0.293
AlaIle: 2.619 ± 0.653
AlaLys: 3.143 ± 2.305
AlaLeu: 1.572 ± 0.88
AlaMet: 0.524 ± 0.52
AlaAsn: 4.715 ± 1.426
AlaPro: 3.143 ± 1.493
AlaGln: 2.095 ± 0.36
AlaArg: 2.095 ± 0.36
AlaSer: 3.667 ± 0.386
AlaThr: 3.143 ± 0.68
AlaVal: 1.572 ± 0.746
AlaTrp: 1.048 ± 1.039
AlaTyr: 3.143 ± 0.946
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.048 ± 0.586
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.572 ± 0.746
CysPhe: 0.524 ± 0.52
CysGly: 1.048 ± 0.586
CysHis: 0.0 ± 0.0
CysIle: 2.095 ± 0.36
CysLys: 0.524 ± 0.293
CysLeu: 1.572 ± 0.746
CysMet: 1.048 ± 0.227
CysAsn: 3.143 ± 0.946
CysPro: 0.524 ± 0.293
CysGln: 1.048 ± 0.586
CysArg: 0.0 ± 0.0
CysSer: 1.572 ± 0.067
CysThr: 0.524 ± 0.293
CysVal: 0.524 ± 0.293
CysTrp: 0.0 ± 0.0
CysTyr: 0.524 ± 0.293
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.619 ± 0.653
AspCys: 0.0 ± 0.0
AspAsp: 2.619 ± 0.16
AspGlu: 2.619 ± 0.973
AspPhe: 4.191 ± 1.533
AspGly: 2.619 ± 0.16
AspHis: 0.0 ± 0.0
AspIle: 8.381 ± 3.878
AspLys: 4.191 ± 1.719
AspLeu: 3.667 ± 1.199
AspMet: 1.572 ± 0.067
AspAsn: 2.619 ± 0.653
AspPro: 3.143 ± 0.68
AspGln: 3.143 ± 0.946
AspArg: 1.048 ± 0.227
AspSer: 3.667 ± 2.012
AspThr: 3.667 ± 0.426
AspVal: 2.619 ± 1.466
AspTrp: 0.524 ± 0.293
AspTyr: 5.762 ± 0.786
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.048 ± 1.039
GluCys: 1.572 ± 0.067
GluAsp: 1.048 ± 0.586
GluGlu: 1.048 ± 0.586
GluPhe: 2.619 ± 0.653
GluGly: 1.572 ± 0.746
GluHis: 0.524 ± 0.293
GluIle: 3.143 ± 0.68
GluLys: 3.667 ± 0.426
GluLeu: 2.619 ± 0.973
GluMet: 0.524 ± 0.52
GluAsn: 1.572 ± 0.88
GluPro: 3.667 ± 1.199
GluGln: 2.619 ± 0.973
GluArg: 1.048 ± 1.039
GluSer: 3.143 ± 1.493
GluThr: 3.143 ± 0.68
GluVal: 2.619 ± 0.16
GluTrp: 0.0 ± 0.0
GluTyr: 1.048 ± 0.227
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.619 ± 1.786
PheCys: 1.572 ± 0.746
PheAsp: 3.667 ± 1.239
PheGlu: 1.048 ± 0.586
PhePhe: 0.0 ± 0.0
PheGly: 1.048 ± 0.227
PheHis: 0.0 ± 0.0
PheIle: 1.572 ± 0.88
PheLys: 6.286 ± 1.079
PheLeu: 3.143 ± 1.759
PheMet: 0.0 ± 0.0
PheAsn: 3.143 ± 0.133
PhePro: 1.048 ± 0.586
PheGln: 1.572 ± 0.067
PheArg: 2.619 ± 0.653
PheSer: 3.143 ± 0.68
PheThr: 3.667 ± 0.386
PheVal: 1.048 ± 0.227
PheTrp: 0.524 ± 0.52
PheTyr: 3.143 ± 0.946
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.524 ± 0.52
GlyCys: 2.095 ± 1.173
GlyAsp: 4.191 ± 1.719
GlyGlu: 2.619 ± 1.786
GlyPhe: 2.095 ± 1.266
GlyGly: 2.619 ± 0.973
GlyHis: 0.524 ± 0.293
GlyIle: 2.619 ± 1.466
GlyLys: 4.715 ± 1.426
GlyLeu: 4.715 ± 2.239
GlyMet: 0.524 ± 0.52
GlyAsn: 2.619 ± 0.973
GlyPro: 0.524 ± 0.52
GlyGln: 4.715 ± 0.2
GlyArg: 1.572 ± 0.746
GlySer: 4.715 ± 1.013
GlyThr: 2.619 ± 1.786
GlyVal: 1.048 ± 0.227
GlyTrp: 0.0 ± 0.0
GlyTyr: 1.572 ± 0.067
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.524 ± 0.293
HisGlu: 1.048 ± 0.227
HisPhe: 0.524 ± 0.293
HisGly: 1.048 ± 0.227
HisHis: 0.524 ± 0.293
HisIle: 1.572 ± 0.067
HisLys: 0.524 ± 0.52
HisLeu: 1.572 ± 0.067
HisMet: 0.0 ± 0.0
HisAsn: 0.524 ± 0.293
HisPro: 1.048 ± 0.586
HisGln: 1.048 ± 0.227
HisArg: 0.524 ± 0.293
HisSer: 0.0 ± 0.0
HisThr: 1.048 ± 0.586
HisVal: 1.048 ± 0.586
HisTrp: 0.0 ± 0.0
HisTyr: 1.572 ± 0.067
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.286 ± 1.359
IleCys: 1.048 ± 0.586
IleAsp: 7.334 ± 1.666
IleGlu: 3.143 ± 0.133
IlePhe: 1.572 ± 0.88
IleGly: 2.095 ± 0.36
IleHis: 2.095 ± 0.36
IleIle: 5.762 ± 1.599
IleLys: 5.238 ± 0.493
IleLeu: 6.81 ± 0.253
IleMet: 1.048 ± 0.227
IleAsn: 3.143 ± 0.133
IlePro: 4.191 ± 0.093
IleGln: 4.191 ± 0.72
IleArg: 2.619 ± 1.466
IleSer: 2.619 ± 0.653
IleThr: 4.715 ± 0.2
IleVal: 3.667 ± 2.052
IleTrp: 0.524 ± 0.293
IleTyr: 2.095 ± 0.36
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.095 ± 0.453
LysCys: 1.048 ± 0.586
LysAsp: 6.286 ± 1.079
LysGlu: 3.143 ± 1.493
LysPhe: 3.667 ± 2.052
LysGly: 4.715 ± 0.2
LysHis: 3.143 ± 0.133
LysIle: 5.238 ± 2.119
LysLys: 7.334 ± 0.04
LysLeu: 8.381 ± 2.625
LysMet: 2.095 ± 0.36
LysAsn: 5.762 ± 0.84
LysPro: 3.667 ± 0.426
LysGln: 9.953 ± 0.933
LysArg: 4.191 ± 1.719
LysSer: 6.81 ± 1.066
LysThr: 6.81 ± 1.879
LysVal: 2.619 ± 0.653
LysTrp: 0.524 ± 0.293
LysTyr: 2.619 ± 1.466
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.619 ± 0.973
LeuCys: 1.048 ± 0.227
LeuAsp: 2.619 ± 1.786
LeuGlu: 3.143 ± 0.68
LeuPhe: 3.667 ± 0.386
LeuGly: 2.619 ± 1.466
LeuHis: 0.524 ± 0.52
LeuIle: 3.143 ± 1.493
LeuLys: 8.381 ± 1.439
LeuLeu: 6.286 ± 2.172
LeuMet: 0.524 ± 0.293
LeuAsn: 4.191 ± 1.533
LeuPro: 5.238 ± 0.493
LeuGln: 10.477 ± 1.452
LeuArg: 2.619 ± 1.466
LeuSer: 8.905 ± 0.706
LeuThr: 4.191 ± 0.093
LeuVal: 4.715 ± 3.052
LeuTrp: 1.048 ± 0.586
LeuTyr: 4.191 ± 1.533
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.048 ± 0.586
MetCys: 1.048 ± 0.227
MetAsp: 1.572 ± 0.067
MetGlu: 0.524 ± 0.293
MetPhe: 0.524 ± 0.293
MetGly: 0.0 ± 0.0
MetHis: 0.524 ± 0.293
MetIle: 1.048 ± 0.227
MetLys: 2.095 ± 1.173
MetLeu: 1.572 ± 1.559
MetMet: 0.524 ± 0.293
MetAsn: 2.619 ± 1.786
MetPro: 2.095 ± 0.453
MetGln: 1.048 ± 1.039
MetArg: 0.524 ± 0.293
MetSer: 1.572 ± 0.88
MetThr: 1.048 ± 0.227
MetVal: 2.095 ± 0.453
MetTrp: 0.524 ± 0.293
MetTyr: 0.524 ± 0.52
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.667 ± 1.199
AsnCys: 1.572 ± 0.067
AsnAsp: 2.095 ± 0.453
AsnGlu: 2.095 ± 0.36
AsnPhe: 2.095 ± 0.453
AsnGly: 4.191 ± 1.719
AsnHis: 0.0 ± 0.0
AsnIle: 5.762 ± 0.027
AsnLys: 5.238 ± 0.32
AsnLeu: 6.286 ± 0.546
AsnMet: 0.0 ± 0.0
AsnAsn: 4.715 ± 0.2
AsnPro: 2.619 ± 1.466
AsnGln: 5.238 ± 2.932
AsnArg: 5.238 ± 0.493
AsnSer: 5.238 ± 0.493
AsnThr: 5.238 ± 0.493
AsnVal: 2.619 ± 0.973
AsnTrp: 0.0 ± 0.0
AsnTyr: 2.095 ± 0.453
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.667 ± 0.426
ProCys: 1.048 ± 0.227
ProAsp: 2.619 ± 0.653
ProGlu: 4.191 ± 0.093
ProPhe: 1.572 ± 0.067
ProGly: 2.095 ± 0.453
ProHis: 0.524 ± 0.52
ProIle: 3.667 ± 0.426
ProLys: 5.238 ± 1.133
ProLeu: 3.143 ± 0.946
ProMet: 1.048 ± 1.039
ProAsn: 3.143 ± 0.68
ProPro: 2.619 ± 0.653
ProGln: 1.572 ± 0.067
ProArg: 3.667 ± 2.052
ProSer: 2.095 ± 0.453
ProThr: 1.572 ± 0.067
ProVal: 1.048 ± 0.227
ProTrp: 1.048 ± 0.586
ProTyr: 1.048 ± 0.227
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.667 ± 0.426
GlnCys: 1.048 ± 0.586
GlnAsp: 3.667 ± 0.426
GlnGlu: 3.143 ± 0.133
GlnPhe: 3.667 ± 1.199
GlnGly: 1.572 ± 0.746
GlnHis: 0.0 ± 0.0
GlnIle: 5.238 ± 1.306
GlnLys: 7.858 ± 0.333
GlnLeu: 7.334 ± 0.853
GlnMet: 4.191 ± 0.72
GlnAsn: 6.286 ± 0.267
GlnPro: 2.619 ± 0.16
GlnGln: 7.334 ± 0.853
GlnArg: 3.143 ± 0.68
GlnSer: 4.191 ± 0.72
GlnThr: 4.191 ± 2.532
GlnVal: 5.762 ± 1.599
GlnTrp: 1.048 ± 0.586
GlnTyr: 4.715 ± 1.826
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.048 ± 0.586
ArgCys: 0.0 ± 0.0
ArgAsp: 3.667 ± 1.239
ArgGlu: 0.524 ± 0.52
ArgPhe: 2.095 ± 1.173
ArgGly: 2.619 ± 0.16
ArgHis: 1.572 ± 0.88
ArgIle: 1.572 ± 0.88
ArgLys: 2.619 ± 1.466
ArgLeu: 3.143 ± 0.946
ArgMet: 1.048 ± 0.18
ArgAsn: 4.191 ± 0.093
ArgPro: 1.048 ± 0.227
ArgGln: 5.238 ± 0.493
ArgArg: 2.095 ± 1.173
ArgSer: 5.238 ± 1.946
ArgThr: 2.619 ± 0.653
ArgVal: 1.572 ± 0.88
ArgTrp: 0.0 ± 0.0
ArgTyr: 1.048 ± 1.039
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 2.095 ± 2.079
SerCys: 1.572 ± 0.88
SerAsp: 4.191 ± 0.093
SerGlu: 3.143 ± 1.493
SerPhe: 3.143 ± 0.946
SerGly: 4.191 ± 0.906
SerHis: 0.0 ± 0.0
SerIle: 5.238 ± 1.133
SerLys: 6.286 ± 0.546
SerLeu: 5.762 ± 1.652
SerMet: 1.048 ± 0.586
SerAsn: 5.238 ± 1.306
SerPro: 3.143 ± 0.133
SerGln: 6.81 ± 1.066
SerArg: 5.238 ± 0.493
SerSer: 5.238 ± 1.946
SerThr: 3.667 ± 1.199
SerVal: 3.143 ± 0.68
SerTrp: 1.572 ± 0.88
SerTyr: 2.095 ± 1.266
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.095 ± 0.453
ThrCys: 1.048 ± 0.227
ThrAsp: 5.238 ± 0.32
ThrGlu: 2.095 ± 0.453
ThrPhe: 2.619 ± 0.973
ThrGly: 4.715 ± 3.865
ThrHis: 0.524 ± 0.293
ThrIle: 4.715 ± 1.426
ThrLys: 6.81 ± 1.373
ThrLeu: 2.619 ± 1.466
ThrMet: 3.667 ± 2.012
ThrAsn: 3.143 ± 1.493
ThrPro: 2.619 ± 0.653
ThrGln: 3.143 ± 0.946
ThrArg: 1.572 ± 0.067
ThrSer: 5.238 ± 0.32
ThrThr: 3.667 ± 0.386
ThrVal: 1.048 ± 0.586
ThrTrp: 1.048 ± 0.227
ThrTyr: 4.191 ± 1.533
ThrXaa: 0.0 ± 0.0
Val
ValAla: 1.572 ± 0.746
ValCys: 0.0 ± 0.0
ValAsp: 1.572 ± 0.88
ValGlu: 0.0 ± 0.0
ValPhe: 1.572 ± 0.067
ValGly: 3.143 ± 0.68
ValHis: 1.572 ± 0.746
ValIle: 4.191 ± 0.72
ValLys: 2.095 ± 0.36
ValLeu: 4.715 ± 0.2
ValMet: 1.572 ± 0.067
ValAsn: 3.143 ± 0.133
ValPro: 2.095 ± 0.36
ValGln: 4.715 ± 1.013
ValArg: 2.619 ± 1.466
ValSer: 4.191 ± 0.906
ValThr: 2.619 ± 0.653
ValVal: 2.095 ± 0.453
ValTrp: 1.048 ± 0.227
ValTyr: 2.095 ± 0.36
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.524 ± 0.293
TrpCys: 0.0 ± 0.0
TrpAsp: 2.619 ± 0.653
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 1.572 ± 0.746
TrpHis: 0.524 ± 0.293
TrpIle: 0.0 ± 0.0
TrpLys: 1.048 ± 0.586
TrpLeu: 1.048 ± 0.586
TrpMet: 0.0 ± 0.0
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 1.048 ± 0.227
TrpArg: 0.524 ± 0.293
TrpSer: 0.524 ± 0.52
TrpThr: 0.524 ± 0.293
TrpVal: 0.0 ± 0.0
TrpTrp: 0.524 ± 0.52
TrpTyr: 0.524 ± 0.293
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.095 ± 0.36
TyrCys: 1.048 ± 0.586
TyrAsp: 2.095 ± 0.453
TyrGlu: 2.095 ± 0.36
TyrPhe: 1.572 ± 0.067
TyrGly: 1.572 ± 0.746
TyrHis: 1.048 ± 0.586
TyrIle: 2.619 ± 0.653
TyrLys: 6.81 ± 2.692
TyrLeu: 4.191 ± 1.533
TyrMet: 1.048 ± 0.586
TyrAsn: 2.095 ± 1.173
TyrPro: 1.572 ± 0.067
TyrGln: 3.667 ± 2.052
TyrArg: 0.524 ± 0.293
TyrSer: 1.048 ± 0.227
TyrThr: 3.143 ± 0.946
TyrVal: 5.762 ± 1.599
TyrTrp: 0.0 ± 0.0
TyrTyr: 4.191 ± 1.533
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1910 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
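A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the population standard deviation as the "±" value are assumptions for illustration; the page only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_permille(proteins, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, n_boot=100, seed=0):
    """Spread of the dipeptide frequency over protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed only to make the sketch reproducible
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement (same number of proteins).
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumption: report the standard deviation across replicates.
    return statistics.pstdev(estimates)

# Toy sequences, not taken from the proteome described on this page.
err = bootstrap_error(["MKKLA", "AKLLKK"], "KK")
```

With only two proteins, as in this proteome, each replicate is one of just a few possible combinations, so the resulting error estimate should be read with caution.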

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski