Amino acid dipepetide frequency for Viola virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.696AlaAla: 2.696 ± 0.173
2.097AlaCys: 2.097 ± 0.536
1.797AlaAsp: 1.797 ± 0.186
4.793AlaGlu: 4.793 ± 0.257
2.696AlaPhe: 2.696 ± 0.731
3.295AlaGly: 3.295 ± 1.094
1.498AlaHis: 1.498 ± 0.899
3.295AlaIle: 3.295 ± 0.19
3.295AlaLys: 3.295 ± 0.642
7.49AlaLeu: 7.49 ± 2.69
2.397AlaMet: 2.397 ± 0.549
1.498AlaAsn: 1.498 ± 0.448
1.797AlaPro: 1.797 ± 0.266
2.097AlaGln: 2.097 ± 0.084
4.194AlaArg: 4.194 ± 1.187
6.89AlaSer: 6.89 ± 0.342
2.397AlaThr: 2.397 ± 0.355
4.793AlaVal: 4.793 ± 0.257
0.3AlaTrp: 0.3 ± 0.182
1.198AlaTyr: 1.198 ± 0.726
0.0AlaXaa: 0.0 ± 0.0
Cys
0.899CysAla: 0.899 ± 0.359
0.3CysCys: 0.3 ± 0.27
0.599CysAsp: 0.599 ± 0.089
0.3CysGlu: 0.3 ± 0.27
0.899CysPhe: 0.899 ± 0.811
1.198CysGly: 1.198 ± 0.629
0.599CysHis: 0.599 ± 0.54
1.198CysIle: 1.198 ± 0.629
2.397CysLys: 2.397 ± 0.806
3.295CysLeu: 3.295 ± 1.617
0.599CysMet: 0.599 ± 0.089
0.899CysAsn: 0.899 ± 0.093
0.899CysPro: 0.899 ± 0.093
2.397CysGln: 2.397 ± 0.806
1.498CysArg: 1.498 ± 0.899
2.696CysSer: 2.696 ± 0.625
2.097CysThr: 2.097 ± 1.44
1.498CysVal: 1.498 ± 0.899
0.599CysTrp: 0.599 ± 0.089
1.498CysTyr: 1.498 ± 0.448
0.0CysXaa: 0.0 ± 0.0
Asp
4.194AspAla: 4.194 ± 0.169
1.498AspCys: 1.498 ± 1.351
5.093AspAsp: 5.093 ± 1.28
5.392AspGlu: 5.392 ± 0.106
3.595AspPhe: 3.595 ± 0.824
2.397AspGly: 2.397 ± 0.806
1.797AspHis: 1.797 ± 0.186
4.494AspIle: 4.494 ± 0.916
1.498AspLys: 1.498 ± 0.004
3.895AspLeu: 3.895 ± 1.457
1.797AspMet: 1.797 ± 0.638
2.696AspAsn: 2.696 ± 0.625
1.797AspPro: 1.797 ± 0.186
1.498AspGln: 1.498 ± 0.004
3.595AspArg: 3.595 ± 0.532
4.194AspSer: 4.194 ± 1.524
2.996AspThr: 2.996 ± 0.443
2.397AspVal: 2.397 ± 0.097
1.198AspTrp: 1.198 ± 0.177
1.797AspTyr: 1.797 ± 0.186
0.0AspXaa: 0.0 ± 0.0
Glu
4.793GluAla: 4.793 ± 0.646
1.797GluCys: 1.797 ± 0.718
6.291GluAsp: 6.291 ± 0.65
5.992GluGlu: 5.992 ± 2.276
2.996GluPhe: 2.996 ± 1.364
6.291GluGly: 6.291 ± 0.199
1.797GluHis: 1.797 ± 0.266
4.793GluIle: 4.793 ± 0.646
4.194GluLys: 4.194 ± 1.639
6.591GluLeu: 6.591 ± 1.284
0.899GluMet: 0.899 ± 0.359
1.198GluAsn: 1.198 ± 0.177
1.498GluPro: 1.498 ± 0.456
2.696GluGln: 2.696 ± 0.279
4.194GluArg: 4.194 ± 0.169
3.895GluSer: 3.895 ± 1.457
2.996GluThr: 2.996 ± 0.46
5.392GluVal: 5.392 ± 0.106
0.3GluTrp: 0.3 ± 0.182
1.498GluTyr: 1.498 ± 0.448
0.0GluXaa: 0.0 ± 0.0
Phe
2.996PheAla: 2.996 ± 0.009
0.599PheCys: 0.599 ± 0.54
2.397PheAsp: 2.397 ± 0.097
2.397PheGlu: 2.397 ± 0.549
1.498PhePhe: 1.498 ± 0.908
2.097PheGly: 2.097 ± 0.819
0.899PheHis: 0.899 ± 0.545
2.696PheIle: 2.696 ± 0.731
3.895PheLys: 3.895 ± 0.553
5.093PheLeu: 5.093 ± 0.076
0.899PheMet: 0.899 ± 0.093
1.498PheAsn: 1.498 ± 0.448
2.996PhePro: 2.996 ± 0.46
0.0PheGln: 0.0 ± 0.0
2.996PheArg: 2.996 ± 1.364
2.996PheSer: 2.996 ± 0.895
2.996PheThr: 2.996 ± 0.912
3.595PheVal: 3.595 ± 0.372
0.599PheTrp: 0.599 ± 0.089
0.599PheTyr: 0.599 ± 0.089
0.0PheXaa: 0.0 ± 0.0
Gly
3.895GlyAla: 3.895 ± 0.101
1.797GlyCys: 1.797 ± 0.718
3.895GlyAsp: 3.895 ± 0.802
2.696GlyGlu: 2.696 ± 0.625
2.696GlyPhe: 2.696 ± 0.731
6.291GlyGly: 6.291 ± 2.964
2.696GlyHis: 2.696 ± 0.279
3.595GlyIle: 3.595 ± 0.824
3.895GlyLys: 3.895 ± 1.457
4.194GlyLeu: 4.194 ± 0.169
2.696GlyMet: 2.696 ± 0.129
1.797GlyAsn: 1.797 ± 0.266
2.996GlyPro: 2.996 ± 0.443
2.097GlyGln: 2.097 ± 0.536
0.899GlyArg: 0.899 ± 0.545
6.591GlySer: 6.591 ± 2.783
2.397GlyThr: 2.397 ± 1.258
3.895GlyVal: 3.895 ± 0.553
0.599GlyTrp: 0.599 ± 0.54
1.198GlyTyr: 1.198 ± 0.177
0.0GlyXaa: 0.0 ± 0.0
His
1.797HisAla: 1.797 ± 0.186
0.0HisCys: 0.0 ± 0.0
0.599HisAsp: 0.599 ± 0.089
2.397HisGlu: 2.397 ± 0.355
1.797HisPhe: 1.797 ± 0.266
1.797HisGly: 1.797 ± 0.186
0.0HisHis: 0.0 ± 0.0
1.797HisIle: 1.797 ± 1.089
2.097HisLys: 2.097 ± 0.084
3.295HisLeu: 3.295 ± 1.165
0.599HisMet: 0.599 ± 0.54
0.3HisAsn: 0.3 ± 0.27
0.599HisPro: 0.599 ± 0.363
1.198HisGln: 1.198 ± 0.726
1.498HisArg: 1.498 ± 0.456
2.696HisSer: 2.696 ± 0.625
0.3HisThr: 0.3 ± 0.182
2.097HisVal: 2.097 ± 0.084
0.0HisTrp: 0.0 ± 0.0
0.899HisTyr: 0.899 ± 0.093
0.0HisXaa: 0.0 ± 0.0
Ile
4.194IleAla: 4.194 ± 0.621
1.198IleCys: 1.198 ± 0.629
3.295IleAsp: 3.295 ± 0.714
4.194IleGlu: 4.194 ± 2.09
1.498IlePhe: 1.498 ± 0.004
4.194IleGly: 4.194 ± 1.524
2.397IleHis: 2.397 ± 0.097
2.996IleIle: 2.996 ± 0.46
5.692IleLys: 5.692 ± 0.616
3.295IleLeu: 3.295 ± 0.642
2.097IleMet: 2.097 ± 0.367
3.295IleAsn: 3.295 ± 0.19
1.797IlePro: 1.797 ± 0.186
2.397IleGln: 2.397 ± 0.355
3.895IleArg: 3.895 ± 0.553
6.591IleSer: 6.591 ± 1.736
4.494IleThr: 4.494 ± 0.465
2.996IleVal: 2.996 ± 0.46
0.599IleTrp: 0.599 ± 0.363
1.498IleTyr: 1.498 ± 0.908
0.0IleXaa: 0.0 ± 0.0
Lys
5.392LysAla: 5.392 ± 0.346
3.295LysCys: 3.295 ± 1.165
4.793LysAsp: 4.793 ± 0.709
3.895LysGlu: 3.895 ± 1.457
2.397LysPhe: 2.397 ± 0.549
4.194LysGly: 4.194 ± 1.072
0.899LysHis: 0.899 ± 0.545
4.494LysIle: 4.494 ± 0.916
4.494LysLys: 4.494 ± 1.795
6.591LysLeu: 6.591 ± 0.072
1.797LysMet: 1.797 ± 0.266
1.498LysAsn: 1.498 ± 0.004
2.696LysPro: 2.696 ± 0.279
1.797LysGln: 1.797 ± 1.089
2.996LysArg: 2.996 ± 1.364
6.291LysSer: 6.291 ± 0.705
4.793LysThr: 4.793 ± 0.257
4.494LysVal: 4.494 ± 0.465
1.498LysTrp: 1.498 ± 0.004
1.797LysTyr: 1.797 ± 0.266
0.0LysXaa: 0.0 ± 0.0
Leu
3.895LeuAla: 3.895 ± 1.706
1.797LeuCys: 1.797 ± 0.638
3.595LeuAsp: 3.595 ± 0.824
6.291LeuGlu: 6.291 ± 0.199
5.093LeuPhe: 5.093 ± 0.828
5.392LeuGly: 5.392 ± 1.009
1.797LeuHis: 1.797 ± 0.638
5.992LeuIle: 5.992 ± 0.017
6.591LeuLys: 6.591 ± 0.38
6.89LeuLeu: 6.89 ± 0.11
2.397LeuMet: 2.397 ± 0.097
2.097LeuAsn: 2.097 ± 0.536
2.696LeuPro: 2.696 ± 0.173
4.494LeuGln: 4.494 ± 0.465
6.291LeuArg: 6.291 ± 0.199
9.587LeuSer: 9.587 ± 0.063
4.194LeuThr: 4.194 ± 1.072
5.392LeuVal: 5.392 ± 1.702
0.0LeuTrp: 0.0 ± 0.0
3.895LeuTyr: 3.895 ± 0.553
0.0LeuXaa: 0.0 ± 0.0
Met
3.295MetAla: 3.295 ± 0.19
0.0MetCys: 0.0 ± 0.0
2.397MetAsp: 2.397 ± 0.549
1.498MetGlu: 1.498 ± 0.908
1.797MetPhe: 1.797 ± 0.266
0.899MetGly: 0.899 ± 0.545
0.899MetHis: 0.899 ± 0.545
2.397MetIle: 2.397 ± 0.355
2.097MetLys: 2.097 ± 0.536
1.797MetLeu: 1.797 ± 0.638
1.797MetMet: 1.797 ± 1.089
0.899MetAsn: 0.899 ± 0.093
0.899MetPro: 0.899 ± 0.093
0.599MetGln: 0.599 ± 0.363
1.198MetArg: 1.198 ± 0.275
1.797MetSer: 1.797 ± 1.17
1.797MetThr: 1.797 ± 0.718
0.899MetVal: 0.899 ± 0.093
0.0MetTrp: 0.0 ± 0.0
1.198MetTyr: 1.198 ± 0.275
0.0MetXaa: 0.0 ± 0.0
Asn
0.599AsnAla: 0.599 ± 0.089
0.3AsnCys: 0.3 ± 0.27
2.097AsnAsp: 2.097 ± 1.44
2.996AsnGlu: 2.996 ± 0.009
1.498AsnPhe: 1.498 ± 0.908
0.899AsnGly: 0.899 ± 0.093
0.899AsnHis: 0.899 ± 0.093
1.198AsnIle: 1.198 ± 0.629
3.595AsnLys: 3.595 ± 0.372
3.895AsnLeu: 3.895 ± 1.457
1.198AsnMet: 1.198 ± 0.95
2.097AsnAsn: 2.097 ± 0.536
2.397AsnPro: 2.397 ± 0.097
1.797AsnGln: 1.797 ± 0.638
1.797AsnArg: 1.797 ± 0.266
1.797AsnSer: 1.797 ± 0.638
2.097AsnThr: 2.097 ± 0.084
1.498AsnVal: 1.498 ± 0.448
0.599AsnTrp: 0.599 ± 0.089
1.498AsnTyr: 1.498 ± 0.899
0.0AsnXaa: 0.0 ± 0.0
Pro
0.599ProAla: 0.599 ± 0.089
0.599ProCys: 0.599 ± 0.363
2.097ProAsp: 2.097 ± 0.536
3.295ProGlu: 3.295 ± 1.546
2.397ProPhe: 2.397 ± 0.806
2.996ProGly: 2.996 ± 0.009
0.899ProHis: 0.899 ± 0.359
0.899ProIle: 0.899 ± 0.811
2.696ProLys: 2.696 ± 0.173
4.494ProLeu: 4.494 ± 0.891
0.899ProMet: 0.899 ± 0.093
1.498ProAsn: 1.498 ± 0.004
0.599ProPro: 0.599 ± 0.089
0.3ProGln: 0.3 ± 0.27
1.498ProArg: 1.498 ± 0.004
3.295ProSer: 3.295 ± 1.094
0.899ProThr: 0.899 ± 0.093
2.097ProVal: 2.097 ± 0.819
0.899ProTrp: 0.899 ± 0.093
0.899ProTyr: 0.899 ± 0.093
0.0ProXaa: 0.0 ± 0.0
Gln
2.097GlnAla: 2.097 ± 0.367
1.797GlnCys: 1.797 ± 0.718
3.295GlnAsp: 3.295 ± 0.714
3.295GlnGlu: 3.295 ± 0.642
1.198GlnPhe: 1.198 ± 0.726
2.996GlnGly: 2.996 ± 0.443
1.498GlnHis: 1.498 ± 0.456
3.295GlnIle: 3.295 ± 0.642
3.595GlnLys: 3.595 ± 0.984
1.498GlnLeu: 1.498 ± 0.456
0.599GlnMet: 0.599 ± 0.363
0.599GlnAsn: 0.599 ± 0.089
1.198GlnPro: 1.198 ± 0.275
0.899GlnGln: 0.899 ± 0.093
1.498GlnArg: 1.498 ± 0.456
2.097GlnSer: 2.097 ± 0.084
1.198GlnThr: 1.198 ± 0.177
0.599GlnVal: 0.599 ± 0.089
0.3GlnTrp: 0.3 ± 0.182
0.599GlnTyr: 0.599 ± 0.363
0.0GlnXaa: 0.0 ± 0.0
Arg
4.494ArgAla: 4.494 ± 0.439
0.899ArgCys: 0.899 ± 0.811
4.194ArgAsp: 4.194 ± 1.639
3.295ArgGlu: 3.295 ± 0.262
0.0ArgPhe: 0.0 ± 0.0
2.696ArgGly: 2.696 ± 0.279
0.599ArgHis: 0.599 ± 0.089
5.093ArgIle: 5.093 ± 0.828
3.595ArgLys: 3.595 ± 0.08
4.793ArgLeu: 4.793 ± 0.194
1.797ArgMet: 1.797 ± 0.638
1.797ArgAsn: 1.797 ± 1.089
1.797ArgPro: 1.797 ± 0.718
1.797ArgGln: 1.797 ± 0.638
2.696ArgArg: 2.696 ± 0.173
5.692ArgSer: 5.692 ± 0.739
2.097ArgThr: 2.097 ± 0.819
2.696ArgVal: 2.696 ± 0.279
0.899ArgTrp: 0.899 ± 0.093
1.498ArgTyr: 1.498 ± 0.908
0.0ArgXaa: 0.0 ± 0.0
Ser
5.392SerAla: 5.392 ± 0.558
4.194SerCys: 4.194 ± 2.88
2.696SerAsp: 2.696 ± 1.182
4.793SerGlu: 4.793 ± 1.098
5.093SerPhe: 5.093 ± 0.528
4.494SerGly: 4.494 ± 1.343
2.696SerHis: 2.696 ± 0.625
3.595SerIle: 3.595 ± 0.372
6.591SerLys: 6.591 ± 1.284
9.587SerLeu: 9.587 ± 0.389
2.696SerMet: 2.696 ± 0.731
3.295SerAsn: 3.295 ± 0.262
2.097SerPro: 2.097 ± 0.367
2.097SerGln: 2.097 ± 0.084
3.895SerArg: 3.895 ± 1.005
7.789SerSer: 7.789 ± 1.153
6.591SerThr: 6.591 ± 0.975
8.089SerVal: 8.089 ± 1.423
1.797SerTrp: 1.797 ± 0.186
2.397SerTyr: 2.397 ± 0.355
0.0SerXaa: 0.0 ± 0.0
Thr
3.295ThrAla: 3.295 ± 1.165
1.198ThrCys: 1.198 ± 0.177
3.295ThrAsp: 3.295 ± 0.19
3.295ThrGlu: 3.295 ± 0.262
1.498ThrPhe: 1.498 ± 0.899
4.194ThrGly: 4.194 ± 1.524
0.599ThrHis: 0.599 ± 0.089
2.097ThrIle: 2.097 ± 0.536
3.595ThrLys: 3.595 ± 0.532
5.692ThrLeu: 5.692 ± 0.287
1.498ThrMet: 1.498 ± 0.448
4.494ThrAsn: 4.494 ± 0.013
1.198ThrPro: 1.198 ± 0.726
2.696ThrGln: 2.696 ± 1.077
1.797ThrArg: 1.797 ± 0.266
5.692ThrSer: 5.692 ± 0.739
3.895ThrThr: 3.895 ± 1.254
4.194ThrVal: 4.194 ± 0.621
0.599ThrTrp: 0.599 ± 0.363
2.097ThrTyr: 2.097 ± 0.084
0.0ThrXaa: 0.0 ± 0.0
Val
2.996ValAla: 2.996 ± 1.364
2.097ValCys: 2.097 ± 0.988
3.595ValAsp: 3.595 ± 1.436
5.392ValGlu: 5.392 ± 1.461
2.996ValPhe: 2.996 ± 0.46
2.996ValGly: 2.996 ± 0.443
2.397ValHis: 2.397 ± 1.258
5.392ValIle: 5.392 ± 0.346
5.093ValLys: 5.093 ± 0.98
2.696ValLeu: 2.696 ± 0.625
0.3ValMet: 0.3 ± 0.182
1.498ValAsn: 1.498 ± 0.004
2.097ValPro: 2.097 ± 0.988
2.097ValGln: 2.097 ± 0.819
3.295ValArg: 3.295 ± 0.19
6.591ValSer: 6.591 ± 0.832
4.793ValThr: 4.793 ± 0.709
4.194ValVal: 4.194 ± 1.072
0.599ValTrp: 0.599 ± 0.363
1.797ValTyr: 1.797 ± 0.186
0.0ValXaa: 0.0 ± 0.0
Trp
0.3TrpAla: 0.3 ± 0.182
0.3TrpCys: 0.3 ± 0.182
0.3TrpAsp: 0.3 ± 0.182
1.198TrpGlu: 1.198 ± 0.177
0.599TrpPhe: 0.599 ± 0.089
0.899TrpGly: 0.899 ± 0.359
0.0TrpHis: 0.0 ± 0.0
0.599TrpIle: 0.599 ± 0.089
0.3TrpLys: 0.3 ± 0.182
0.899TrpLeu: 0.899 ± 0.093
0.3TrpMet: 0.3 ± 0.182
0.599TrpAsn: 0.599 ± 0.089
0.3TrpPro: 0.3 ± 0.182
0.0TrpGln: 0.0 ± 0.0
0.899TrpArg: 0.899 ± 0.545
1.198TrpSer: 1.198 ± 0.275
2.097TrpThr: 2.097 ± 0.536
0.899TrpVal: 0.899 ± 0.093
0.3TrpTrp: 0.3 ± 0.182
0.3TrpTyr: 0.3 ± 0.182
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.696TyrAla: 2.696 ± 0.731
0.599TyrCys: 0.599 ± 0.54
1.498TyrAsp: 1.498 ± 0.004
2.696TyrGlu: 2.696 ± 0.625
1.498TyrPhe: 1.498 ± 0.908
0.899TyrGly: 0.899 ± 0.093
0.899TyrHis: 0.899 ± 0.545
2.696TyrIle: 2.696 ± 0.173
0.899TyrLys: 0.899 ± 0.093
2.397TyrLeu: 2.397 ± 1.001
0.599TyrMet: 0.599 ± 0.363
1.198TyrAsn: 1.198 ± 0.726
1.498TyrPro: 1.498 ± 0.899
1.198TyrGln: 1.198 ± 0.177
1.797TyrArg: 1.797 ± 0.266
1.498TyrSer: 1.498 ± 0.004
1.797TyrThr: 1.797 ± 0.266
1.198TyrVal: 1.198 ± 0.275
0.599TyrTrp: 0.599 ± 0.089
0.3TyrTyr: 0.3 ± 0.27
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3339 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski