Amino acid dipeptide frequency for Wallerfield virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 standard dipeptides would be expected at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
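The counting described above can be sketched in Python as follows. This is a minimal illustration, not the code used by Proteome-pI: the function names `dipeptide_per_mille` and `classify`, the one-letter alphabet, the mapping of every non-standard letter to `X` (Xaa), and the use of the uniform 2.5‰ expectation as the over/under threshold are all assumptions made for the example. The toy sequences are invented and are not the Wallerfield virus proteome.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code (assumed alphabet).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all sequences.

    Pairs are counted with a sliding window of width 2 inside each
    sequence, so a pair never spans two proteins. Any letter outside
    the standard alphabet is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in sequences:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


def classify(freq_per_mille, n_residue_types=20):
    """Label a frequency relative to the uniform expectation (assumed threshold)."""
    expected = 1000.0 / n_residue_types ** 2  # 2.5 per mille for 20 types
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


# Hypothetical toy input: 4 + 3 = 7 dipeptides in total, two of them "KL".
toy = ["MKLLA", "AAKL"]
freqs = dipeptide_per_mille(toy)
```

With the values from the table, `classify(2.082)` labels AlaAla as underrepresented and `classify(9.368)` labels LysLeu as overrepresented under this assumed threshold.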

All values are given in per mille (‰); multiply them by 10^-3 to obtain fractions.
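As a small worked example of the unit conversion, the snippet below turns a per-mille value from the table into a fraction and a percentage and compares it with the uniform expectation of 1/400. The helper names are hypothetical and chosen only for this illustration.

```python
def per_mille_to_fraction(value):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per-mille value to percent (1 per mille = 0.1 percent)."""
    return value * 0.1


# Uniform expectation for 400 equally likely dipeptides, in per mille.
uniform = 1000.0 / 400

# AlaAla value taken from the table.
ala_ala = 2.082
fraction = per_mille_to_fraction(ala_ala)
percent = per_mille_to_percent(ala_ala)

# Ratio below 1 means the pair is rarer than the uniform expectation.
ratio = ala_ala / uniform
```

Here AlaAla at 2.082‰ corresponds to about 0.21% of all dipeptides, slightly below the uniform 2.5‰ (0.25%).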

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
2.082AlaAla: 2.082 ± 0.75
1.735AlaCys: 1.735 ± 0.895
1.388AlaAsp: 1.388 ± 0.716
3.123AlaGlu: 3.123 ± 1.611
0.347AlaPhe: 0.347 ± 1.005
1.041AlaGly: 1.041 ± 0.537
1.388AlaHis: 1.388 ± 0.716
6.246AlaIle: 6.246 ± 2.768
3.47AlaLys: 3.47 ± 1.166
3.123AlaLeu: 3.123 ± 1.459
0.347AlaMet: 0.347 ± 0.179
2.776AlaAsn: 2.776 ± 0.46
4.164AlaPro: 4.164 ± 1.5
0.694AlaGln: 0.694 ± 0.358
1.041AlaArg: 1.041 ± 1.211
4.858AlaSer: 4.858 ± 1.117
3.47AlaThr: 3.47 ± 1.349
4.164AlaVal: 4.164 ± 0.599
0.347AlaTrp: 0.347 ± 0.718
2.082AlaTyr: 2.082 ± 0.75
0.0AlaXaa: 0.0 ± 0.0
Cys
2.082CysAla: 2.082 ± 1.074
0.0CysCys: 0.0 ± 0.0
2.082CysAsp: 2.082 ± 1.074
0.347CysGlu: 0.347 ± 0.179
1.735CysPhe: 1.735 ± 0.879
0.0CysGly: 0.0 ± 0.0
1.735CysHis: 1.735 ± 0.895
2.082CysIle: 2.082 ± 1.7
1.388CysLys: 1.388 ± 0.354
1.388CysLeu: 1.388 ± 0.354
0.347CysMet: 0.347 ± 0.718
1.041CysAsn: 1.041 ± 0.799
0.347CysPro: 0.347 ± 0.718
0.0CysGln: 0.0 ± 0.0
1.735CysArg: 1.735 ± 0.351
2.776CysSer: 2.776 ± 0.912
3.47CysThr: 3.47 ± 1.166
1.041CysVal: 1.041 ± 0.537
0.0CysTrp: 0.0 ± 0.0
2.082CysTyr: 2.082 ± 0.431
0.0CysXaa: 0.0 ± 0.0
Asp
2.776AspAla: 2.776 ± 0.709
1.735AspCys: 1.735 ± 0.895
3.123AspAsp: 3.123 ± 0.87
2.429AspGlu: 2.429 ± 0.558
3.123AspPhe: 3.123 ± 0.887
1.388AspGly: 1.388 ± 0.741
1.041AspHis: 1.041 ± 0.437
7.287AspIle: 7.287 ± 1.73
2.082AspLys: 2.082 ± 0.431
6.94AspLeu: 6.94 ± 2.79
2.082AspMet: 2.082 ± 1.016
4.164AspAsn: 4.164 ± 0.352
2.429AspPro: 2.429 ± 1.253
1.041AspGln: 1.041 ± 0.537
1.388AspArg: 1.388 ± 0.354
4.511AspSer: 4.511 ± 1.025
3.47AspThr: 3.47 ± 1.547
5.899AspVal: 5.899 ± 2.258
0.0AspTrp: 0.0 ± 0.0
3.47AspTyr: 3.47 ± 0.396
0.0AspXaa: 0.0 ± 0.0
Glu
1.041GluAla: 1.041 ± 0.537
1.041GluCys: 1.041 ± 0.437
4.164GluAsp: 4.164 ± 1.38
3.123GluGlu: 3.123 ± 1.611
3.817GluPhe: 3.817 ± 1.866
0.694GluGly: 0.694 ± 0.358
2.082GluHis: 2.082 ± 0.431
5.552GluIle: 5.552 ± 1.417
3.123GluLys: 3.123 ± 1.611
4.511GluLeu: 4.511 ± 1.028
1.388GluMet: 1.388 ± 0.636
3.817GluAsn: 3.817 ± 1.969
1.388GluPro: 1.388 ± 0.716
1.041GluGln: 1.041 ± 0.537
2.429GluArg: 2.429 ± 0.578
4.164GluSer: 4.164 ± 0.352
1.041GluThr: 1.041 ± 0.437
3.123GluVal: 3.123 ± 1.311
0.0GluTrp: 0.0 ± 0.0
3.817GluTyr: 3.817 ± 0.476
0.0GluXaa: 0.0 ± 0.0
Phe
4.511PheAla: 4.511 ± 1.296
1.735PheCys: 1.735 ± 0.895
2.776PheAsp: 2.776 ± 1.066
3.817PheGlu: 3.817 ± 1.123
2.776PhePhe: 2.776 ± 2.086
2.429PheGly: 2.429 ± 0.816
1.388PheHis: 1.388 ± 0.716
4.858PheIle: 4.858 ± 2.49
3.817PheLys: 3.817 ± 1.311
3.123PheLeu: 3.123 ± 3.46
1.041PheMet: 1.041 ± 0.799
3.123PheAsn: 3.123 ± 0.887
2.429PhePro: 2.429 ± 0.558
1.041PheGln: 1.041 ± 0.437
2.429PheArg: 2.429 ± 0.558
5.899PheSer: 5.899 ± 1.916
2.776PheThr: 2.776 ± 0.709
5.899PheVal: 5.899 ± 2.774
0.347PheTrp: 0.347 ± 0.179
1.735PheTyr: 1.735 ± 0.895
0.0PheXaa: 0.0 ± 0.0
Gly
1.388GlyAla: 1.388 ± 1.78
0.0GlyCys: 0.0 ± 0.0
2.776GlyAsp: 2.776 ± 0.46
2.082GlyGlu: 2.082 ± 0.431
1.388GlyPhe: 1.388 ± 0.716
1.388GlyGly: 1.388 ± 0.354
0.694GlyHis: 0.694 ± 0.358
2.776GlyIle: 2.776 ± 0.912
2.429GlyLys: 2.429 ± 2.412
1.735GlyLeu: 1.735 ± 0.879
1.388GlyMet: 1.388 ± 0.741
2.082GlyAsn: 2.082 ± 1.074
0.0GlyPro: 0.0 ± 0.0
0.0GlyGln: 0.0 ± 0.0
1.735GlyArg: 1.735 ± 0.351
2.429GlySer: 2.429 ± 0.775
1.388GlyThr: 1.388 ± 1.78
2.776GlyVal: 2.776 ± 0.912
0.0GlyTrp: 0.0 ± 0.0
1.388GlyTyr: 1.388 ± 1.043
0.0GlyXaa: 0.0 ± 0.0
His
1.388HisAla: 1.388 ± 0.354
1.041HisCys: 1.041 ± 0.437
1.388HisAsp: 1.388 ± 0.716
2.776HisGlu: 2.776 ± 1.432
2.776HisPhe: 2.776 ± 0.912
1.388HisGly: 1.388 ± 0.741
1.388HisHis: 1.388 ± 1.133
3.47HisIle: 3.47 ± 0.702
2.082HisLys: 2.082 ± 2.561
2.429HisLeu: 2.429 ± 0.558
0.0HisMet: 0.0 ± 0.0
0.694HisAsn: 0.694 ± 0.358
1.735HisPro: 1.735 ± 0.351
0.0HisGln: 0.0 ± 0.0
1.041HisArg: 1.041 ± 0.537
2.776HisSer: 2.776 ± 0.709
1.735HisThr: 1.735 ± 0.895
1.388HisVal: 1.388 ± 0.354
0.347HisTrp: 0.347 ± 0.179
1.041HisTyr: 1.041 ± 0.537
0.0HisXaa: 0.0 ± 0.0
Ile
3.47IleAla: 3.47 ± 1.79
1.388IleCys: 1.388 ± 0.716
4.858IleAsp: 4.858 ± 1.117
4.511IleGlu: 4.511 ± 0.981
3.123IlePhe: 3.123 ± 1.311
3.47IleGly: 3.47 ± 0.708
2.776IleHis: 2.776 ± 0.708
6.593IleIle: 6.593 ± 2.515
7.287IleLys: 7.287 ± 0.541
6.593IleLeu: 6.593 ± 4.785
2.082IleMet: 2.082 ± 1.074
3.123IleAsn: 3.123 ± 1.311
5.899IlePro: 5.899 ± 1.872
2.429IleGln: 2.429 ± 1.253
3.123IleArg: 3.123 ± 0.87
6.593IleSer: 6.593 ± 1.409
4.511IleThr: 4.511 ± 2.426
6.94IleVal: 6.94 ± 1.405
0.694IleTrp: 0.694 ± 1.435
2.776IleTyr: 2.776 ± 2.267
0.0IleXaa: 0.0 ± 0.0
Lys
3.123LysAla: 3.123 ± 1.031
1.735LysCys: 1.735 ± 1.603
2.429LysAsp: 2.429 ± 0.558
2.429LysGlu: 2.429 ± 0.775
6.593LysPhe: 6.593 ± 0.962
1.388LysGly: 1.388 ± 1.043
1.735LysHis: 1.735 ± 0.724
2.776LysIle: 2.776 ± 0.709
2.429LysLys: 2.429 ± 1.253
9.368LysLeu: 9.368 ± 2.525
1.041LysMet: 1.041 ± 0.537
4.164LysAsn: 4.164 ± 2.072
2.082LysPro: 2.082 ± 0.75
1.388LysGln: 1.388 ± 0.741
2.776LysArg: 2.776 ± 1.432
4.164LysSer: 4.164 ± 1.463
2.776LysThr: 2.776 ± 0.709
3.123LysVal: 3.123 ± 1.311
0.347LysTrp: 0.347 ± 0.179
4.858LysTyr: 4.858 ± 1.117
0.0LysXaa: 0.0 ± 0.0
Leu
2.776LeuAla: 2.776 ± 0.708
2.429LeuCys: 2.429 ± 1.531
6.593LeuAsp: 6.593 ± 1.816
3.123LeuGlu: 3.123 ± 0.87
6.246LeuPhe: 6.246 ± 0.311
2.776LeuGly: 2.776 ± 0.912
1.735LeuHis: 1.735 ± 0.351
6.94LeuIle: 6.94 ± 1.417
5.899LeuLys: 5.899 ± 1.576
7.981LeuLeu: 7.981 ± 1.708
2.082LeuMet: 2.082 ± 0.431
4.858LeuAsn: 4.858 ± 2.505
3.47LeuPro: 3.47 ± 1.547
4.858LeuGln: 4.858 ± 0.043
3.47LeuArg: 3.47 ± 1.349
6.94LeuSer: 6.94 ± 0.738
8.328LeuThr: 8.328 ± 3.198
5.205LeuVal: 5.205 ± 3.029
0.0LeuTrp: 0.0 ± 0.0
2.776LeuTyr: 2.776 ± 0.46
0.0LeuXaa: 0.0 ± 0.0
Met
1.735MetAla: 1.735 ± 0.879
1.388MetCys: 1.388 ± 0.354
1.735MetAsp: 1.735 ± 0.895
1.388MetGlu: 1.388 ± 0.354
0.694MetPhe: 0.694 ± 0.358
1.041MetGly: 1.041 ± 0.437
0.694MetHis: 0.694 ± 0.567
0.347MetIle: 0.347 ± 0.179
2.082MetLys: 2.082 ± 1.074
2.429MetLeu: 2.429 ± 1.531
1.388MetMet: 1.388 ± 0.716
0.694MetAsn: 0.694 ± 0.358
0.694MetPro: 0.694 ± 0.358
0.347MetGln: 0.347 ± 1.005
1.388MetArg: 1.388 ± 1.133
1.735MetSer: 1.735 ± 0.895
0.694MetThr: 0.694 ± 0.358
0.347MetVal: 0.347 ± 1.005
0.0MetTrp: 0.0 ± 0.0
1.041MetTyr: 1.041 ± 0.537
0.0MetXaa: 0.0 ± 0.0
Asn
3.123AsnAla: 3.123 ± 0.87
0.347AsnCys: 0.347 ± 0.179
3.123AsnAsp: 3.123 ± 1.611
2.082AsnGlu: 2.082 ± 0.431
3.817AsnPhe: 3.817 ± 0.53
2.429AsnGly: 2.429 ± 0.558
1.388AsnHis: 1.388 ± 0.354
5.552AsnIle: 5.552 ± 1.408
2.429AsnLys: 2.429 ± 0.578
5.205AsnLeu: 5.205 ± 0.192
2.082AsnMet: 2.082 ± 0.874
2.429AsnAsn: 2.429 ± 1.253
2.776AsnPro: 2.776 ± 0.46
1.735AsnGln: 1.735 ± 0.895
3.123AsnArg: 3.123 ± 0.39
3.47AsnSer: 3.47 ± 1.037
3.817AsnThr: 3.817 ± 1.207
5.899AsnVal: 5.899 ± 1.409
0.694AsnTrp: 0.694 ± 0.358
3.123AsnTyr: 3.123 ± 1.723
0.0AsnXaa: 0.0 ± 0.0
Pro
1.735ProAla: 1.735 ± 0.724
1.041ProCys: 1.041 ± 0.537
3.123ProAsp: 3.123 ± 0.87
2.429ProGlu: 2.429 ± 0.816
3.123ProPhe: 3.123 ± 0.887
2.429ProGly: 2.429 ± 0.775
0.694ProHis: 0.694 ± 0.567
2.429ProIle: 2.429 ± 0.775
3.47ProLys: 3.47 ± 0.396
2.776ProLeu: 2.776 ± 0.46
0.347ProMet: 0.347 ± 0.179
3.123ProAsn: 3.123 ± 0.87
2.082ProPro: 2.082 ± 1.7
0.694ProGln: 0.694 ± 0.358
2.082ProArg: 2.082 ± 1.854
3.47ProSer: 3.47 ± 0.396
2.082ProThr: 2.082 ± 0.874
2.429ProVal: 2.429 ± 1.253
0.0ProTrp: 0.0 ± 0.0
3.123ProTyr: 3.123 ± 0.87
0.0ProXaa: 0.0 ± 0.0
Gln
0.0GlnAla: 0.0 ± 0.0
0.694GlnCys: 0.694 ± 0.358
1.388GlnAsp: 1.388 ± 0.716
0.0GlnGlu: 0.0 ± 0.0
2.776GlnPhe: 2.776 ± 1.582
0.694GlnGly: 0.694 ± 0.358
0.694GlnHis: 0.694 ± 0.567
2.082GlnIle: 2.082 ± 1.074
1.041GlnLys: 1.041 ± 0.537
2.776GlnLeu: 2.776 ± 0.709
0.694GlnMet: 0.694 ± 0.358
2.429GlnAsn: 2.429 ± 1.253
1.388GlnPro: 1.388 ± 0.716
0.0GlnGln: 0.0 ± 0.0
2.082GlnArg: 2.082 ± 0.75
1.388GlnSer: 1.388 ± 0.716
0.694GlnThr: 0.694 ± 0.358
0.694GlnVal: 0.694 ± 0.567
0.347GlnTrp: 0.347 ± 0.179
2.082GlnTyr: 2.082 ± 0.431
0.0GlnXaa: 0.0 ± 0.0
Arg
1.388ArgAla: 1.388 ± 0.741
2.082ArgCys: 2.082 ± 1.074
0.694ArgAsp: 0.694 ± 0.567
4.164ArgGlu: 4.164 ± 0.352
1.388ArgPhe: 1.388 ± 0.741
0.347ArgGly: 0.347 ± 0.179
2.429ArgHis: 2.429 ± 1.253
4.164ArgIle: 4.164 ± 1.38
1.041ArgLys: 1.041 ± 0.437
5.552ArgLeu: 5.552 ± 0.369
0.694ArgMet: 0.694 ± 0.358
3.47ArgAsn: 3.47 ± 0.702
1.388ArgPro: 1.388 ± 0.354
1.041ArgGln: 1.041 ± 0.537
1.041ArgArg: 1.041 ± 0.537
2.429ArgSer: 2.429 ± 0.558
4.511ArgThr: 4.511 ± 4.16
3.123ArgVal: 3.123 ± 0.682
0.0ArgTrp: 0.0 ± 0.0
3.123ArgTyr: 3.123 ± 0.87
0.0ArgXaa: 0.0 ± 0.0
Ser
5.899SerAla: 5.899 ± 1.916
1.388SerCys: 1.388 ± 1.043
3.817SerAsp: 3.817 ± 1.372
4.164SerGlu: 4.164 ± 0.352
4.511SerPhe: 4.511 ± 1.025
1.388SerGly: 1.388 ± 0.741
2.082SerHis: 2.082 ± 0.431
7.634SerIle: 7.634 ± 1.441
4.164SerLys: 4.164 ± 1.181
5.899SerLeu: 5.899 ± 1.191
1.041SerMet: 1.041 ± 1.211
4.164SerAsn: 4.164 ± 1.463
4.511SerPro: 4.511 ± 0.981
3.47SerGln: 3.47 ± 0.702
4.164SerArg: 4.164 ± 1.38
3.817SerSer: 3.817 ± 2.255
5.899SerThr: 5.899 ± 1.191
3.47SerVal: 3.47 ± 1.037
0.694SerTrp: 0.694 ± 0.358
4.511SerTyr: 4.511 ± 1.621
0.0SerXaa: 0.0 ± 0.0
Thr
3.817ThrAla: 3.817 ± 0.476
2.776ThrCys: 2.776 ± 0.709
3.123ThrAsp: 3.123 ± 0.39
4.164ThrGlu: 4.164 ± 1.062
3.123ThrPhe: 3.123 ± 0.39
1.388ThrGly: 1.388 ± 1.781
2.429ThrHis: 2.429 ± 0.558
3.47ThrIle: 3.47 ± 0.702
3.47ThrLys: 3.47 ± 1.349
3.123ThrLeu: 3.123 ± 0.87
1.735ThrMet: 1.735 ± 0.351
3.123ThrAsn: 3.123 ± 1.723
3.123ThrPro: 3.123 ± 0.682
1.388ThrGln: 1.388 ± 0.716
3.47ThrArg: 3.47 ± 0.396
4.858ThrSer: 4.858 ± 1.157
4.858ThrThr: 4.858 ± 1.632
6.246ThrVal: 6.246 ± 6.058
0.0ThrTrp: 0.0 ± 0.0
4.164ThrTyr: 4.164 ± 0.861
0.0ThrXaa: 0.0 ± 0.0
Val
4.511ValAla: 4.511 ± 1.557
1.388ValCys: 1.388 ± 0.354
4.858ValAsp: 4.858 ± 0.856
3.123ValGlu: 3.123 ± 0.87
2.776ValPhe: 2.776 ± 1.582
1.735ValGly: 1.735 ± 1.682
2.776ValHis: 2.776 ± 0.912
4.164ValIle: 4.164 ± 1.062
5.899ValLys: 5.899 ± 1.206
7.981ValLeu: 7.981 ± 1.708
1.735ValMet: 1.735 ± 0.792
6.593ValAsn: 6.593 ± 0.313
1.735ValPro: 1.735 ± 0.351
1.041ValGln: 1.041 ± 0.537
2.429ValArg: 2.429 ± 1.253
6.246ValSer: 6.246 ± 0.311
3.817ValThr: 3.817 ± 1.372
5.205ValVal: 5.205 ± 1.069
0.694ValTrp: 0.694 ± 0.567
3.47ValTyr: 3.47 ± 2.431
0.0ValXaa: 0.0 ± 0.0
Trp
0.347TrpAla: 0.347 ± 0.718
0.0TrpCys: 0.0 ± 0.0
0.694TrpAsp: 0.694 ± 0.567
0.0TrpGlu: 0.0 ± 0.0
0.347TrpPhe: 0.347 ± 0.179
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.347TrpIle: 0.347 ± 0.179
0.694TrpLys: 0.694 ± 0.358
0.694TrpLeu: 0.694 ± 0.358
0.0TrpMet: 0.0 ± 0.0
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.694TrpSer: 0.694 ± 1.435
0.0TrpThr: 0.0 ± 0.0
0.694TrpVal: 0.694 ± 0.358
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.694TyrAla: 0.694 ± 0.89
1.735TyrCys: 1.735 ± 0.996
6.246TyrAsp: 6.246 ± 2.435
2.429TyrGlu: 2.429 ± 0.816
3.47TyrPhe: 3.47 ± 0.702
2.429TyrGly: 2.429 ± 0.775
1.735TyrHis: 1.735 ± 0.351
3.47TyrIle: 3.47 ± 0.702
2.429TyrLys: 2.429 ± 0.558
4.858TyrLeu: 4.858 ± 2.8
0.0TyrMet: 0.0 ± 0.0
2.776TyrAsn: 2.776 ± 0.708
1.041TyrPro: 1.041 ± 1.281
1.735TyrGln: 1.735 ± 0.895
3.123TyrArg: 3.123 ± 1.611
3.47TyrSer: 3.47 ± 1.166
4.511TyrThr: 4.511 ± 0.175
4.511TyrVal: 4.511 ± 1.296
0.0TyrTrp: 0.0 ± 0.0
5.552TyrTyr: 5.552 ± 1.408
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2883 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
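A protein-level bootstrap of this kind might look roughly like the sketch below. The page only states that bootstrapping was done 100 times at the protein level; everything else here is an assumption made for illustration: that whole proteins are resampled with replacement, that the reported error is the standard deviation of the resampled frequencies, and the names `pair_frequency` and `bootstrap_error`. The sequences are invented toy data.

```python
import random
import statistics


def pair_frequency(sequences, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    hits = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples.

    Each resample draws len(sequences) whole proteins with replacement,
    so the unit of resampling is the protein, not the residue.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(pair_frequency(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteome with three short "proteins".
toy = ["MKLLA", "AAKL", "KLKL"]
error_kl = bootstrap_error(toy, "KL")
error_ww = bootstrap_error(toy, "WW")  # pair never occurs
```

A pair that never occurs has a frequency of zero in every resample, so its estimated error is zero as well, which matches the `0.0 ± 0.0` entries in the table. With only three proteins in the proteome, the resamples are few in kind, so errors obtained this way are necessarily coarse.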

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski