Amino acid dipeptide frequency for Wenzhou tombus-like virus 9

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all 400 dipeptides occurred in proteins randomly and with equal probability, each would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
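As a minimal sketch of how such frequencies might be computed, the snippet below counts overlapping residue pairs within each protein and divides by the total number of pairs. The function name `dipeptide_frequencies`, the one-letter input alphabet, and the toy sequences are illustrative assumptions; this is not the Proteome-pI implementation, whose exact counting rules are not stated here.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard amino acids, one-letter codes

def dipeptide_frequencies(proteins):
    """Return {dipeptide: fraction} over all overlapping pairs in `proteins`.

    Pairs are counted within each sequence only (never across protein
    boundaries); any non-standard letter is mapped to 'X' (Xaa).
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    return {pair: n / total for pair, n in counts.items()} if total else {}

# Toy input (assumption), not taken from the virus proteome.
freqs = dipeptide_frequencies(["MKVLAAGK", "AAB"])

# Uniform expectation from the text: 1/400 = 0.0025, i.e. 0.25%.
uniform = 1 / 400
overrepresented = sorted(p for p, f in freqs.items() if f > uniform)
```

With so few residues every observed pair exceeds the uniform expectation, which is why such comparisons are only informative for reasonably large proteomes.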

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
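The unit conversion can be illustrated with a few values copied from the table. In this sketch the helper names and the assumption that the red/blue threshold is exactly the uniform expectation (0.25% = 2.5‰) are illustrative, not confirmed details of the page.

```python
# Uniform expectation expressed in per mille: 1000 / 400 = 2.5 (i.e. 0.25%).
UNIFORM_PER_MILLE = 1000 / 400

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a value relative to the (assumed) uniform expectation."""
    if value_per_mille > expected:
        return "over"    # would be marked in red
    if value_per_mille < expected:
        return "under"   # would be marked in blue
    return "as expected"

# Values taken from the table for this proteome.
examples = {"SerSer": 9.917, "AlaAla": 4.959, "AlaAsn": 2.479, "CysCys": 0.0}
labels = {name: classify(v) for name, v in examples.items()}
```

For instance, SerSer at 9.917‰ corresponds to a fraction of roughly 0.0099, about four times the uniform expectation.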

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
4.959AlaAla: 4.959 ± 2.135
1.653AlaCys: 1.653 ± 0.739
3.306AlaAsp: 3.306 ± 1.967
5.785AlaGlu: 5.785 ± 2.453
1.653AlaPhe: 1.653 ± 0.515
3.306AlaGly: 3.306 ± 0.952
0.0AlaHis: 0.0 ± 0.0
3.306AlaIle: 3.306 ± 1.462
4.959AlaLys: 4.959 ± 0.378
4.132AlaLeu: 4.132 ± 2.436
0.826AlaMet: 0.826 ± 0.57
2.479AlaAsn: 2.479 ± 0.445
4.132AlaPro: 4.132 ± 1.589
0.826AlaGln: 0.826 ± 0.57
1.653AlaArg: 1.653 ± 0.515
5.785AlaSer: 5.785 ± 4.844
3.306AlaThr: 3.306 ± 1.259
6.612AlaVal: 6.612 ± 0.565
1.653AlaTrp: 1.653 ± 1.454
1.653AlaTyr: 1.653 ± 0.515
0.0AlaXaa: 0.0 ± 0.0
Cys
0.826CysAla: 0.826 ± 0.727
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.653CysGlu: 1.653 ± 1.705
0.0CysPhe: 0.0 ± 0.0
1.653CysGly: 1.653 ± 0.739
0.0CysHis: 0.0 ± 0.0
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.826CysMet: 0.826 ± 0.631
0.0CysAsn: 0.0 ± 0.0
0.826CysPro: 0.826 ± 0.57
0.826CysGln: 0.826 ± 0.57
2.479CysArg: 2.479 ± 1.711
2.479CysSer: 2.479 ± 1.008
0.0CysThr: 0.0 ± 0.0
1.653CysVal: 1.653 ± 0.739
0.0CysTrp: 0.0 ± 0.0
1.653CysTyr: 1.653 ± 1.141
0.0CysXaa: 0.0 ± 0.0
Asp
4.959AspAla: 4.959 ± 0.994
1.653AspCys: 1.653 ± 1.141
5.785AspAsp: 5.785 ± 1.685
6.612AspGlu: 6.612 ± 1.73
0.0AspPhe: 0.0 ± 0.0
8.264AspGly: 8.264 ± 1.565
2.479AspHis: 2.479 ± 1.505
4.132AspIle: 4.132 ± 0.446
1.653AspLys: 1.653 ± 0.739
4.959AspLeu: 4.959 ± 1.173
2.479AspMet: 2.479 ± 1.008
2.479AspAsn: 2.479 ± 0.445
6.612AspPro: 6.612 ± 2.015
3.306AspGln: 3.306 ± 1.3
4.132AspArg: 4.132 ± 0.782
1.653AspSer: 1.653 ± 1.454
3.306AspThr: 3.306 ± 1.967
2.479AspVal: 2.479 ± 1.123
1.653AspTrp: 1.653 ± 0.515
0.826AspTyr: 0.826 ± 0.727
0.0AspXaa: 0.0 ± 0.0
Glu
4.132GluAla: 4.132 ± 3.318
0.0GluCys: 0.0 ± 0.0
3.306GluAsp: 3.306 ± 2.282
3.306GluGlu: 3.306 ± 1.479
5.785GluPhe: 5.785 ± 1.702
0.826GluGly: 0.826 ± 0.853
3.306GluHis: 3.306 ± 2.282
2.479GluIle: 2.479 ± 1.711
2.479GluLys: 2.479 ± 0.445
4.959GluLeu: 4.959 ± 2.246
0.826GluMet: 0.826 ± 0.57
2.479GluAsn: 2.479 ± 1.691
4.959GluPro: 4.959 ± 1.544
3.306GluGln: 3.306 ± 1.462
5.785GluArg: 5.785 ± 0.859
4.132GluSer: 4.132 ± 1.674
0.826GluThr: 0.826 ± 0.727
4.959GluVal: 4.959 ± 2.135
1.653GluTrp: 1.653 ± 0.739
3.306GluTyr: 3.306 ± 1.03
0.0GluXaa: 0.0 ± 0.0
Phe
0.826PheAla: 0.826 ± 0.57
1.653PheCys: 1.653 ± 0.515
4.132PheAsp: 4.132 ± 1.839
2.479PheGlu: 2.479 ± 0.808
0.826PhePhe: 0.826 ± 0.57
2.479PheGly: 2.479 ± 0.445
0.826PheHis: 0.826 ± 0.727
0.826PheIle: 0.826 ± 0.727
0.826PheLys: 0.826 ± 0.853
0.826PheLeu: 0.826 ± 0.57
0.0PheMet: 0.0 ± 0.0
2.479PheAsn: 2.479 ± 0.808
0.826PhePro: 0.826 ± 0.727
1.653PheGln: 1.653 ± 0.515
3.306PheArg: 3.306 ± 2.282
4.132PheSer: 4.132 ± 1.229
2.479PheThr: 2.479 ± 1.123
4.132PheVal: 4.132 ± 1.229
0.826PheTrp: 0.826 ± 0.853
1.653PheTyr: 1.653 ± 1.141
0.0PheXaa: 0.0 ± 0.0
Gly
4.959GlyAla: 4.959 ± 1.34
3.306GlyCys: 3.306 ± 1.479
8.264GlyAsp: 8.264 ± 1.26
4.959GlyGlu: 4.959 ± 0.89
1.653GlyPhe: 1.653 ± 0.515
6.612GlyGly: 6.612 ± 2.663
0.0GlyHis: 0.0 ± 0.0
4.959GlyIle: 4.959 ± 0.378
4.132GlyLys: 4.132 ± 0.782
7.438GlyLeu: 7.438 ± 3.106
2.479GlyMet: 2.479 ± 1.123
2.479GlyAsn: 2.479 ± 0.808
1.653GlyPro: 1.653 ± 0.515
0.826GlyGln: 0.826 ± 0.727
4.959GlyArg: 4.959 ± 0.378
4.959GlySer: 4.959 ± 1.866
4.132GlyThr: 4.132 ± 1.589
4.132GlyVal: 4.132 ± 2.193
0.826GlyTrp: 0.826 ± 0.57
0.826GlyTyr: 0.826 ± 0.727
0.0GlyXaa: 0.0 ± 0.0
His
2.479HisAla: 2.479 ± 1.491
0.0HisCys: 0.0 ± 0.0
0.826HisAsp: 0.826 ± 0.853
1.653HisGlu: 1.653 ± 1.141
0.0HisPhe: 0.0 ± 0.0
3.306HisGly: 3.306 ± 0.952
0.0HisHis: 0.0 ± 0.0
1.653HisIle: 1.653 ± 1.141
1.653HisLys: 1.653 ± 1.141
4.132HisLeu: 4.132 ± 1.589
0.826HisMet: 0.826 ± 0.615
0.0HisAsn: 0.0 ± 0.0
0.0HisPro: 0.0 ± 0.0
0.826HisGln: 0.826 ± 0.727
1.653HisArg: 1.653 ± 0.515
1.653HisSer: 1.653 ± 1.141
1.653HisThr: 1.653 ± 1.454
0.826HisVal: 0.826 ± 0.727
0.0HisTrp: 0.0 ± 0.0
0.826HisTyr: 0.826 ± 0.853
0.0HisXaa: 0.0 ± 0.0
Ile
3.306IleAla: 3.306 ± 1.3
0.0IleCys: 0.0 ± 0.0
3.306IleAsp: 3.306 ± 1.82
4.132IleGlu: 4.132 ± 0.446
1.653IlePhe: 1.653 ± 1.141
2.479IleGly: 2.479 ± 1.505
0.826IleHis: 0.826 ± 0.57
1.653IleIle: 1.653 ± 1.141
2.479IleLys: 2.479 ± 1.711
0.826IleLeu: 0.826 ± 0.727
1.653IleMet: 1.653 ± 1.141
4.132IleAsn: 4.132 ± 1.229
3.306IlePro: 3.306 ± 0.952
0.826IleGln: 0.826 ± 0.57
2.479IleArg: 2.479 ± 1.008
1.653IleSer: 1.653 ± 0.984
4.132IleThr: 4.132 ± 1.229
3.306IleVal: 3.306 ± 1.967
0.826IleTrp: 0.826 ± 0.727
2.479IleTyr: 2.479 ± 1.008
0.0IleXaa: 0.0 ± 0.0
Lys
1.653LysAla: 1.653 ± 0.515
0.0LysCys: 0.0 ± 0.0
2.479LysAsp: 2.479 ± 1.711
3.306LysGlu: 3.306 ± 1.479
3.306LysPhe: 3.306 ± 2.282
4.959LysGly: 4.959 ± 1.616
0.826LysHis: 0.826 ± 0.57
3.306LysIle: 3.306 ± 1.259
4.132LysLys: 4.132 ± 4.264
4.132LysLeu: 4.132 ± 0.446
1.653LysMet: 1.653 ± 1.141
0.826LysAsn: 0.826 ± 0.853
4.132LysPro: 4.132 ± 1.674
2.479LysGln: 2.479 ± 1.491
2.479LysArg: 2.479 ± 1.008
2.479LysSer: 2.479 ± 0.808
1.653LysThr: 1.653 ± 0.984
5.785LysVal: 5.785 ± 2.446
0.826LysTrp: 0.826 ± 0.57
0.0LysTyr: 0.0 ± 0.0
0.0LysXaa: 0.0 ± 0.0
Leu
9.091LeuAla: 9.091 ± 3.522
0.826LeuCys: 0.826 ± 0.57
5.785LeuAsp: 5.785 ± 0.48
4.132LeuGlu: 4.132 ± 2.852
3.306LeuPhe: 3.306 ± 2.907
6.612LeuGly: 6.612 ± 1.729
1.653LeuHis: 1.653 ± 1.454
3.306LeuIle: 3.306 ± 0.952
4.132LeuLys: 4.132 ± 0.98
7.438LeuLeu: 7.438 ± 1.338
4.132LeuMet: 4.132 ± 1.634
1.653LeuAsn: 1.653 ± 0.515
3.306LeuPro: 3.306 ± 1.3
2.479LeuGln: 2.479 ± 0.808
5.785LeuArg: 5.785 ± 1.678
7.438LeuSer: 7.438 ± 2.253
1.653LeuThr: 1.653 ± 0.984
4.959LeuVal: 4.959 ± 1.816
1.653LeuTrp: 1.653 ± 0.515
2.479LeuTyr: 2.479 ± 1.123
0.0LeuXaa: 0.0 ± 0.0
Met
0.0MetAla: 0.0 ± 0.0
0.826MetCys: 0.826 ± 0.57
0.826MetAsp: 0.826 ± 0.57
1.653MetGlu: 1.653 ± 0.739
1.653MetPhe: 1.653 ± 0.739
3.306MetGly: 3.306 ± 1.03
0.0MetHis: 0.0 ± 0.0
0.826MetIle: 0.826 ± 0.727
0.826MetLys: 0.826 ± 0.57
1.653MetLeu: 1.653 ± 0.739
0.0MetMet: 0.0 ± 0.0
1.653MetAsn: 1.653 ± 1.141
0.826MetPro: 0.826 ± 0.57
0.826MetGln: 0.826 ± 0.727
1.653MetArg: 1.653 ± 0.515
4.132MetSer: 4.132 ± 1.634
1.653MetThr: 1.653 ± 1.141
1.653MetVal: 1.653 ± 0.515
1.653MetTrp: 1.653 ± 0.739
1.653MetTyr: 1.653 ± 1.141
0.0MetXaa: 0.0 ± 0.0
Asn
4.132AsnAla: 4.132 ± 2.833
0.0AsnCys: 0.0 ± 0.0
3.306AsnAsp: 3.306 ± 1.462
1.653AsnGlu: 1.653 ± 1.454
0.0AsnPhe: 0.0 ± 0.0
3.306AsnGly: 3.306 ± 1.462
0.826AsnHis: 0.826 ± 0.853
0.826AsnIle: 0.826 ± 0.727
0.826AsnLys: 0.826 ± 0.57
3.306AsnLeu: 3.306 ± 0.282
2.479AsnMet: 2.479 ± 0.808
1.653AsnAsn: 1.653 ± 0.515
1.653AsnPro: 1.653 ± 0.515
0.826AsnGln: 0.826 ± 0.57
2.479AsnArg: 2.479 ± 2.558
4.132AsnSer: 4.132 ± 1.634
1.653AsnThr: 1.653 ± 0.515
3.306AsnVal: 3.306 ± 1.3
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.479ProAla: 2.479 ± 2.18
0.0ProCys: 0.0 ± 0.0
2.479ProAsp: 2.479 ± 0.808
0.826ProGlu: 0.826 ± 0.57
1.653ProPhe: 1.653 ± 1.454
2.479ProGly: 2.479 ± 1.123
0.0ProHis: 0.0 ± 0.0
2.479ProIle: 2.479 ± 0.808
1.653ProLys: 1.653 ± 0.739
2.479ProLeu: 2.479 ± 1.711
0.826ProMet: 0.826 ± 0.57
3.306ProAsn: 3.306 ± 1.03
0.826ProPro: 0.826 ± 0.57
0.826ProGln: 0.826 ± 0.57
5.785ProArg: 5.785 ± 1.905
3.306ProSer: 3.306 ± 0.952
4.132ProThr: 4.132 ± 0.446
4.959ProVal: 4.959 ± 2.218
2.479ProTrp: 2.479 ± 0.808
0.826ProTyr: 0.826 ± 0.57
0.0ProXaa: 0.0 ± 0.0
Gln
0.826GlnAla: 0.826 ± 0.57
1.653GlnCys: 1.653 ± 1.141
0.826GlnAsp: 0.826 ± 0.57
3.306GlnGlu: 3.306 ± 2.282
0.0GlnPhe: 0.0 ± 0.0
1.653GlnGly: 1.653 ± 0.515
1.653GlnHis: 1.653 ± 0.515
2.479GlnIle: 2.479 ± 1.123
0.826GlnLys: 0.826 ± 0.57
3.306GlnLeu: 3.306 ± 0.282
0.826GlnMet: 0.826 ± 0.853
0.0GlnAsn: 0.0 ± 0.0
2.479GlnPro: 2.479 ± 1.491
0.826GlnGln: 0.826 ± 0.853
2.479GlnArg: 2.479 ± 0.445
0.826GlnSer: 0.826 ± 0.727
0.826GlnThr: 0.826 ± 0.727
2.479GlnVal: 2.479 ± 0.445
0.0GlnTrp: 0.0 ± 0.0
2.479GlnTyr: 2.479 ± 1.711
0.0GlnXaa: 0.0 ± 0.0
Arg
3.306ArgAla: 3.306 ± 0.282
1.653ArgCys: 1.653 ± 0.739
4.959ArgAsp: 4.959 ± 1.866
3.306ArgGlu: 3.306 ± 0.282
4.132ArgPhe: 4.132 ± 1.839
3.306ArgGly: 3.306 ± 2.282
3.306ArgHis: 3.306 ± 0.282
3.306ArgIle: 3.306 ± 0.952
4.132ArgLys: 4.132 ± 1.977
8.264ArgLeu: 8.264 ± 1.364
0.826ArgMet: 0.826 ± 0.57
3.306ArgAsn: 3.306 ± 1.479
1.653ArgPro: 1.653 ± 1.141
1.653ArgGln: 1.653 ± 1.141
9.091ArgArg: 9.091 ± 6.109
5.785ArgSer: 5.785 ± 0.859
7.438ArgThr: 7.438 ± 2.258
4.132ArgVal: 4.132 ± 1.839
1.653ArgTrp: 1.653 ± 1.705
3.306ArgTyr: 3.306 ± 1.03
0.0ArgXaa: 0.0 ± 0.0
Ser
2.479SerAla: 2.479 ± 0.808
1.653SerCys: 1.653 ± 0.739
4.132SerAsp: 4.132 ± 2.104
4.132SerGlu: 4.132 ± 0.446
2.479SerPhe: 2.479 ± 0.808
6.612SerGly: 6.612 ± 1.855
1.653SerHis: 1.653 ± 0.515
3.306SerIle: 3.306 ± 0.952
5.785SerLys: 5.785 ± 0.48
6.612SerLeu: 6.612 ± 0.858
0.826SerMet: 0.826 ± 0.727
1.653SerAsn: 1.653 ± 1.454
4.959SerPro: 4.959 ± 0.89
3.306SerGln: 3.306 ± 1.259
6.612SerArg: 6.612 ± 0.728
9.917SerSer: 9.917 ± 4.862
6.612SerThr: 6.612 ± 3.782
5.785SerVal: 5.785 ± 3.067
0.826SerTrp: 0.826 ± 0.727
3.306SerTyr: 3.306 ± 2.907
0.0SerXaa: 0.0 ± 0.0
Thr
3.306ThrAla: 3.306 ± 2.149
0.826ThrCys: 0.826 ± 0.853
5.785ThrAsp: 5.785 ± 2.936
2.479ThrGlu: 2.479 ± 1.505
2.479ThrPhe: 2.479 ± 0.445
2.479ThrGly: 2.479 ± 0.445
2.479ThrHis: 2.479 ± 1.123
0.826ThrIle: 0.826 ± 0.727
4.132ThrLys: 4.132 ± 0.782
8.264ThrLeu: 8.264 ± 2.795
1.653ThrMet: 1.653 ± 1.141
1.653ThrAsn: 1.653 ± 0.984
1.653ThrPro: 1.653 ± 1.141
0.826ThrGln: 0.826 ± 0.727
1.653ThrArg: 1.653 ± 1.454
5.785ThrSer: 5.785 ± 0.855
8.264ThrThr: 8.264 ± 3.177
4.132ThrVal: 4.132 ± 1.634
0.826ThrTrp: 0.826 ± 0.727
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
4.959ValAla: 4.959 ± 1.816
0.0ValCys: 0.0 ± 0.0
4.132ValAsp: 4.132 ± 1.229
5.785ValGlu: 5.785 ± 1.905
2.479ValPhe: 2.479 ± 1.711
8.264ValGly: 8.264 ± 1.961
1.653ValHis: 1.653 ± 1.705
3.306ValIle: 3.306 ± 2.282
3.306ValLys: 3.306 ± 1.462
4.132ValLeu: 4.132 ± 3.152
2.479ValMet: 2.479 ± 1.008
4.132ValAsn: 4.132 ± 0.446
0.0ValPro: 0.0 ± 0.0
2.479ValGln: 2.479 ± 0.808
6.612ValArg: 6.612 ± 0.728
9.091ValSer: 9.091 ± 3.975
2.479ValThr: 2.479 ± 0.445
4.959ValVal: 4.959 ± 2.954
0.0ValTrp: 0.0 ± 0.0
1.653ValTyr: 1.653 ± 1.141
0.0ValXaa: 0.0 ± 0.0
Trp
1.653TrpAla: 1.653 ± 0.984
0.0TrpCys: 0.0 ± 0.0
0.826TrpAsp: 0.826 ± 0.57
0.0TrpGlu: 0.0 ± 0.0
1.653TrpPhe: 1.653 ± 0.739
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.826TrpIle: 0.826 ± 0.57
0.826TrpLys: 0.826 ± 0.853
3.306TrpLeu: 3.306 ± 0.952
0.826TrpMet: 0.826 ± 0.727
0.0TrpAsn: 0.0 ± 0.0
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
2.479TrpArg: 2.479 ± 1.123
1.653TrpSer: 1.653 ± 1.141
1.653TrpThr: 1.653 ± 0.515
0.826TrpVal: 0.826 ± 0.727
0.0TrpTrp: 0.0 ± 0.0
0.826TrpTyr: 0.826 ± 0.853
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.653TyrAla: 1.653 ± 0.515
0.0TyrCys: 0.0 ± 0.0
4.959TyrAsp: 4.959 ± 1.173
1.653TyrGlu: 1.653 ± 1.454
2.479TyrPhe: 2.479 ± 0.808
1.653TyrGly: 1.653 ± 1.141
2.479TyrHis: 2.479 ± 0.808
1.653TyrIle: 1.653 ± 0.515
1.653TyrLys: 1.653 ± 1.141
1.653TyrLeu: 1.653 ± 0.984
0.0TyrMet: 0.0 ± 0.0
0.0TyrAsn: 0.0 ± 0.0
0.0TyrPro: 0.0 ± 0.0
0.826TyrGln: 0.826 ± 0.853
4.959TyrArg: 4.959 ± 2.392
1.653TyrSer: 1.653 ± 0.515
1.653TyrThr: 1.653 ± 0.515
0.826TyrVal: 0.826 ± 0.57
0.0TyrTrp: 0.0 ± 0.0
1.653TyrTyr: 1.653 ± 1.141
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1211 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
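A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the spread of the replicates is reported as the error. The function names, the use of the sample standard deviation as the error measure, the fixed seed, and the toy sequences are all assumptions; the page does not specify the exact estimator.

```python
import random
import statistics
from collections import Counter

def pair_fraction(proteins, pair):
    """Fraction of all overlapping dipeptides in `proteins` equal to `pair`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of `pair_fraction` over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_fraction(sample, pair))
    return statistics.stdev(estimates)

# Toy proteome (assumption), one-letter codes.
proteins = ["MKVLAAGKAA", "AASSTT", "GGAALLK"]
point = pair_fraction(proteins, "AA")      # 4 of 20 pairs
error = bootstrap_error(proteins, "AA")    # spread across 100 resamples
print(f"AA: {point * 1000:.3f} ± {error * 1000:.3f} per mille")
```

With only three proteins, as in this proteome, each resample can differ strongly from the original set, which is consistent with the relatively large errors seen for some dipeptides in the table.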

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski