Amino acid dipeptide frequency for Wenzhou picorna-like virus 41

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
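The calculation described above can be sketched in Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_permille`, the one-letter alphabet and the toy sequences are assumptions made for the example, and counting overlapping pairs within each protein is an assumed convention.

```python
from collections import Counter
from itertools import product

# One-letter codes in the same order as the table header; "X" stands for Xaa.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

def dipeptide_permille(sequences, alphabet=STANDARD + "X"):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Anything outside the 20 standard letters is treated as Xaa ("X").
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Pairs are counted within each sequence, never across two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {a + b: (1000.0 * counts[a + b] / total if total else 0.0)
            for a, b in product(alphabet, repeat=2)}

# Toy sequences (hypothetical, not the viral proteins).
freqs = dipeptide_permille(["MKTAYIAKQR", "MKLV"])
expected = 1000.0 / 400  # uniform expectation: 2.5 per mille (0.25%)
over = sorted(dp for dp, f in freqs.items() if f > expected)
```

Here `over` lists the dipeptides that exceed the uniform expectation, which is the comparison behind the red marking; swapping `>` for `<` gives the underrepresented (blue) ones.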

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
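As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction and into an approximate number of occurrences. The helper names are hypothetical, and the total of 2257 dipeptide positions is an assumption derived from the page's own statistics (2 proteins, 2259 amino acids) under the convention that pairs overlap and do not span two proteins.

```python
def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def permille_to_count(value_permille, n_dipeptides):
    """Approximate number of occurrences behind a per mille value."""
    return round(value_permille * 1e-3 * n_dipeptides)

# Assumption: each protein of length L contributes L - 1 overlapping pairs,
# so 2259 residues in 2 proteins give 2259 - 2 = 2257 dipeptide positions.
n_pairs = 2259 - 2

single = permille_to_count(0.443, n_pairs)   # smallest non-zero table value
largest = permille_to_count(8.857, n_pairs)  # LeuThr, the largest table value
```

Under this assumption the smallest non-zero value in the table, 0.443‰, corresponds to a single occurrence, which is worth keeping in mind when reading the low-frequency entries of such a small proteome.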

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.1 ± 0.912
AlaCys: 0.886 ± 0.465
AlaAsp: 2.657 ± 0.51
AlaGlu: 3.543 ± 0.046
AlaPhe: 3.1 ± 1.626
AlaGly: 5.314 ± 0.249
AlaHis: 0.443 ± 0.402
AlaIle: 3.543 ± 0.589
AlaLys: 3.986 ± 0.187
AlaLeu: 6.2 ± 1.983
AlaMet: 0.886 ± 0.465
AlaAsn: 4.429 ± 1.485
AlaPro: 4.872 ± 2.522
AlaGln: 2.214 ± 0.742
AlaArg: 1.771 ± 0.34
AlaSer: 2.657 ± 1.779
AlaThr: 3.543 ± 0.68
AlaVal: 3.986 ± 0.187
AlaTrp: 0.886 ± 0.17
AlaTyr: 1.329 ± 0.062
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.886 ± 0.17
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.443 ± 0.232
CysPhe: 0.0 ± 0.0
CysGly: 0.886 ± 0.17
CysHis: 1.329 ± 0.697
CysIle: 2.214 ± 1.161
CysLys: 0.886 ± 0.465
CysLeu: 1.329 ± 0.062
CysMet: 0.0 ± 0.0
CysAsn: 0.443 ± 0.232
CysPro: 0.886 ± 0.805
CysGln: 1.329 ± 0.062
CysArg: 0.886 ± 0.465
CysSer: 1.329 ± 0.697
CysThr: 0.886 ± 0.17
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 0.443 ± 0.402
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.1 ± 0.357
AspCys: 0.886 ± 0.465
AspAsp: 3.986 ± 1.456
AspGlu: 4.429 ± 0.216
AspPhe: 5.314 ± 0.884
AspGly: 3.1 ± 0.278
AspHis: 1.329 ± 0.572
AspIle: 3.543 ± 0.68
AspLys: 2.214 ± 0.527
AspLeu: 4.872 ± 1.286
AspMet: 1.771 ± 0.929
AspAsn: 0.886 ± 0.465
AspPro: 2.214 ± 0.527
AspGln: 0.886 ± 0.17
AspArg: 1.771 ± 0.295
AspSer: 2.657 ± 0.124
AspThr: 4.872 ± 0.017
AspVal: 3.1 ± 1.626
AspTrp: 1.771 ± 0.929
AspTyr: 2.657 ± 0.759
AspXaa: 0.0 ± 0.0
Glu
GluAla: 2.657 ± 0.759
GluCys: 1.329 ± 0.697
GluAsp: 3.1 ± 0.357
GluGlu: 3.1 ± 0.357
GluPhe: 5.314 ± 1.518
GluGly: 3.543 ± 0.046
GluHis: 1.329 ± 0.572
GluIle: 3.986 ± 1.082
GluLys: 1.771 ± 0.295
GluLeu: 5.757 ± 0.481
GluMet: 2.214 ± 0.108
GluAsn: 2.214 ± 1.161
GluPro: 2.214 ± 0.527
GluGln: 1.771 ± 0.34
GluArg: 2.214 ± 0.527
GluSer: 3.1 ± 0.912
GluThr: 1.771 ± 0.295
GluVal: 2.657 ± 0.759
GluTrp: 0.443 ± 0.232
GluTyr: 1.329 ± 0.062
GluXaa: 0.0 ± 0.0
Phe
PheAla: 1.329 ± 0.062
PheCys: 0.886 ± 0.465
PheAsp: 3.1 ± 0.278
PheGlu: 3.543 ± 0.046
PhePhe: 2.657 ± 0.124
PheGly: 2.214 ± 0.108
PheHis: 0.886 ± 0.465
PheIle: 1.771 ± 0.34
PheLys: 3.543 ± 1.224
PheLeu: 3.986 ± 0.821
PheMet: 0.886 ± 0.465
PheAsn: 5.314 ± 0.249
PhePro: 1.771 ± 0.34
PheGln: 0.886 ± 0.17
PheArg: 3.986 ± 0.821
PheSer: 8.415 ± 1.875
PheThr: 2.657 ± 0.759
PheVal: 5.757 ± 1.75
PheTrp: 0.0 ± 0.0
PheTyr: 3.1 ± 0.278
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 2.657 ± 0.51
GlyCys: 0.886 ± 0.805
GlyAsp: 4.872 ± 0.651
GlyGlu: 0.886 ± 0.465
GlyPhe: 3.986 ± 0.187
GlyGly: 3.1 ± 2.816
GlyHis: 0.443 ± 0.402
GlyIle: 3.1 ± 0.912
GlyLys: 6.2 ± 2.617
GlyLeu: 4.872 ± 0.651
GlyMet: 1.771 ± 0.929
GlyAsn: 3.543 ± 1.949
GlyPro: 1.771 ± 0.34
GlyGln: 1.329 ± 0.062
GlyArg: 1.329 ± 1.207
GlySer: 4.872 ± 1.252
GlyThr: 4.872 ± 1.887
GlyVal: 3.986 ± 0.187
GlyTrp: 0.886 ± 0.17
GlyTyr: 2.214 ± 0.742
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.329 ± 0.062
HisCys: 0.0 ± 0.0
HisAsp: 0.886 ± 0.465
HisGlu: 0.443 ± 0.232
HisPhe: 0.886 ± 0.465
HisGly: 1.771 ± 0.975
HisHis: 0.886 ± 0.17
HisIle: 1.329 ± 0.062
HisLys: 0.886 ± 0.465
HisLeu: 0.443 ± 0.232
HisMet: 0.0 ± 0.0
HisAsn: 0.443 ± 0.402
HisPro: 0.886 ± 0.465
HisGln: 0.0 ± 0.0
HisArg: 2.657 ± 0.759
HisSer: 2.214 ± 0.527
HisThr: 2.214 ± 0.108
HisVal: 1.771 ± 0.34
HisTrp: 0.0 ± 0.0
HisTyr: 1.771 ± 0.295
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.314 ± 0.884
IleCys: 0.443 ± 0.232
IleAsp: 3.543 ± 1.224
IleGlu: 3.986 ± 1.082
IlePhe: 4.429 ± 0.419
IleGly: 4.429 ± 0.85
IleHis: 1.771 ± 0.295
IleIle: 6.2 ± 0.714
IleLys: 2.657 ± 1.394
IleLeu: 4.872 ± 0.651
IleMet: 1.329 ± 0.572
IleAsn: 5.314 ± 1.655
IlePro: 1.771 ± 0.975
IleGln: 1.771 ± 0.295
IleArg: 2.657 ± 1.145
IleSer: 8.415 ± 0.029
IleThr: 3.1 ± 0.278
IleVal: 5.757 ± 0.481
IleTrp: 0.0 ± 0.0
IleTyr: 0.443 ± 0.232
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.329 ± 0.697
LysCys: 0.0 ± 0.0
LysAsp: 6.643 ± 2.215
LysGlu: 4.429 ± 1.688
LysPhe: 3.1 ± 0.912
LysGly: 2.657 ± 0.124
LysHis: 1.771 ± 0.929
LysIle: 5.314 ± 2.153
LysLys: 3.986 ± 1.456
LysLeu: 3.986 ± 0.187
LysMet: 2.657 ± 1.394
LysAsn: 4.429 ± 2.323
LysPro: 0.443 ± 0.402
LysGln: 1.329 ± 0.572
LysArg: 1.771 ± 0.929
LysSer: 4.429 ± 1.688
LysThr: 2.657 ± 0.759
LysVal: 3.1 ± 0.357
LysTrp: 0.443 ± 0.232
LysTyr: 4.872 ± 1.92
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 4.429 ± 1.485
LeuCys: 1.329 ± 0.697
LeuAsp: 3.543 ± 0.589
LeuGlu: 3.1 ± 0.278
LeuPhe: 6.2 ± 1.348
LeuGly: 4.872 ± 0.618
LeuHis: 2.214 ± 0.527
LeuIle: 3.986 ± 0.448
LeuLys: 5.757 ± 1.116
LeuLeu: 3.543 ± 0.046
LeuMet: 2.657 ± 0.51
LeuAsn: 2.657 ± 0.51
LeuPro: 2.214 ± 0.108
LeuGln: 3.543 ± 1.224
LeuArg: 3.543 ± 0.589
LeuSer: 5.314 ± 0.249
LeuThr: 8.857 ± 1.066
LeuVal: 6.2 ± 0.556
LeuTrp: 0.443 ± 0.232
LeuTyr: 3.1 ± 0.912
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.771 ± 0.34
MetCys: 0.0 ± 0.0
MetAsp: 0.886 ± 0.17
MetGlu: 1.771 ± 0.929
MetPhe: 0.443 ± 0.402
MetGly: 0.443 ± 0.402
MetHis: 0.443 ± 0.232
MetIle: 1.771 ± 0.34
MetLys: 2.657 ± 0.124
MetLeu: 2.214 ± 1.161
MetMet: 1.329 ± 0.572
MetAsn: 3.1 ± 0.991
MetPro: 0.886 ± 0.465
MetGln: 1.329 ± 0.062
MetArg: 1.329 ± 0.572
MetSer: 4.429 ± 0.85
MetThr: 3.543 ± 0.589
MetVal: 1.771 ± 0.295
MetTrp: 0.443 ± 0.232
MetTyr: 1.329 ± 1.207
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.1 ± 0.912
AsnCys: 0.443 ± 0.232
AsnAsp: 2.214 ± 0.527
AsnGlu: 1.771 ± 0.295
AsnPhe: 3.543 ± 0.046
AsnGly: 2.657 ± 0.124
AsnHis: 0.0 ± 0.0
AsnIle: 4.872 ± 0.651
AsnLys: 2.657 ± 0.759
AsnLeu: 3.543 ± 1.315
AsnMet: 2.657 ± 0.759
AsnAsn: 2.214 ± 0.108
AsnPro: 2.657 ± 1.145
AsnGln: 2.657 ± 0.759
AsnArg: 2.657 ± 1.145
AsnSer: 4.872 ± 0.017
AsnThr: 5.757 ± 2.692
AsnVal: 3.986 ± 0.448
AsnTrp: 0.443 ± 0.402
AsnTyr: 1.771 ± 0.929
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 1.329 ± 0.062
ProCys: 0.443 ± 0.232
ProAsp: 0.886 ± 0.17
ProGlu: 0.886 ± 0.465
ProPhe: 0.886 ± 0.17
ProGly: 0.886 ± 0.17
ProHis: 0.443 ± 0.402
ProIle: 3.543 ± 0.68
ProLys: 3.1 ± 0.991
ProLeu: 5.314 ± 1.655
ProMet: 0.886 ± 0.805
ProAsn: 3.986 ± 2.986
ProPro: 3.986 ± 1.717
ProGln: 0.886 ± 0.805
ProArg: 0.886 ± 0.465
ProSer: 1.771 ± 0.34
ProThr: 4.429 ± 0.419
ProVal: 4.429 ± 1.485
ProTrp: 0.886 ± 0.17
ProTyr: 3.543 ± 1.949
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.1 ± 0.278
GlnCys: 0.443 ± 0.402
GlnAsp: 3.1 ± 0.278
GlnGlu: 3.543 ± 0.046
GlnPhe: 0.443 ± 0.232
GlnGly: 2.214 ± 1.161
GlnHis: 0.886 ± 0.465
GlnIle: 3.1 ± 0.357
GlnLys: 0.886 ± 0.465
GlnLeu: 3.1 ± 0.278
GlnMet: 0.0 ± 0.0
GlnAsn: 1.329 ± 1.207
GlnPro: 1.329 ± 1.207
GlnGln: 1.771 ± 0.34
GlnArg: 1.329 ± 0.697
GlnSer: 2.214 ± 0.742
GlnThr: 0.886 ± 0.805
GlnVal: 0.443 ± 0.232
GlnTrp: 0.886 ± 0.465
GlnTyr: 1.329 ± 0.572
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.329 ± 0.572
ArgCys: 0.443 ± 0.232
ArgAsp: 0.886 ± 0.17
ArgGlu: 3.986 ± 0.187
ArgPhe: 2.657 ± 1.145
ArgGly: 1.329 ± 0.062
ArgHis: 0.443 ± 0.232
ArgIle: 3.543 ± 0.589
ArgLys: 2.657 ± 1.394
ArgLeu: 2.214 ± 0.108
ArgMet: 2.657 ± 0.51
ArgAsn: 2.214 ± 0.108
ArgPro: 1.329 ± 0.062
ArgGln: 0.886 ± 0.17
ArgArg: 1.771 ± 0.295
ArgSer: 3.986 ± 0.448
ArgThr: 1.771 ± 0.295
ArgVal: 5.757 ± 0.788
ArgTrp: 0.443 ± 0.232
ArgTyr: 2.214 ± 0.108
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.429 ± 0.216
SerCys: 1.329 ± 0.572
SerAsp: 5.314 ± 0.884
SerGlu: 3.986 ± 0.187
SerPhe: 3.1 ± 0.357
SerGly: 6.2 ± 0.714
SerHis: 1.329 ± 0.572
SerIle: 7.086 ± 0.091
SerLys: 3.986 ± 0.187
SerLeu: 5.314 ± 1.655
SerMet: 3.1 ± 0.912
SerAsn: 1.771 ± 0.295
SerPro: 5.757 ± 2.057
SerGln: 3.986 ± 0.187
SerArg: 4.872 ± 0.618
SerSer: 4.429 ± 0.216
SerThr: 5.757 ± 0.788
SerVal: 5.757 ± 0.788
SerTrp: 0.886 ± 0.465
SerTyr: 1.329 ± 0.572
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.2 ± 0.556
ThrCys: 0.443 ± 0.232
ThrAsp: 4.872 ± 0.651
ThrGlu: 3.986 ± 0.821
ThrPhe: 2.214 ± 1.161
ThrGly: 3.986 ± 1.717
ThrHis: 0.886 ± 0.465
ThrIle: 5.314 ± 1.02
ThrLys: 2.657 ± 0.759
ThrLeu: 6.2 ± 0.079
ThrMet: 2.214 ± 1.377
ThrAsn: 3.543 ± 0.046
ThrPro: 3.543 ± 1.315
ThrGln: 1.771 ± 1.609
ThrArg: 2.214 ± 0.742
ThrSer: 4.429 ± 2.119
ThrThr: 7.086 ± 0.726
ThrVal: 5.757 ± 0.788
ThrTrp: 0.886 ± 0.17
ThrTyr: 2.657 ± 0.124
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.972 ± 0.896
ValCys: 2.657 ± 0.124
ValAsp: 2.657 ± 0.124
ValGlu: 0.886 ± 0.17
ValPhe: 4.429 ± 1.054
ValGly: 4.872 ± 0.618
ValHis: 2.214 ± 0.108
ValIle: 1.329 ± 0.062
ValLys: 4.429 ± 1.054
ValLeu: 4.872 ± 0.651
ValMet: 3.543 ± 0.589
ValAsn: 3.986 ± 0.821
ValPro: 4.429 ± 0.216
ValGln: 2.214 ± 0.108
ValArg: 2.214 ± 0.742
ValSer: 7.086 ± 0.726
ValThr: 3.543 ± 0.68
ValVal: 5.757 ± 0.153
ValTrp: 0.0 ± 0.0
ValTyr: 3.543 ± 0.046
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.443 ± 0.232
TrpCys: 0.0 ± 0.0
TrpAsp: 0.886 ± 0.465
TrpGlu: 0.886 ± 0.465
TrpPhe: 0.443 ± 0.232
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.886 ± 0.465
TrpLys: 0.886 ± 0.465
TrpLeu: 0.443 ± 0.232
TrpMet: 0.0 ± 0.0
TrpAsn: 1.329 ± 0.062
TrpPro: 0.0 ± 0.0
TrpGln: 0.443 ± 0.232
TrpArg: 0.443 ± 0.232
TrpSer: 0.443 ± 0.402
TrpThr: 0.443 ± 0.402
TrpVal: 0.886 ± 0.17
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.886 ± 0.17
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 3.543 ± 0.046
TyrCys: 1.329 ± 0.062
TyrAsp: 1.329 ± 0.697
TyrGlu: 2.214 ± 0.527
TyrPhe: 3.543 ± 0.589
TyrGly: 3.1 ± 0.912
TyrHis: 1.771 ± 0.929
TyrIle: 1.771 ± 0.975
TyrLys: 3.543 ± 1.224
TyrLeu: 4.429 ± 3.388
TyrMet: 0.886 ± 0.147
TyrAsn: 1.329 ± 0.697
TyrPro: 0.443 ± 0.232
TyrGln: 1.329 ± 0.062
TyrArg: 2.214 ± 0.108
TyrSer: 2.657 ± 1.145
TyrThr: 2.214 ± 0.742
TyrVal: 1.771 ± 0.34
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.214 ± 1.377
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2259 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
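One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the 100 estimates is reported as the error. The exact procedure used by the database is not described here, so the function names, the use of the standard deviation as the error measure, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_permille(sequences, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs."""
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(sequences, pair, n_rounds=100, seed=0):
    """Spread of the frequency when whole proteins are resampled with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        resampled = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_permille(resampled, pair))
    # Assumption: the error is the standard deviation of the estimates.
    return statistics.pstdev(estimates)

# Toy sequences (hypothetical, not the viral proteins).
toy = ["MKTAYIAKQR", "MKLV"]
err = bootstrap_error(toy, "MK")
```

With only two proteins, as in this proteome, each resample is one of just a few combinations, so the estimated errors are coarse and can be large relative to the values themselves.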

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski