Amino acid dipepetide frequency for Wenzhou qinvirus-like virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
15.745AlaAla: 15.745 ± 7.007
2.128AlaCys: 2.128 ± 1.064
8.936AlaAsp: 8.936 ± 1.28
6.809AlaGlu: 6.809 ± 1.279
3.404AlaPhe: 3.404 ± 2.549
7.234AlaGly: 7.234 ± 0.634
1.277AlaHis: 1.277 ± 0.424
5.532AlaIle: 5.532 ± 1.704
5.532AlaLys: 5.532 ± 1.704
9.787AlaLeu: 9.787 ± 3.609
5.532AlaMet: 5.532 ± 1.39
5.106AlaAsn: 5.106 ± 1.698
6.809AlaPro: 6.809 ± 2.973
3.404AlaGln: 3.404 ± 1.486
11.064AlaArg: 11.064 ± 1.281
8.936AlaSer: 8.936 ± 1.28
10.213AlaThr: 10.213 ± 2.333
10.638AlaVal: 10.638 ± 0.006
2.553AlaTrp: 2.553 ± 1.277
2.553AlaTyr: 2.553 ± 2.975
0.0AlaXaa: 0.0 ± 0.0
Cys
0.426CysAla: 0.426 ± 0.213
0.0CysCys: 0.0 ± 0.0
0.851CysAsp: 0.851 ± 0.637
1.702CysGlu: 1.702 ± 0.212
0.426CysPhe: 0.426 ± 0.213
0.851CysGly: 0.851 ± 0.426
0.426CysHis: 0.426 ± 0.213
0.851CysIle: 0.851 ± 0.426
0.851CysLys: 0.851 ± 0.426
2.979CysLeu: 2.979 ± 1.49
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.426CysGln: 0.426 ± 0.213
3.83CysArg: 3.83 ± 0.211
0.851CysSer: 0.851 ± 0.426
0.0CysThr: 0.0 ± 0.0
2.128CysVal: 2.128 ± 0.001
0.0CysTrp: 0.0 ± 0.0
0.426CysTyr: 0.426 ± 0.85
0.0CysXaa: 0.0 ± 0.0
Asp
5.532AspAla: 5.532 ± 0.641
0.851AspCys: 0.851 ± 0.426
4.681AspAsp: 4.681 ± 0.848
2.979AspGlu: 2.979 ± 1.699
1.702AspPhe: 1.702 ± 0.851
3.404AspGly: 3.404 ± 1.702
0.426AspHis: 0.426 ± 0.213
6.809AspIle: 6.809 ± 0.216
2.553AspLys: 2.553 ± 0.214
6.809AspLeu: 6.809 ± 2.973
0.851AspMet: 0.851 ± 0.426
0.851AspAsn: 0.851 ± 0.426
5.957AspPro: 5.957 ± 0.209
0.426AspGln: 0.426 ± 0.213
4.681AspArg: 4.681 ± 0.215
2.979AspSer: 2.979 ± 0.427
4.681AspThr: 4.681 ± 0.215
5.957AspVal: 5.957 ± 2.979
1.277AspTrp: 1.277 ± 0.424
1.277AspTyr: 1.277 ± 0.638
0.0AspXaa: 0.0 ± 0.0
Glu
2.979GluAla: 2.979 ± 0.636
1.277GluCys: 1.277 ± 0.638
4.255GluAsp: 4.255 ± 1.061
4.681GluGlu: 4.681 ± 1.911
2.128GluPhe: 2.128 ± 1.064
2.979GluGly: 2.979 ± 0.636
2.128GluHis: 2.128 ± 1.064
2.553GluIle: 2.553 ± 0.214
0.851GluLys: 0.851 ± 0.637
4.681GluLeu: 4.681 ± 0.848
1.277GluMet: 1.277 ± 0.638
1.702GluAsn: 1.702 ± 0.851
1.277GluPro: 1.277 ± 0.638
0.426GluGln: 0.426 ± 0.213
2.979GluArg: 2.979 ± 0.427
4.255GluSer: 4.255 ± 2.128
3.404GluThr: 3.404 ± 0.423
5.532GluVal: 5.532 ± 3.611
1.702GluTrp: 1.702 ± 0.851
2.553GluTyr: 2.553 ± 0.214
0.0GluXaa: 0.0 ± 0.0
Phe
5.106PheAla: 5.106 ± 0.635
0.851PheCys: 0.851 ± 0.426
2.553PheAsp: 2.553 ± 0.849
2.128PheGlu: 2.128 ± 0.001
1.702PhePhe: 1.702 ± 0.851
3.404PheGly: 3.404 ± 1.702
1.702PheHis: 1.702 ± 0.851
1.277PheIle: 1.277 ± 0.638
1.702PheLys: 1.702 ± 0.851
2.979PheLeu: 2.979 ± 0.427
0.426PheMet: 0.426 ± 0.213
0.426PheAsn: 0.426 ± 0.213
0.851PhePro: 0.851 ± 0.637
0.426PheGln: 0.426 ± 0.213
1.277PheArg: 1.277 ± 0.638
1.702PheSer: 1.702 ± 1.275
2.553PheThr: 2.553 ± 1.277
2.553PheVal: 2.553 ± 0.214
1.702PheTrp: 1.702 ± 0.212
0.426PheTyr: 0.426 ± 0.213
0.0PheXaa: 0.0 ± 0.0
Gly
6.809GlyAla: 6.809 ± 1.279
0.426GlyCys: 0.426 ± 0.213
2.128GlyAsp: 2.128 ± 1.064
1.702GlyGlu: 1.702 ± 0.212
4.255GlyPhe: 4.255 ± 2.128
2.553GlyGly: 2.553 ± 1.912
2.128GlyHis: 2.128 ± 0.001
0.851GlyIle: 0.851 ± 0.426
1.702GlyLys: 1.702 ± 1.275
4.681GlyLeu: 4.681 ± 0.215
1.702GlyMet: 1.702 ± 0.212
1.277GlyAsn: 1.277 ± 0.424
1.702GlyPro: 1.702 ± 0.212
2.979GlyGln: 2.979 ± 0.636
2.128GlyArg: 2.128 ± 1.062
2.553GlySer: 2.553 ± 1.912
6.383GlyThr: 6.383 ± 1.066
4.681GlyVal: 4.681 ± 0.848
1.277GlyTrp: 1.277 ± 0.638
2.979GlyTyr: 2.979 ± 1.49
0.0GlyXaa: 0.0 ± 0.0
His
1.702HisAla: 1.702 ± 0.851
0.426HisCys: 0.426 ± 0.213
2.553HisAsp: 2.553 ± 1.277
0.0HisGlu: 0.0 ± 0.0
2.128HisPhe: 2.128 ± 1.064
1.277HisGly: 1.277 ± 0.638
1.277HisHis: 1.277 ± 0.424
1.277HisIle: 1.277 ± 0.424
1.277HisLys: 1.277 ± 0.638
2.553HisLeu: 2.553 ± 1.277
2.128HisMet: 2.128 ± 0.001
0.0HisAsn: 0.0 ± 0.0
1.277HisPro: 1.277 ± 0.638
0.426HisGln: 0.426 ± 0.213
1.702HisArg: 1.702 ± 0.212
1.277HisSer: 1.277 ± 0.424
2.128HisThr: 2.128 ± 1.062
1.702HisVal: 1.702 ± 0.212
0.851HisTrp: 0.851 ± 0.637
0.426HisTyr: 0.426 ± 0.213
0.0HisXaa: 0.0 ± 0.0
Ile
9.362IleAla: 9.362 ± 3.619
0.0IleCys: 0.0 ± 0.0
2.553IleAsp: 2.553 ± 1.277
2.128IleGlu: 2.128 ± 1.064
2.553IlePhe: 2.553 ± 1.277
2.553IleGly: 2.553 ± 0.214
0.851IleHis: 0.851 ± 0.637
1.277IleIle: 1.277 ± 0.638
0.851IleLys: 0.851 ± 0.426
2.128IleLeu: 2.128 ± 0.001
0.426IleMet: 0.426 ± 0.213
0.851IleAsn: 0.851 ± 0.637
1.702IlePro: 1.702 ± 0.212
0.0IleGln: 0.0 ± 0.0
5.532IleArg: 5.532 ± 1.704
4.681IleSer: 4.681 ± 1.911
2.553IleThr: 2.553 ± 1.277
2.979IleVal: 2.979 ± 0.636
0.851IleTrp: 0.851 ± 0.426
0.851IleTyr: 0.851 ± 0.426
0.0IleXaa: 0.0 ± 0.0
Lys
3.404LysAla: 3.404 ± 1.486
1.277LysCys: 1.277 ± 0.638
3.404LysAsp: 3.404 ± 0.423
2.553LysGlu: 2.553 ± 0.849
1.277LysPhe: 1.277 ± 0.424
0.426LysGly: 0.426 ± 0.213
0.0LysHis: 0.0 ± 0.0
2.979LysIle: 2.979 ± 0.427
0.0LysLys: 0.0 ± 0.0
2.128LysLeu: 2.128 ± 1.064
1.277LysMet: 1.277 ± 0.638
0.851LysAsn: 0.851 ± 0.426
1.702LysPro: 1.702 ± 0.851
0.426LysGln: 0.426 ± 0.85
3.404LysArg: 3.404 ± 0.64
2.979LysSer: 2.979 ± 1.699
1.277LysThr: 1.277 ± 0.638
3.404LysVal: 3.404 ± 0.423
0.426LysTrp: 0.426 ± 0.85
0.851LysTyr: 0.851 ± 0.426
0.0LysXaa: 0.0 ± 0.0
Leu
13.617LeuAla: 13.617 ± 4.882
0.851LeuCys: 0.851 ± 0.637
4.681LeuAsp: 4.681 ± 0.215
4.255LeuGlu: 4.255 ± 1.065
3.83LeuPhe: 3.83 ± 2.336
4.681LeuGly: 4.681 ± 1.278
1.277LeuHis: 1.277 ± 0.638
2.128LeuIle: 2.128 ± 0.001
4.255LeuLys: 4.255 ± 3.187
5.106LeuLeu: 5.106 ± 0.635
3.404LeuMet: 3.404 ± 0.423
2.128LeuAsn: 2.128 ± 0.001
4.681LeuPro: 4.681 ± 2.341
1.702LeuGln: 1.702 ± 0.851
4.255LeuArg: 4.255 ± 1.061
6.809LeuSer: 6.809 ± 1.279
2.979LeuThr: 2.979 ± 0.636
6.809LeuVal: 6.809 ± 1.279
1.702LeuTrp: 1.702 ± 1.275
2.979LeuTyr: 2.979 ± 1.49
0.0LeuXaa: 0.0 ± 0.0
Met
5.957MetAla: 5.957 ± 2.335
1.277MetCys: 1.277 ± 0.638
3.404MetAsp: 3.404 ± 0.64
3.404MetGlu: 3.404 ± 0.64
0.426MetPhe: 0.426 ± 0.213
2.979MetGly: 2.979 ± 0.636
0.426MetHis: 0.426 ± 0.213
0.426MetIle: 0.426 ± 0.213
0.426MetLys: 0.426 ± 0.213
2.128MetLeu: 2.128 ± 0.001
1.277MetMet: 1.277 ± 1.487
0.0MetAsn: 0.0 ± 0.0
1.277MetPro: 1.277 ± 0.424
0.426MetGln: 0.426 ± 0.213
2.553MetArg: 2.553 ± 0.849
1.702MetSer: 1.702 ± 0.851
1.277MetThr: 1.277 ± 0.424
3.404MetVal: 3.404 ± 1.702
0.851MetTrp: 0.851 ± 0.426
0.851MetTyr: 0.851 ± 0.426
0.0MetXaa: 0.0 ± 0.0
Asn
2.979AsnAla: 2.979 ± 0.427
0.426AsnCys: 0.426 ± 0.213
2.553AsnAsp: 2.553 ± 1.277
1.277AsnGlu: 1.277 ± 0.424
0.851AsnPhe: 0.851 ± 0.637
1.702AsnGly: 1.702 ± 0.212
0.851AsnHis: 0.851 ± 0.637
1.277AsnIle: 1.277 ± 0.638
0.0AsnLys: 0.0 ± 0.0
1.702AsnLeu: 1.702 ± 0.851
2.553AsnMet: 2.553 ± 1.277
1.702AsnAsn: 1.702 ± 0.212
0.851AsnPro: 0.851 ± 0.637
0.0AsnGln: 0.0 ± 0.0
2.553AsnArg: 2.553 ± 0.214
0.0AsnSer: 0.0 ± 0.0
2.553AsnThr: 2.553 ± 0.214
2.128AsnVal: 2.128 ± 3.188
1.277AsnTrp: 1.277 ± 0.638
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
7.66ProAla: 7.66 ± 2.547
0.426ProCys: 0.426 ± 0.213
1.702ProAsp: 1.702 ± 0.851
0.851ProGlu: 0.851 ± 0.637
0.851ProPhe: 0.851 ± 0.426
3.83ProGly: 3.83 ± 1.273
2.128ProHis: 2.128 ± 1.064
2.128ProIle: 2.128 ± 0.001
0.426ProLys: 0.426 ± 0.213
3.404ProLeu: 3.404 ± 0.423
2.553ProMet: 2.553 ± 0.659
1.702ProAsn: 1.702 ± 0.212
1.702ProPro: 1.702 ± 0.212
0.851ProGln: 0.851 ± 0.426
5.106ProArg: 5.106 ± 0.428
3.83ProSer: 3.83 ± 0.852
2.128ProThr: 2.128 ± 1.062
1.702ProVal: 1.702 ± 0.212
0.851ProTrp: 0.851 ± 0.426
0.426ProTyr: 0.426 ± 0.213
0.0ProXaa: 0.0 ± 0.0
Gln
4.681GlnAla: 4.681 ± 0.215
0.0GlnCys: 0.0 ± 0.0
1.702GlnAsp: 1.702 ± 0.851
0.851GlnGlu: 0.851 ± 0.426
0.426GlnPhe: 0.426 ± 0.213
1.702GlnGly: 1.702 ± 0.212
0.0GlnHis: 0.0 ± 0.0
0.851GlnIle: 0.851 ± 0.426
0.0GlnLys: 0.0 ± 0.0
2.979GlnLeu: 2.979 ± 1.699
0.851GlnMet: 0.851 ± 0.637
0.426GlnAsn: 0.426 ± 0.213
0.426GlnPro: 0.426 ± 0.213
0.0GlnGln: 0.0 ± 0.0
0.851GlnArg: 0.851 ± 0.426
1.277GlnSer: 1.277 ± 1.487
0.426GlnThr: 0.426 ± 0.213
0.426GlnVal: 0.426 ± 0.213
1.277GlnTrp: 1.277 ± 0.638
1.277GlnTyr: 1.277 ± 0.424
0.0GlnXaa: 0.0 ± 0.0
Arg
13.191ArgAla: 13.191 ± 1.906
2.128ArgCys: 2.128 ± 1.064
5.957ArgAsp: 5.957 ± 1.916
3.404ArgGlu: 3.404 ± 1.486
1.702ArgPhe: 1.702 ± 0.212
2.128ArgGly: 2.128 ± 0.001
2.979ArgHis: 2.979 ± 0.427
5.106ArgIle: 5.106 ± 1.491
2.979ArgLys: 2.979 ± 0.427
4.681ArgLeu: 4.681 ± 1.911
2.128ArgMet: 2.128 ± 0.001
3.404ArgAsn: 3.404 ± 1.702
3.83ArgPro: 3.83 ± 1.915
0.851ArgGln: 0.851 ± 0.426
3.83ArgArg: 3.83 ± 0.211
5.106ArgSer: 5.106 ± 0.428
2.553ArgThr: 2.553 ± 0.214
5.532ArgVal: 5.532 ± 0.641
0.426ArgTrp: 0.426 ± 0.85
2.553ArgTyr: 2.553 ± 0.214
0.0ArgXaa: 0.0 ± 0.0
Ser
7.66SerAla: 7.66 ± 0.642
1.702SerCys: 1.702 ± 0.212
2.553SerAsp: 2.553 ± 1.912
3.83SerGlu: 3.83 ± 0.211
2.128SerPhe: 2.128 ± 1.064
3.83SerGly: 3.83 ± 0.852
2.128SerHis: 2.128 ± 0.001
1.702SerIle: 1.702 ± 0.851
3.83SerLys: 3.83 ± 1.915
5.957SerLeu: 5.957 ± 1.916
0.851SerMet: 0.851 ± 0.426
1.277SerAsn: 1.277 ± 0.424
2.979SerPro: 2.979 ± 2.762
2.553SerGln: 2.553 ± 0.214
4.255SerArg: 4.255 ± 1.065
5.106SerSer: 5.106 ± 2.761
4.681SerThr: 4.681 ± 0.215
5.106SerVal: 5.106 ± 1.698
0.426SerTrp: 0.426 ± 0.213
2.128SerTyr: 2.128 ± 0.001
0.0SerXaa: 0.0 ± 0.0
Thr
11.064ThrAla: 11.064 ± 1.907
1.277ThrCys: 1.277 ± 0.424
3.404ThrAsp: 3.404 ± 0.423
5.106ThrGlu: 5.106 ± 0.428
1.702ThrPhe: 1.702 ± 0.212
2.128ThrGly: 2.128 ± 2.125
2.553ThrHis: 2.553 ± 1.277
2.553ThrIle: 2.553 ± 0.214
2.128ThrLys: 2.128 ± 1.062
5.532ThrLeu: 5.532 ± 0.641
2.128ThrMet: 2.128 ± 1.062
1.277ThrAsn: 1.277 ± 0.638
1.277ThrPro: 1.277 ± 0.638
1.277ThrGln: 1.277 ± 0.638
2.979ThrArg: 2.979 ± 0.636
3.83ThrSer: 3.83 ± 0.852
1.702ThrThr: 1.702 ± 2.338
3.404ThrVal: 3.404 ± 0.64
1.277ThrTrp: 1.277 ± 0.638
0.851ThrTyr: 0.851 ± 0.426
0.0ThrXaa: 0.0 ± 0.0
Val
8.936ValAla: 8.936 ± 2.971
0.426ValCys: 0.426 ± 0.85
4.681ValAsp: 4.681 ± 0.848
4.255ValGlu: 4.255 ± 2.128
2.979ValPhe: 2.979 ± 1.49
4.255ValGly: 4.255 ± 0.002
1.277ValHis: 1.277 ± 0.424
3.404ValIle: 3.404 ± 0.423
2.979ValLys: 2.979 ± 0.636
8.511ValLeu: 8.511 ± 2.121
2.979ValMet: 2.979 ± 0.427
2.553ValAsn: 2.553 ± 0.849
4.255ValPro: 4.255 ± 1.065
1.702ValGln: 1.702 ± 1.275
7.234ValArg: 7.234 ± 0.634
5.106ValSer: 5.106 ± 1.491
5.106ValThr: 5.106 ± 0.635
7.66ValVal: 7.66 ± 0.421
0.426ValTrp: 0.426 ± 0.213
3.404ValTyr: 3.404 ± 0.64
0.0ValXaa: 0.0 ± 0.0
Trp
2.128TrpAla: 2.128 ± 1.064
0.851TrpCys: 0.851 ± 0.637
1.277TrpAsp: 1.277 ± 0.638
0.851TrpGlu: 0.851 ± 0.426
0.0TrpPhe: 0.0 ± 0.0
1.277TrpGly: 1.277 ± 0.424
1.702TrpHis: 1.702 ± 0.851
0.851TrpIle: 0.851 ± 0.426
0.851TrpLys: 0.851 ± 0.637
0.426TrpLeu: 0.426 ± 0.213
0.851TrpMet: 0.851 ± 0.426
1.277TrpAsn: 1.277 ± 0.424
0.851TrpPro: 0.851 ± 0.637
0.426TrpGln: 0.426 ± 0.213
2.128TrpArg: 2.128 ± 1.064
0.851TrpSer: 0.851 ± 0.426
0.0TrpThr: 0.0 ± 0.0
2.553TrpVal: 2.553 ± 0.849
0.426TrpTrp: 0.426 ± 0.85
0.426TrpTyr: 0.426 ± 0.213
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.681TyrAla: 4.681 ± 1.278
0.426TyrCys: 0.426 ± 0.85
0.851TyrAsp: 0.851 ± 0.637
0.851TyrGlu: 0.851 ± 0.637
1.277TyrPhe: 1.277 ± 0.638
1.702TyrGly: 1.702 ± 0.851
0.851TyrHis: 0.851 ± 0.637
0.851TyrIle: 0.851 ± 0.426
0.851TyrLys: 0.851 ± 0.637
2.979TyrLeu: 2.979 ± 1.49
0.851TyrMet: 0.851 ± 0.426
0.426TyrAsn: 0.426 ± 0.213
0.851TyrPro: 0.851 ± 0.637
1.702TyrGln: 1.702 ± 0.212
2.128TyrArg: 2.128 ± 1.064
0.851TyrSer: 0.851 ± 0.426
0.851TyrThr: 0.851 ± 0.426
3.83TyrVal: 3.83 ± 0.211
0.426TyrTrp: 0.426 ± 0.213
1.277TyrTyr: 1.277 ± 0.638
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2351 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski