Amino acid dipepetide frequency for Hubei permutotetra-like virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.705AlaAla: 2.705 ± 2.299
0.0AlaCys: 0.0 ± 0.0
2.028AlaAsp: 2.028 ± 1.286
3.381AlaGlu: 3.381 ± 1.082
0.676AlaPhe: 0.676 ± 0.376
0.676AlaGly: 0.676 ± 0.376
1.352AlaHis: 1.352 ± 0.935
2.028AlaIle: 2.028 ± 1.129
9.466AlaLys: 9.466 ± 4.167
4.057AlaLeu: 4.057 ± 0.265
2.705AlaMet: 2.705 ± 1.506
3.381AlaAsn: 3.381 ± 2.147
4.733AlaPro: 4.733 ± 2.203
1.352AlaGln: 1.352 ± 0.753
1.352AlaArg: 1.352 ± 0.753
4.733AlaSer: 4.733 ± 1.773
4.057AlaThr: 4.057 ± 1.859
7.437AlaVal: 7.437 ± 1.759
0.676AlaTrp: 0.676 ± 0.376
4.057AlaTyr: 4.057 ± 1.355
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.676CysAsp: 0.676 ± 1.158
0.676CysGlu: 0.676 ± 1.344
0.0CysPhe: 0.0 ± 0.0
0.0CysGly: 0.0 ± 0.0
0.676CysHis: 0.676 ± 0.376
0.0CysIle: 0.0 ± 0.0
1.352CysLys: 1.352 ± 0.753
1.352CysLeu: 1.352 ± 0.753
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.676CysPro: 0.676 ± 0.376
1.352CysGln: 1.352 ± 0.935
0.0CysArg: 0.0 ± 0.0
0.676CysSer: 0.676 ± 1.158
0.0CysThr: 0.0 ± 0.0
0.676CysVal: 0.676 ± 1.344
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.381AspAla: 3.381 ± 1.882
0.0AspCys: 0.0 ± 0.0
3.381AspAsp: 3.381 ± 1.803
4.057AspGlu: 4.057 ± 0.265
2.705AspPhe: 2.705 ± 0.888
3.381AspGly: 3.381 ± 1.082
0.0AspHis: 0.0 ± 0.0
3.381AspIle: 3.381 ± 1.082
3.381AspLys: 3.381 ± 1.727
7.437AspLeu: 7.437 ± 0.821
2.028AspMet: 2.028 ± 0.83
1.352AspAsn: 1.352 ± 0.753
1.352AspPro: 1.352 ± 0.753
0.676AspGln: 0.676 ± 1.158
5.409AspArg: 5.409 ± 0.656
1.352AspSer: 1.352 ± 1.149
5.409AspThr: 5.409 ± 0.829
4.733AspVal: 4.733 ± 1.669
1.352AspTrp: 1.352 ± 0.753
3.381AspTyr: 3.381 ± 0.561
0.0AspXaa: 0.0 ± 0.0
Glu
2.028GluAla: 2.028 ± 0.83
0.0GluCys: 0.0 ± 0.0
3.381GluAsp: 3.381 ± 1.882
2.028GluGlu: 2.028 ± 1.129
1.352GluPhe: 1.352 ± 1.658
4.733GluGly: 4.733 ± 2.635
3.381GluHis: 3.381 ± 1.082
2.705GluIle: 2.705 ± 1.096
2.705GluLys: 2.705 ± 1.506
4.733GluLeu: 4.733 ± 2.635
2.028GluMet: 2.028 ± 0.83
1.352GluAsn: 1.352 ± 0.935
2.028GluPro: 2.028 ± 1.129
6.761GluGln: 6.761 ± 1.387
2.705GluArg: 2.705 ± 1.506
4.733GluSer: 4.733 ± 3.688
2.705GluThr: 2.705 ± 1.506
3.381GluVal: 3.381 ± 1.251
1.352GluTrp: 1.352 ± 0.935
1.352GluTyr: 1.352 ± 0.935
0.0GluXaa: 0.0 ± 0.0
Phe
1.352PheAla: 1.352 ± 0.753
0.0PheCys: 0.0 ± 0.0
1.352PheAsp: 1.352 ± 0.753
1.352PheGlu: 1.352 ± 0.935
0.676PhePhe: 0.676 ± 0.376
0.676PheGly: 0.676 ± 0.376
0.676PheHis: 0.676 ± 0.376
2.028PheIle: 2.028 ± 1.058
2.028PheLys: 2.028 ± 0.83
2.028PheLeu: 2.028 ± 1.129
0.676PheMet: 0.676 ± 0.376
2.705PheAsn: 2.705 ± 0.918
0.676PhePro: 0.676 ± 0.376
1.352PheGln: 1.352 ± 1.149
0.0PheArg: 0.0 ± 0.0
3.381PheSer: 3.381 ± 1.082
2.705PheThr: 2.705 ± 2.161
2.705PheVal: 2.705 ± 2.161
0.676PheTrp: 0.676 ± 0.376
1.352PheTyr: 1.352 ± 2.316
0.0PheXaa: 0.0 ± 0.0
Gly
4.057GlyAla: 4.057 ± 1.355
0.676GlyCys: 0.676 ± 1.344
3.381GlyAsp: 3.381 ± 2.147
3.381GlyGlu: 3.381 ± 1.882
0.676GlyPhe: 0.676 ± 0.376
4.733GlyGly: 4.733 ± 2.635
0.0GlyHis: 0.0 ± 0.0
3.381GlyIle: 3.381 ± 2.147
5.409GlyLys: 5.409 ± 0.656
4.057GlyLeu: 4.057 ± 0.265
0.676GlyMet: 0.676 ± 0.376
1.352GlyAsn: 1.352 ± 1.149
2.705GlyPro: 2.705 ± 0.918
2.028GlyGln: 2.028 ± 1.058
2.028GlyArg: 2.028 ± 1.058
2.028GlySer: 2.028 ± 1.129
4.733GlyThr: 4.733 ± 2.635
6.761GlyVal: 6.761 ± 1.747
0.676GlyTrp: 0.676 ± 0.376
1.352GlyTyr: 1.352 ± 0.753
0.0GlyXaa: 0.0 ± 0.0
His
0.676HisAla: 0.676 ± 0.376
0.0HisCys: 0.0 ± 0.0
2.705HisAsp: 2.705 ± 1.506
0.676HisGlu: 0.676 ± 1.158
0.676HisPhe: 0.676 ± 0.376
2.705HisGly: 2.705 ± 0.888
0.0HisHis: 0.0 ± 0.0
0.676HisIle: 0.676 ± 0.376
1.352HisLys: 1.352 ± 0.935
0.676HisLeu: 0.676 ± 1.158
0.0HisMet: 0.0 ± 0.0
0.676HisAsn: 0.676 ± 0.376
2.705HisPro: 2.705 ± 1.096
0.676HisGln: 0.676 ± 0.376
0.0HisArg: 0.0 ± 0.0
0.676HisSer: 0.676 ± 0.376
2.028HisThr: 2.028 ± 0.83
0.0HisVal: 0.0 ± 0.0
0.0HisTrp: 0.0 ± 0.0
0.676HisTyr: 0.676 ± 1.344
0.0HisXaa: 0.0 ± 0.0
Ile
0.676IleAla: 0.676 ± 0.376
0.676IleCys: 0.676 ± 1.158
1.352IleAsp: 1.352 ± 1.149
2.705IleGlu: 2.705 ± 0.888
0.676IlePhe: 0.676 ± 0.376
2.705IleGly: 2.705 ± 0.918
1.352IleHis: 1.352 ± 1.149
0.676IleIle: 0.676 ± 0.376
2.705IleLys: 2.705 ± 0.918
4.057IleLeu: 4.057 ± 1.453
2.028IleMet: 2.028 ± 0.83
2.028IleAsn: 2.028 ± 0.83
4.057IlePro: 4.057 ± 2.888
2.705IleGln: 2.705 ± 0.918
1.352IleArg: 1.352 ± 1.658
4.057IleSer: 4.057 ± 3.448
4.057IleThr: 4.057 ± 2.116
3.381IleVal: 3.381 ± 1.727
0.0IleTrp: 0.0 ± 0.0
2.705IleTyr: 2.705 ± 0.918
0.0IleXaa: 0.0 ± 0.0
Lys
3.381LysAla: 3.381 ± 1.251
0.0LysCys: 0.0 ± 0.0
2.705LysAsp: 2.705 ± 1.506
2.705LysGlu: 2.705 ± 1.506
2.705LysPhe: 2.705 ± 1.869
2.705LysGly: 2.705 ± 1.096
0.676LysHis: 0.676 ± 0.376
4.057LysIle: 4.057 ± 1.859
12.17LysLys: 12.17 ± 3.05
8.79LysLeu: 8.79 ± 3.799
0.0LysMet: 0.0 ± 0.0
4.057LysAsn: 4.057 ± 2.116
8.114LysPro: 8.114 ± 5.146
6.761LysGln: 6.761 ± 1.291
4.057LysArg: 4.057 ± 2.804
5.409LysSer: 5.409 ± 3.011
6.085LysThr: 6.085 ± 1.3
6.761LysVal: 6.761 ± 1.387
1.352LysTrp: 1.352 ± 0.753
2.028LysTyr: 2.028 ± 1.058
0.0LysXaa: 0.0 ± 0.0
Leu
8.79LeuAla: 8.79 ± 1.817
0.0LeuCys: 0.0 ± 0.0
5.409LeuAsp: 5.409 ± 2.544
4.733LeuGlu: 4.733 ± 0.33
6.085LeuPhe: 6.085 ± 2.352
4.733LeuGly: 4.733 ± 1.773
1.352LeuHis: 1.352 ± 0.753
3.381LeuIle: 3.381 ± 1.082
6.085LeuLys: 6.085 ± 1.018
9.466LeuLeu: 9.466 ± 2.034
3.381LeuMet: 3.381 ± 2.177
2.705LeuAsn: 2.705 ± 2.161
8.79LeuPro: 8.79 ± 1.435
6.761LeuGln: 6.761 ± 0.629
4.057LeuArg: 4.057 ± 1.355
6.761LeuSer: 6.761 ± 1.747
8.79LeuThr: 8.79 ± 1.147
4.057LeuVal: 4.057 ± 1.355
1.352LeuTrp: 1.352 ± 0.753
3.381LeuTyr: 3.381 ± 1.082
0.0LeuXaa: 0.0 ± 0.0
Met
4.057MetAla: 4.057 ± 1.355
0.676MetCys: 0.676 ± 0.376
2.705MetAsp: 2.705 ± 1.506
1.352MetGlu: 1.352 ± 2.316
0.676MetPhe: 0.676 ± 0.376
0.676MetGly: 0.676 ± 1.344
0.0MetHis: 0.0 ± 0.0
1.352MetIle: 1.352 ± 0.753
0.676MetLys: 0.676 ± 0.376
3.381MetLeu: 3.381 ± 1.882
1.352MetMet: 1.352 ± 0.935
3.381MetAsn: 3.381 ± 3.22
1.352MetPro: 1.352 ± 1.149
0.0MetGln: 0.0 ± 0.0
0.676MetArg: 0.676 ± 0.376
3.381MetSer: 3.381 ± 1.251
0.676MetThr: 0.676 ± 1.158
2.705MetVal: 2.705 ± 1.506
0.0MetTrp: 0.0 ± 0.0
2.028MetTyr: 2.028 ± 0.83
0.0MetXaa: 0.0 ± 0.0
Asn
1.352AsnAla: 1.352 ± 0.935
0.676AsnCys: 0.676 ± 0.376
2.028AsnAsp: 2.028 ± 2.071
0.676AsnGlu: 0.676 ± 0.376
2.028AsnPhe: 2.028 ± 2.071
1.352AsnGly: 1.352 ± 0.935
2.028AsnHis: 2.028 ± 2.071
3.381AsnIle: 3.381 ± 2.944
4.733AsnLys: 4.733 ± 3.317
4.733AsnLeu: 4.733 ± 2.567
2.028AsnMet: 2.028 ± 1.286
0.676AsnAsn: 0.676 ± 1.344
2.028AsnPro: 2.028 ± 1.058
2.028AsnGln: 2.028 ± 1.129
2.705AsnArg: 2.705 ± 2.299
0.676AsnSer: 0.676 ± 1.158
4.057AsnThr: 4.057 ± 4.33
1.352AsnVal: 1.352 ± 0.753
0.0AsnTrp: 0.0 ± 0.0
0.676AsnTyr: 0.676 ± 0.376
0.0AsnXaa: 0.0 ± 0.0
Pro
3.381ProAla: 3.381 ± 1.803
0.0ProCys: 0.0 ± 0.0
4.057ProAsp: 4.057 ± 1.487
7.437ProGlu: 7.437 ± 3.068
1.352ProPhe: 1.352 ± 1.658
2.705ProGly: 2.705 ± 1.096
0.676ProHis: 0.676 ± 1.344
4.057ProIle: 4.057 ± 2.888
2.705ProLys: 2.705 ± 0.888
6.085ProLeu: 6.085 ± 0.632
4.733ProMet: 4.733 ± 0.824
2.705ProAsn: 2.705 ± 0.918
2.705ProPro: 2.705 ± 0.888
3.381ProGln: 3.381 ± 4.386
3.381ProArg: 3.381 ± 1.251
3.381ProSer: 3.381 ± 1.727
4.057ProThr: 4.057 ± 2.258
2.705ProVal: 2.705 ± 1.096
2.705ProTrp: 2.705 ± 2.161
2.705ProTyr: 2.705 ± 0.888
0.0ProXaa: 0.0 ± 0.0
Gln
2.705GlnAla: 2.705 ± 1.096
0.0GlnCys: 0.0 ± 0.0
2.028GlnAsp: 2.028 ± 1.129
0.676GlnGlu: 0.676 ± 0.376
0.676GlnPhe: 0.676 ± 1.158
2.705GlnGly: 2.705 ± 1.096
1.352GlnHis: 1.352 ± 0.935
2.705GlnIle: 2.705 ± 0.918
6.085GlnLys: 6.085 ± 4.295
6.761GlnLeu: 6.761 ± 1.387
0.676GlnMet: 0.676 ± 0.376
2.705GlnAsn: 2.705 ± 3.222
3.381GlnPro: 3.381 ± 1.882
3.381GlnGln: 3.381 ± 1.882
4.057GlnArg: 4.057 ± 1.355
2.705GlnSer: 2.705 ± 2.161
2.028GlnThr: 2.028 ± 0.83
2.705GlnVal: 2.705 ± 1.506
0.0GlnTrp: 0.0 ± 0.0
0.676GlnTyr: 0.676 ± 1.344
0.0GlnXaa: 0.0 ± 0.0
Arg
1.352ArgAla: 1.352 ± 0.753
0.676ArgCys: 0.676 ± 1.158
4.733ArgAsp: 4.733 ± 1.669
3.381ArgGlu: 3.381 ± 2.177
2.705ArgPhe: 2.705 ± 2.299
2.028ArgGly: 2.028 ± 0.83
1.352ArgHis: 1.352 ± 0.753
0.676ArgIle: 0.676 ± 0.376
4.057ArgLys: 4.057 ± 1.859
2.705ArgLeu: 2.705 ± 0.918
0.0ArgMet: 0.0 ± 0.339
2.028ArgAsn: 2.028 ± 1.129
3.381ArgPro: 3.381 ± 1.082
2.028ArgGln: 2.028 ± 0.83
1.352ArgArg: 1.352 ± 0.753
0.676ArgSer: 0.676 ± 1.344
2.028ArgThr: 2.028 ± 1.286
5.409ArgVal: 5.409 ± 2.004
0.676ArgTrp: 0.676 ± 0.376
1.352ArgTyr: 1.352 ± 1.149
0.0ArgXaa: 0.0 ± 0.0
Ser
4.733SerAla: 4.733 ± 1.607
0.676SerCys: 0.676 ± 0.376
4.057SerAsp: 4.057 ± 1.66
2.028SerGlu: 2.028 ± 1.129
2.028SerPhe: 2.028 ± 1.129
6.761SerGly: 6.761 ± 4.295
1.352SerHis: 1.352 ± 0.753
0.676SerIle: 0.676 ± 1.344
4.057SerLys: 4.057 ± 0.265
9.466SerLeu: 9.466 ± 1.515
0.676SerMet: 0.676 ± 0.376
2.028SerAsn: 2.028 ± 1.058
2.028SerPro: 2.028 ± 1.058
1.352SerGln: 1.352 ± 0.935
1.352SerArg: 1.352 ± 1.149
6.761SerSer: 6.761 ± 1.747
3.381SerThr: 3.381 ± 2.177
6.085SerVal: 6.085 ± 0.632
0.0SerTrp: 0.0 ± 0.0
4.057SerTyr: 4.057 ± 3.448
0.0SerXaa: 0.0 ± 0.0
Thr
6.761ThrAla: 6.761 ± 1.123
1.352ThrCys: 1.352 ± 1.149
4.733ThrAsp: 4.733 ± 1.122
5.409ThrGlu: 5.409 ± 2.004
0.676ThrPhe: 0.676 ± 1.344
5.409ThrGly: 5.409 ± 0.829
0.676ThrHis: 0.676 ± 0.376
2.705ThrIle: 2.705 ± 2.299
5.409ThrLys: 5.409 ± 2.004
9.466ThrLeu: 9.466 ± 4.407
2.705ThrMet: 2.705 ± 1.506
2.028ThrAsn: 2.028 ± 3.474
4.057ThrPro: 4.057 ± 1.453
2.705ThrGln: 2.705 ± 1.506
3.381ThrArg: 3.381 ± 0.561
3.381ThrSer: 3.381 ± 1.251
6.761ThrThr: 6.761 ± 1.291
4.733ThrVal: 4.733 ± 1.607
2.028ThrTrp: 2.028 ± 1.129
2.705ThrTyr: 2.705 ± 1.869
0.0ThrXaa: 0.0 ± 0.0
Val
4.057ValAla: 4.057 ± 2.258
2.028ValCys: 2.028 ± 0.83
4.057ValAsp: 4.057 ± 1.355
6.085ValGlu: 6.085 ± 2.42
1.352ValPhe: 1.352 ± 0.753
2.705ValGly: 2.705 ± 1.506
0.0ValHis: 0.0 ± 0.0
4.057ValIle: 4.057 ± 5.534
4.733ValLys: 4.733 ± 1.773
5.409ValLeu: 5.409 ± 3.011
2.705ValMet: 2.705 ± 0.888
2.705ValAsn: 2.705 ± 0.918
6.761ValPro: 6.761 ± 1.747
1.352ValGln: 1.352 ± 0.753
3.381ValArg: 3.381 ± 0.561
6.085ValSer: 6.085 ± 3.174
5.409ValThr: 5.409 ± 2.191
2.028ValVal: 2.028 ± 1.129
1.352ValTrp: 1.352 ± 2.316
3.381ValTyr: 3.381 ± 1.882
0.0ValXaa: 0.0 ± 0.0
Trp
1.352TrpAla: 1.352 ± 0.753
0.0TrpCys: 0.0 ± 0.0
2.028TrpAsp: 2.028 ± 0.83
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.676TrpHis: 0.676 ± 0.376
2.028TrpIle: 2.028 ± 1.286
1.352TrpLys: 1.352 ± 0.753
0.676TrpLeu: 0.676 ± 0.376
1.352TrpMet: 1.352 ± 0.753
0.676TrpAsn: 0.676 ± 1.158
0.0TrpPro: 0.0 ± 0.0
0.676TrpGln: 0.676 ± 1.158
0.676TrpArg: 0.676 ± 0.376
0.676TrpSer: 0.676 ± 0.376
2.028TrpThr: 2.028 ± 0.83
0.676TrpVal: 0.676 ± 0.376
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
4.057TyrAla: 4.057 ± 0.265
0.676TyrCys: 0.676 ± 0.376
1.352TyrAsp: 1.352 ± 0.753
2.705TyrGlu: 2.705 ± 1.096
0.676TyrPhe: 0.676 ± 1.158
2.705TyrGly: 2.705 ± 1.096
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
4.057TyrLys: 4.057 ± 1.859
4.733TyrLeu: 4.733 ± 1.677
0.676TyrMet: 0.676 ± 1.158
0.676TyrAsn: 0.676 ± 1.158
3.381TyrPro: 3.381 ± 1.082
0.676TyrGln: 0.676 ± 0.376
1.352TyrArg: 1.352 ± 1.149
2.028TyrSer: 2.028 ± 2.473
6.085TyrThr: 6.085 ± 1.987
1.352TyrVal: 1.352 ± 0.753
0.676TyrTrp: 0.676 ± 0.376
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1480 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski