Amino acid dipeptide frequency for Hubei unio douglasiae virus 1

Besides single amino acid frequencies, one can also calculate so-called amino acid dipeptide frequencies, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 amino acids equally likely, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
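As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping residue pairs and reports them in per mille. It is not the code used to produce this page. The function name `dipeptide_per_mille`, the use of one-letter residue codes, the toy sequences, and the choice to count pairs within each protein separately are all assumptions made for the example.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a two-residue window along each protein separately, so that
        # no pair spans the boundary between two proteins (an assumption).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy one-letter sequences, not taken from the virus proteome.
toy = ["AAGA", "GAL"]
freqs = dipeptide_per_mille(toy)

# Uniform expectation for the 20 standard amino acids: 1/400 = 0.25% = 2.5 per mille.
expected_uniform = 1000.0 / (20 * 20)
```

In the toy input the pair `GA` occurs twice among five pairs, so it gets a frequency of 400‰, far above the uniform expectation of 2.5‰.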

All values are given in per mille (‰) and must therefore be multiplied by 10⁻³ to obtain fractions.
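The unit conversion and the comparison with the uniform expectation can be sketched as follows. The helper names are hypothetical, and using exactly 2.5‰ (1/400) as the cut-off between over- and underrepresented dipeptides is an assumption based on the text above, not a documented rule of the page.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille.
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5

def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3

def representation(value, expected=EXPECTED_PER_MILLE):
    """Classify a per mille value relative to the uniform expectation."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"

# Two values copied from the table on this page.
leu_ser = 11.017
cys_cys = 0.0

leu_ser_fraction = per_mille_to_fraction(leu_ser)  # about 0.011, i.e. about 1.1%
leu_ser_class = representation(leu_ser)
cys_cys_class = representation(cys_cys)
```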

For more information, see the sequence space article on Wikipedia.

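Entries below are grouped by first residue; within each group the second residue follows this order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa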
Ala
AlaAla: 6.481 ± 0.261
AlaCys: 0.648 ± 0.726
AlaAsp: 2.592 ± 1.138
AlaGlu: 0.648 ± 0.326
AlaPhe: 3.889 ± 1.03
AlaGly: 7.777 ± 3.233
AlaHis: 2.592 ± 0.673
AlaIle: 1.296 ± 0.539
AlaLys: 4.537 ± 0.645
AlaLeu: 4.537 ± 2.447
AlaMet: 0.0 ± 0.0
AlaAsn: 4.537 ± 1.154
AlaPro: 5.185 ± 1.5
AlaGln: 3.24 ± 1.63
AlaArg: 3.889 ± 1.956
AlaSer: 7.129 ± 1.812
AlaThr: 9.073 ± 3.05
AlaVal: 4.537 ± 1.154
AlaTrp: 0.0 ± 0.0
AlaTyr: 3.24 ± 0.846
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.296 ± 0.539
CysCys: 0.0 ± 0.0
CysAsp: 0.648 ± 0.326
CysGlu: 0.648 ± 0.326
CysPhe: 0.648 ± 0.326
CysGly: 0.648 ± 0.726
CysHis: 0.0 ± 0.0
CysIle: 0.648 ± 0.726
CysLys: 0.0 ± 0.0
CysLeu: 0.648 ± 0.326
CysMet: 0.0 ± 0.0
CysAsn: 1.944 ± 0.515
CysPro: 0.648 ± 0.326
CysGln: 0.648 ± 0.326
CysArg: 1.296 ± 0.652
CysSer: 1.296 ± 0.539
CysThr: 1.296 ± 0.539
CysVal: 1.296 ± 1.453
CysTrp: 0.648 ± 0.726
CysTyr: 1.944 ± 0.978
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.592 ± 1.088
AspCys: 1.296 ± 0.539
AspAsp: 3.889 ± 1.211
AspGlu: 2.592 ± 1.138
AspPhe: 1.944 ± 2.179
AspGly: 3.889 ± 1.956
AspHis: 1.944 ± 1.191
AspIle: 3.889 ± 1.956
AspLys: 1.296 ± 0.652
AspLeu: 3.889 ± 0.678
AspMet: 1.296 ± 0.61
AspAsn: 3.24 ± 0.924
AspPro: 2.592 ± 1.304
AspGln: 5.185 ± 1.709
AspArg: 1.296 ± 1.325
AspSer: 3.889 ± 1.03
AspThr: 5.833 ± 2.14
AspVal: 3.889 ± 1.03
AspTrp: 1.296 ± 0.652
AspTyr: 3.24 ± 1.63
AspXaa: 0.0 ± 0.0
Glu
GluAla: 3.24 ± 1.176
GluCys: 0.648 ± 0.326
GluAsp: 0.648 ± 0.326
GluGlu: 1.944 ± 1.365
GluPhe: 1.944 ± 0.515
GluGly: 1.944 ± 2.065
GluHis: 1.944 ± 0.978
GluIle: 3.889 ± 1.03
GluLys: 2.592 ± 1.138
GluLeu: 6.481 ± 1.794
GluMet: 0.0 ± 0.0
GluAsn: 2.592 ± 0.673
GluPro: 1.296 ± 0.539
GluGln: 0.648 ± 1.519
GluArg: 3.889 ± 1.211
GluSer: 1.944 ± 0.978
GluThr: 2.592 ± 1.138
GluVal: 1.944 ± 4.556
GluTrp: 0.0 ± 0.0
GluTyr: 2.592 ± 0.673
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.24 ± 1.764
PheCys: 1.296 ± 0.539
PheAsp: 1.296 ± 0.652
PheGlu: 1.944 ± 0.515
PhePhe: 0.648 ± 0.326
PheGly: 1.296 ± 0.539
PheHis: 1.296 ± 0.652
PheIle: 1.944 ± 0.515
PheLys: 0.648 ± 0.326
PheLeu: 3.24 ± 0.924
PheMet: 0.0 ± 0.0
PheAsn: 2.592 ± 0.673
PhePro: 1.944 ± 0.515
PheGln: 2.592 ± 0.673
PheArg: 1.944 ± 2.179
PheSer: 1.944 ± 0.515
PheThr: 1.296 ± 0.652
PheVal: 1.944 ± 0.978
PheTrp: 1.296 ± 0.539
PheTyr: 1.296 ± 0.652
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.537 ± 2.297
GlyCys: 0.648 ± 0.326
GlyAsp: 1.944 ± 0.515
GlyGlu: 3.24 ± 0.846
GlyPhe: 3.24 ± 1.003
GlyGly: 2.592 ± 0.673
GlyHis: 0.648 ± 0.326
GlyIle: 5.833 ± 2.832
GlyLys: 4.537 ± 0.645
GlyLeu: 7.777 ± 2.061
GlyMet: 3.24 ± 2.678
GlyAsn: 1.296 ± 0.652
GlyPro: 4.537 ± 1.514
GlyGln: 2.592 ± 1.956
GlyArg: 3.889 ± 1.211
GlySer: 3.24 ± 0.924
GlyThr: 4.537 ± 2.297
GlyVal: 2.592 ± 1.956
GlyTrp: 0.648 ± 0.326
GlyTyr: 2.592 ± 1.078
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.296 ± 0.652
HisCys: 0.648 ± 0.326
HisAsp: 1.944 ± 0.515
HisGlu: 1.296 ± 0.652
HisPhe: 0.648 ± 0.326
HisGly: 1.944 ± 0.978
HisHis: 0.0 ± 0.0
HisIle: 2.592 ± 1.304
HisLys: 1.944 ± 0.978
HisLeu: 1.296 ± 3.037
HisMet: 0.0 ± 0.0
HisAsn: 0.648 ± 0.326
HisPro: 1.296 ± 0.652
HisGln: 1.296 ± 1.325
HisArg: 2.592 ± 1.304
HisSer: 2.592 ± 1.304
HisThr: 0.648 ± 0.326
HisVal: 2.592 ± 1.304
HisTrp: 0.648 ± 0.326
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.185 ± 0.452
IleCys: 0.648 ± 0.326
IleAsp: 3.889 ± 1.956
IleGlu: 3.889 ± 1.211
IlePhe: 0.648 ± 0.326
IleGly: 2.592 ± 1.956
IleHis: 1.944 ± 1.191
IleIle: 3.24 ± 0.924
IleLys: 2.592 ± 1.078
IleLeu: 5.185 ± 1.334
IleMet: 1.944 ± 1.365
IleAsn: 1.944 ± 1.365
IlePro: 4.537 ± 1.525
IleGln: 0.648 ± 0.326
IleArg: 2.592 ± 1.138
IleSer: 6.481 ± 0.261
IleThr: 3.889 ± 1.211
IleVal: 5.185 ± 2.176
IleTrp: 0.648 ± 0.326
IleTyr: 0.648 ± 0.326
IleXaa: 0.0 ± 0.0
Lys
LysAla: 3.24 ± 1.63
LysCys: 1.296 ± 0.539
LysAsp: 3.24 ± 0.924
LysGlu: 2.592 ± 1.304
LysPhe: 2.592 ± 0.673
LysGly: 1.296 ± 0.539
LysHis: 2.592 ± 1.304
LysIle: 4.537 ± 0.645
LysLys: 1.296 ± 0.652
LysLeu: 5.185 ± 1.709
LysMet: 1.296 ± 2.57
LysAsn: 3.889 ± 0.678
LysPro: 1.944 ± 1.191
LysGln: 1.296 ± 0.652
LysArg: 2.592 ± 2.837
LysSer: 1.296 ± 0.652
LysThr: 2.592 ± 0.673
LysVal: 1.944 ± 1.237
LysTrp: 0.0 ± 0.0
LysTyr: 0.648 ± 0.326
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 9.721 ± 0.844
LeuCys: 1.296 ± 0.539
LeuAsp: 4.537 ± 2.307
LeuGlu: 1.944 ± 3.098
LeuPhe: 2.592 ± 0.673
LeuGly: 3.889 ± 1.956
LeuHis: 0.648 ± 0.326
LeuIle: 3.889 ± 4.456
LeuLys: 3.24 ± 1.003
LeuLeu: 5.833 ± 3.574
LeuMet: 1.296 ± 1.661
LeuAsn: 2.592 ± 1.088
LeuPro: 1.944 ± 1.191
LeuGln: 2.592 ± 1.138
LeuArg: 7.129 ± 2.778
LeuSer: 11.017 ± 0.838
LeuThr: 9.073 ± 6.63
LeuVal: 6.481 ± 3.318
LeuTrp: 0.0 ± 0.0
LeuTyr: 3.889 ± 1.091
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.648 ± 0.726
MetCys: 0.0 ± 0.0
MetAsp: 3.889 ± 1.091
MetGlu: 1.944 ± 1.365
MetPhe: 0.648 ± 0.726
MetGly: 0.648 ± 0.726
MetHis: 0.0 ± 0.0
MetIle: 0.648 ± 0.326
MetLys: 0.648 ± 0.326
MetLeu: 1.296 ± 1.325
MetMet: 0.648 ± 0.726
MetAsn: 1.296 ± 1.325
MetPro: 0.0 ± 0.0
MetGln: 0.648 ± 0.726
MetArg: 0.0 ± 0.0
MetSer: 0.0 ± 0.0
MetThr: 3.24 ± 2.29
MetVal: 1.296 ± 3.037
MetTrp: 0.0 ± 0.0
MetTyr: 0.648 ± 0.726
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.296 ± 0.652
AsnCys: 0.648 ± 0.326
AsnAsp: 3.889 ± 1.956
AsnGlu: 3.24 ± 1.415
AsnPhe: 1.944 ± 0.515
AsnGly: 4.537 ± 1.154
AsnHis: 1.296 ± 0.652
AsnIle: 2.592 ± 1.138
AsnLys: 1.944 ± 0.978
AsnLeu: 1.944 ± 1.191
AsnMet: 1.944 ± 1.365
AsnAsn: 2.592 ± 0.673
AsnPro: 1.296 ± 0.539
AsnGln: 1.296 ± 1.661
AsnArg: 1.296 ± 0.652
AsnSer: 3.24 ± 1.764
AsnThr: 4.537 ± 1.514
AsnVal: 3.889 ± 1.091
AsnTrp: 0.648 ± 1.519
AsnTyr: 3.889 ± 1.211
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.592 ± 1.078
ProCys: 2.592 ± 1.078
ProAsp: 3.889 ± 2.474
ProGlu: 1.944 ± 0.978
ProPhe: 1.944 ± 0.978
ProGly: 5.185 ± 1.5
ProHis: 1.944 ± 1.191
ProIle: 3.24 ± 1.764
ProLys: 1.944 ± 0.978
ProLeu: 3.889 ± 2.383
ProMet: 0.648 ± 0.726
ProAsn: 3.889 ± 1.03
ProPro: 1.944 ± 0.515
ProGln: 3.24 ± 1.176
ProArg: 3.24 ± 1.63
ProSer: 1.944 ± 1.365
ProThr: 3.889 ± 1.091
ProVal: 5.833 ± 1.583
ProTrp: 1.296 ± 0.652
ProTyr: 3.24 ± 0.924
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 1.944 ± 1.365
GlnCys: 0.0 ± 0.0
GlnAsp: 3.889 ± 0.678
GlnGlu: 0.0 ± 0.0
GlnPhe: 2.592 ± 0.673
GlnGly: 1.944 ± 0.515
GlnHis: 1.296 ± 0.652
GlnIle: 1.296 ± 1.661
GlnLys: 2.592 ± 1.304
GlnLeu: 1.944 ± 3.098
GlnMet: 0.648 ± 0.326
GlnAsn: 2.592 ± 2.837
GlnPro: 3.889 ± 1.03
GlnGln: 1.944 ± 1.365
GlnArg: 4.537 ± 1.482
GlnSer: 3.889 ± 1.211
GlnThr: 3.24 ± 1.63
GlnVal: 0.648 ± 0.326
GlnTrp: 0.0 ± 0.0
GlnTyr: 1.296 ± 1.325
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 4.537 ± 1.514
ArgCys: 1.296 ± 0.652
ArgAsp: 1.944 ± 0.978
ArgGlu: 3.24 ± 0.924
ArgPhe: 1.296 ± 0.652
ArgGly: 5.185 ± 1.346
ArgHis: 0.648 ± 0.326
ArgIle: 2.592 ± 2.651
ArgLys: 2.592 ± 1.088
ArgLeu: 3.889 ± 1.297
ArgMet: 1.296 ± 1.325
ArgAsn: 1.944 ± 0.978
ArgPro: 3.889 ± 1.969
ArgGln: 3.24 ± 1.63
ArgArg: 3.24 ± 1.63
ArgSer: 3.889 ± 0.678
ArgThr: 7.777 ± 2.594
ArgVal: 4.537 ± 1.514
ArgTrp: 1.296 ± 0.652
ArgTyr: 1.296 ± 0.539
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.537 ± 1.154
SerCys: 0.0 ± 0.0
SerAsp: 4.537 ± 1.514
SerGlu: 3.24 ± 1.176
SerPhe: 1.944 ± 1.237
SerGly: 7.777 ± 2.061
SerHis: 1.944 ± 0.978
SerIle: 1.944 ± 0.515
SerLys: 5.185 ± 1.995
SerLeu: 4.537 ± 2.297
SerMet: 0.648 ± 0.326
SerAsn: 4.537 ± 1.514
SerPro: 6.481 ± 1.249
SerGln: 1.944 ± 2.179
SerArg: 4.537 ± 2.307
SerSer: 2.592 ± 1.138
SerThr: 6.481 ± 3.318
SerVal: 5.185 ± 1.5
SerTrp: 0.0 ± 0.0
SerTyr: 3.24 ± 1.764
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.833 ± 1.546
ThrCys: 0.648 ± 0.726
ThrAsp: 8.425 ± 2.722
ThrGlu: 3.889 ± 1.297
ThrPhe: 1.296 ± 0.539
ThrGly: 3.24 ± 1.764
ThrHis: 1.296 ± 0.652
ThrIle: 8.425 ± 2.149
ThrLys: 3.24 ± 1.176
ThrLeu: 8.425 ± 4.492
ThrMet: 1.296 ± 1.067
ThrAsn: 3.24 ± 1.415
ThrPro: 6.481 ± 1.655
ThrGln: 3.24 ± 2.591
ThrArg: 5.185 ± 0.765
ThrSer: 4.537 ± 3.584
ThrThr: 5.185 ± 1.825
ThrVal: 5.185 ± 1.709
ThrTrp: 0.648 ± 0.726
ThrTyr: 1.944 ± 0.978
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.129 ± 0.474
ValCys: 1.296 ± 0.539
ValAsp: 3.24 ± 1.176
ValGlu: 3.889 ± 2.364
ValPhe: 1.296 ± 0.652
ValGly: 6.481 ± 3.694
ValHis: 2.592 ± 1.304
ValIle: 3.24 ± 0.924
ValLys: 3.24 ± 1.176
ValLeu: 9.721 ± 3.494
ValMet: 0.0 ± 0.0
ValAsn: 0.648 ± 0.326
ValPro: 5.185 ± 2.176
ValGln: 0.648 ± 0.326
ValArg: 4.537 ± 1.482
ValSer: 5.185 ± 1.825
ValThr: 3.24 ± 0.924
ValVal: 4.537 ± 0.645
ValTrp: 0.648 ± 0.726
ValTyr: 0.648 ± 0.726
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.648 ± 0.726
TrpCys: 0.648 ± 0.326
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.648 ± 0.326
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 0.648 ± 0.326
TrpLys: 1.296 ± 1.453
TrpLeu: 0.648 ± 1.519
TrpMet: 0.648 ± 1.519
TrpAsn: 1.296 ± 0.652
TrpPro: 0.648 ± 0.326
TrpGln: 0.0 ± 0.0
TrpArg: 0.0 ± 0.0
TrpSer: 0.0 ± 0.0
TrpThr: 0.648 ± 0.326
TrpVal: 1.944 ± 0.515
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.296 ± 0.539
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 5.833 ± 1.546
TyrCys: 0.648 ± 0.326
TyrAsp: 0.648 ± 0.326
TyrGlu: 0.648 ± 0.726
TyrPhe: 1.296 ± 0.652
TyrGly: 1.296 ± 0.539
TyrHis: 1.296 ± 0.652
TyrIle: 1.944 ± 0.515
TyrLys: 0.0 ± 0.0
TyrLeu: 3.24 ± 1.63
TyrMet: 0.648 ± 0.326
TyrAsn: 0.648 ± 0.326
TyrPro: 2.592 ± 1.078
TyrGln: 3.24 ± 1.176
TyrArg: 1.944 ± 2.179
TyrSer: 5.185 ± 0.452
TyrThr: 3.24 ± 0.924
TyrVal: 1.944 ± 0.978
TyrTrp: 1.296 ± 1.661
TyrTyr: 1.944 ± 0.978
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (1544 amino acids)
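Because the proteome is so small, every value in the table corresponds to a small whole number of occurrences. The Python sketch below converts per mille values back to approximate counts. The denominator of 1543 pairs (1544 − 1) is an assumption inferred from the table values themselves (the smallest non-zero value, 0.648‰, matches 1/1543); the page does not state how the total number of dipeptides is defined. The function name `approx_count` is hypothetical.

```python
# Assumed total number of dipeptides: 1544 residues minus 1.
# This is inferred from the table values, not stated by the source.
N_PAIRS = 1544 - 1

def approx_count(per_mille, n_pairs=N_PAIRS):
    """Convert a per mille frequency back to the nearest whole-number count."""
    return round(per_mille * 1e-3 * n_pairs)

# Three values copied from the table on this page.
examples = {"AlaAla": 6.481, "LeuSer": 11.017, "AlaCys": 0.648}
counts = {pair: approx_count(value) for pair, value in examples.items()}
# counts == {"AlaAla": 10, "LeuSer": 17, "AlaCys": 1}
```

Under this assumption even the most frequent dipeptide, LeuSer, is observed only 17 times, so individual frequencies should be read with caution.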

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
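Protein-level bootstrapping can be sketched in Python as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resampled set, and the spread of the 100 estimates is reported. This is only an illustration under stated assumptions, not the database's actual procedure. The function name `bootstrap_error`, the one-letter toy sequences, the fixed random seed, and the use of the population standard deviation as the error measure are all assumptions.

```python
import random
import statistics
from collections import Counter

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Spread (population SD, in per mille) of one dipeptide's frequency
    across protein-level bootstrap replicates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, keeping the set size.
        sample = [rng.choice(sequences) for _ in sequences]
        counts = Counter()
        for seq in sample:
            counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
        total = sum(counts.values())
        estimates.append(1000.0 * counts[pair] / total if total else 0.0)
    return statistics.pstdev(estimates)

# Toy one-letter sequences, not taken from the virus proteome.
toy = ["AAGA", "GAL", "LLGA"]
error_ga = bootstrap_error(toy, "GA")
```

With only three proteins, resampled sets can differ considerably from one another, which may be why some errors in the table are larger than the values they accompany.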

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski