Amino acid dipepetide frequency for Wenzhou bivalvia virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
10.146AlaAla: 10.146 ± 1.507
0.634AlaCys: 0.634 ± 0.339
3.805AlaAsp: 3.805 ± 0.728
3.805AlaGlu: 3.805 ± 0.577
2.536AlaPhe: 2.536 ± 0.05
3.171AlaGly: 3.171 ± 0.389
1.902AlaHis: 1.902 ± 0.288
3.805AlaIle: 3.805 ± 1.882
1.902AlaLys: 1.902 ± 1.594
8.244AlaLeu: 8.244 ± 0.49
2.536AlaMet: 2.536 ± 1.524
2.536AlaAsn: 2.536 ± 0.05
2.536AlaPro: 2.536 ± 2.56
3.805AlaGln: 3.805 ± 0.577
4.439AlaArg: 4.439 ± 1.543
5.707AlaSer: 5.707 ± 0.865
5.707AlaThr: 5.707 ± 0.44
2.536AlaVal: 2.536 ± 1.255
0.0AlaTrp: 0.0 ± 0.0
5.707AlaTyr: 5.707 ± 0.44
0.0AlaXaa: 0.0 ± 0.0
Cys
1.902CysAla: 1.902 ± 0.288
0.0CysCys: 0.0 ± 0.0
1.268CysAsp: 1.268 ± 0.678
0.0CysGlu: 0.0 ± 0.0
1.268CysPhe: 1.268 ± 0.627
0.0CysGly: 0.0 ± 0.0
0.634CysHis: 0.634 ± 0.339
0.634CysIle: 0.634 ± 0.339
3.171CysLys: 3.171 ± 1.695
1.902CysLeu: 1.902 ± 0.288
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
0.634CysGln: 0.634 ± 0.339
0.634CysArg: 0.634 ± 0.339
0.634CysSer: 0.634 ± 0.339
1.268CysThr: 1.268 ± 0.627
0.0CysVal: 0.0 ± 0.0
1.268CysTrp: 1.268 ± 0.678
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
4.439AspAla: 4.439 ± 0.238
1.902AspCys: 1.902 ± 1.017
2.536AspAsp: 2.536 ± 1.356
1.902AspGlu: 1.902 ± 1.017
1.268AspPhe: 1.268 ± 0.678
5.073AspGly: 5.073 ± 1.204
0.634AspHis: 0.634 ± 0.339
3.171AspIle: 3.171 ± 1.695
1.268AspLys: 1.268 ± 0.678
5.073AspLeu: 5.073 ± 1.406
1.268AspMet: 1.268 ± 0.678
6.341AspAsn: 6.341 ± 4.442
3.805AspPro: 3.805 ± 2.034
2.536AspGln: 2.536 ± 1.356
1.902AspArg: 1.902 ± 1.017
3.805AspSer: 3.805 ± 0.728
1.902AspThr: 1.902 ± 0.288
6.341AspVal: 6.341 ± 3.389
1.268AspTrp: 1.268 ± 0.678
1.902AspTyr: 1.902 ± 1.017
0.0AspXaa: 0.0 ± 0.0
Glu
3.171GluAla: 3.171 ± 2.221
0.634GluCys: 0.634 ± 0.339
3.171GluAsp: 3.171 ± 1.695
4.439GluGlu: 4.439 ± 2.373
3.171GluPhe: 3.171 ± 0.916
1.268GluGly: 1.268 ± 0.627
0.634GluHis: 0.634 ± 0.339
3.171GluIle: 3.171 ± 0.916
2.536GluLys: 2.536 ± 0.05
3.171GluLeu: 3.171 ± 0.389
1.902GluMet: 1.902 ± 0.288
1.268GluAsn: 1.268 ± 0.678
2.536GluPro: 2.536 ± 1.356
1.268GluGln: 1.268 ± 0.678
2.536GluArg: 2.536 ± 1.356
0.634GluSer: 0.634 ± 0.339
3.805GluThr: 3.805 ± 0.728
1.902GluVal: 1.902 ± 1.017
1.268GluTrp: 1.268 ± 0.678
1.268GluTyr: 1.268 ± 0.627
0.0GluXaa: 0.0 ± 0.0
Phe
4.439PheAla: 4.439 ± 1.543
0.0PheCys: 0.0 ± 0.0
1.268PheAsp: 1.268 ± 0.627
0.0PheGlu: 0.0 ± 0.0
1.268PhePhe: 1.268 ± 0.627
2.536PheGly: 2.536 ± 1.356
1.268PheHis: 1.268 ± 0.678
4.439PheIle: 4.439 ± 0.238
3.171PheLys: 3.171 ± 2.221
1.902PheLeu: 1.902 ± 0.288
1.268PheMet: 1.268 ± 0.678
2.536PheAsn: 2.536 ± 0.05
1.902PhePro: 1.902 ± 1.594
1.902PheGln: 1.902 ± 1.017
2.536PheArg: 2.536 ± 0.05
2.536PheSer: 2.536 ± 1.356
1.902PheThr: 1.902 ± 1.017
3.805PheVal: 3.805 ± 3.187
1.902PheTrp: 1.902 ± 1.594
1.268PheTyr: 1.268 ± 0.678
0.0PheXaa: 0.0 ± 0.0
Gly
6.975GlyAla: 6.975 ± 5.409
0.0GlyCys: 0.0 ± 0.0
5.707GlyAsp: 5.707 ± 3.05
1.268GlyGlu: 1.268 ± 0.678
3.171GlyPhe: 3.171 ± 3.526
5.073GlyGly: 5.073 ± 1.406
0.634GlyHis: 0.634 ± 0.339
2.536GlyIle: 2.536 ± 0.05
5.073GlyLys: 5.073 ± 2.712
3.171GlyLeu: 3.171 ± 0.389
1.268GlyMet: 1.268 ± 0.678
4.439GlyAsn: 4.439 ± 1.543
4.439GlyPro: 4.439 ± 2.849
0.634GlyGln: 0.634 ± 0.339
5.073GlyArg: 5.073 ± 2.712
6.341GlySer: 6.341 ± 0.526
5.073GlyThr: 5.073 ± 2.51
6.341GlyVal: 6.341 ± 1.832
0.0GlyTrp: 0.0 ± 0.0
1.268GlyTyr: 1.268 ± 0.627
0.0GlyXaa: 0.0 ± 0.0
His
1.902HisAla: 1.902 ± 0.288
0.0HisCys: 0.0 ± 0.0
1.268HisAsp: 1.268 ± 0.627
1.268HisGlu: 1.268 ± 0.678
0.0HisPhe: 0.0 ± 0.0
1.268HisGly: 1.268 ± 0.678
0.634HisHis: 0.634 ± 0.339
0.634HisIle: 0.634 ± 0.339
1.268HisLys: 1.268 ± 0.678
2.536HisLeu: 2.536 ± 1.356
0.634HisMet: 0.634 ± 0.339
1.268HisAsn: 1.268 ± 0.678
2.536HisPro: 2.536 ± 1.356
1.268HisGln: 1.268 ± 0.678
3.171HisArg: 3.171 ± 0.389
0.634HisSer: 0.634 ± 0.339
3.805HisThr: 3.805 ± 0.728
0.634HisVal: 0.634 ± 0.339
0.0HisTrp: 0.0 ± 0.0
0.634HisTyr: 0.634 ± 0.339
0.0HisXaa: 0.0 ± 0.0
Ile
1.268IleAla: 1.268 ± 0.678
3.171IleCys: 3.171 ± 0.389
4.439IleAsp: 4.439 ± 0.238
4.439IleGlu: 4.439 ± 2.373
2.536IlePhe: 2.536 ± 0.05
4.439IleGly: 4.439 ± 1.543
0.634IleHis: 0.634 ± 0.339
2.536IleIle: 2.536 ± 1.356
3.171IleLys: 3.171 ± 0.389
4.439IleLeu: 4.439 ± 2.849
2.536IleMet: 2.536 ± 1.356
1.268IleAsn: 1.268 ± 0.678
0.634IlePro: 0.634 ± 0.339
1.268IleGln: 1.268 ± 0.627
1.268IleArg: 1.268 ± 0.627
4.439IleSer: 4.439 ± 0.238
3.805IleThr: 3.805 ± 0.577
2.536IleVal: 2.536 ± 2.56
0.0IleTrp: 0.0 ± 0.0
0.634IleTyr: 0.634 ± 0.339
0.0IleXaa: 0.0 ± 0.0
Lys
3.805LysAla: 3.805 ± 0.728
1.902LysCys: 1.902 ± 1.017
4.439LysAsp: 4.439 ± 0.238
1.268LysGlu: 1.268 ± 0.678
0.634LysPhe: 0.634 ± 0.966
5.073LysGly: 5.073 ± 1.204
2.536LysHis: 2.536 ± 1.356
2.536LysIle: 2.536 ± 1.255
3.805LysLys: 3.805 ± 5.798
5.073LysLeu: 5.073 ± 0.101
1.902LysMet: 1.902 ± 1.017
1.902LysAsn: 1.902 ± 0.288
6.341LysPro: 6.341 ± 0.526
2.536LysGln: 2.536 ± 0.05
3.805LysArg: 3.805 ± 2.034
1.268LysSer: 1.268 ± 0.627
4.439LysThr: 4.439 ± 2.373
3.171LysVal: 3.171 ± 1.695
0.0LysTrp: 0.0 ± 0.0
1.268LysTyr: 1.268 ± 0.678
0.0LysXaa: 0.0 ± 0.0
Leu
6.341LeuAla: 6.341 ± 2.084
1.268LeuCys: 1.268 ± 0.627
5.073LeuAsp: 5.073 ± 1.406
1.902LeuGlu: 1.902 ± 1.594
2.536LeuPhe: 2.536 ± 1.356
6.341LeuGly: 6.341 ± 1.832
2.536LeuHis: 2.536 ± 1.356
3.171LeuIle: 3.171 ± 0.389
4.439LeuLys: 4.439 ± 1.067
4.439LeuLeu: 4.439 ± 2.373
0.634LeuMet: 0.634 ± 0.339
3.171LeuAsn: 3.171 ± 2.221
6.341LeuPro: 6.341 ± 0.526
3.805LeuGln: 3.805 ± 0.577
3.171LeuArg: 3.171 ± 1.695
5.707LeuSer: 5.707 ± 2.171
4.439LeuThr: 4.439 ± 1.067
3.805LeuVal: 3.805 ± 0.728
1.268LeuTrp: 1.268 ± 0.678
2.536LeuTyr: 2.536 ± 0.05
0.0LeuXaa: 0.0 ± 0.0
Met
2.536MetAla: 2.536 ± 2.56
0.0MetCys: 0.0 ± 0.0
1.902MetAsp: 1.902 ± 1.017
0.0MetGlu: 0.0 ± 0.0
2.536MetPhe: 2.536 ± 1.356
4.439MetGly: 4.439 ± 2.373
0.0MetHis: 0.0 ± 0.0
1.268MetIle: 1.268 ± 0.678
1.902MetLys: 1.902 ± 1.017
1.902MetLeu: 1.902 ± 1.017
1.902MetMet: 1.902 ± 0.288
1.902MetAsn: 1.902 ± 0.288
0.634MetPro: 0.634 ± 0.966
0.0MetGln: 0.0 ± 0.0
3.171MetArg: 3.171 ± 0.389
1.268MetSer: 1.268 ± 0.678
0.634MetThr: 0.634 ± 0.339
1.268MetVal: 1.268 ± 0.678
0.0MetTrp: 0.0 ± 0.0
1.902MetTyr: 1.902 ± 1.017
0.0MetXaa: 0.0 ± 0.0
Asn
3.805AsnAla: 3.805 ± 0.728
0.0AsnCys: 0.0 ± 0.0
1.902AsnAsp: 1.902 ± 0.288
3.171AsnGlu: 3.171 ± 0.389
2.536AsnPhe: 2.536 ± 1.255
3.171AsnGly: 3.171 ± 0.389
1.268AsnHis: 1.268 ± 0.627
1.902AsnIle: 1.902 ± 1.594
1.268AsnLys: 1.268 ± 0.627
2.536AsnLeu: 2.536 ± 0.05
0.634AsnMet: 0.634 ± 0.966
1.902AsnAsn: 1.902 ± 1.594
3.171AsnPro: 3.171 ± 0.389
1.268AsnGln: 1.268 ± 1.933
3.805AsnArg: 3.805 ± 1.882
3.171AsnSer: 3.171 ± 0.916
5.073AsnThr: 5.073 ± 2.51
3.171AsnVal: 3.171 ± 1.695
1.902AsnTrp: 1.902 ± 0.288
3.171AsnTyr: 3.171 ± 0.389
0.0AsnXaa: 0.0 ± 0.0
Pro
1.268ProAla: 1.268 ± 0.627
0.0ProCys: 0.0 ± 0.0
5.073ProAsp: 5.073 ± 1.204
4.439ProGlu: 4.439 ± 1.067
2.536ProPhe: 2.536 ± 2.56
3.805ProGly: 3.805 ± 0.577
1.268ProHis: 1.268 ± 0.678
3.171ProIle: 3.171 ± 0.389
5.707ProLys: 5.707 ± 1.745
3.171ProLeu: 3.171 ± 2.221
1.268ProMet: 1.268 ± 0.627
5.073ProAsn: 5.073 ± 0.101
3.805ProPro: 3.805 ± 0.577
2.536ProGln: 2.536 ± 0.05
3.805ProArg: 3.805 ± 0.577
5.707ProSer: 5.707 ± 2.171
5.073ProThr: 5.073 ± 1.204
2.536ProVal: 2.536 ± 0.05
1.902ProTrp: 1.902 ± 0.288
1.902ProTyr: 1.902 ± 0.288
0.0ProXaa: 0.0 ± 0.0
Gln
1.902GlnAla: 1.902 ± 1.017
0.634GlnCys: 0.634 ± 0.339
2.536GlnAsp: 2.536 ± 0.05
1.902GlnGlu: 1.902 ± 1.594
1.268GlnPhe: 1.268 ± 0.678
1.268GlnGly: 1.268 ± 0.627
2.536GlnHis: 2.536 ± 0.05
0.0GlnIle: 0.0 ± 0.0
3.171GlnLys: 3.171 ± 0.389
1.268GlnLeu: 1.268 ± 0.678
1.268GlnMet: 1.268 ± 0.678
0.0GlnAsn: 0.0 ± 0.0
3.805GlnPro: 3.805 ± 1.882
1.268GlnGln: 1.268 ± 0.678
3.171GlnArg: 3.171 ± 0.389
0.634GlnSer: 0.634 ± 0.339
2.536GlnThr: 2.536 ± 0.05
1.902GlnVal: 1.902 ± 1.017
0.0GlnTrp: 0.0 ± 0.0
1.268GlnTyr: 1.268 ± 1.933
0.0GlnXaa: 0.0 ± 0.0
Arg
4.439ArgAla: 4.439 ± 0.238
0.0ArgCys: 0.0 ± 0.0
2.536ArgAsp: 2.536 ± 1.356
2.536ArgGlu: 2.536 ± 0.05
1.902ArgPhe: 1.902 ± 0.288
3.171ArgGly: 3.171 ± 1.695
3.171ArgHis: 3.171 ± 1.695
1.268ArgIle: 1.268 ± 0.678
1.902ArgLys: 1.902 ± 1.017
7.609ArgLeu: 7.609 ± 0.151
3.805ArgMet: 3.805 ± 1.246
2.536ArgAsn: 2.536 ± 0.05
3.805ArgPro: 3.805 ± 0.728
1.268ArgGln: 1.268 ± 0.678
3.805ArgArg: 3.805 ± 0.728
5.073ArgSer: 5.073 ± 1.204
5.073ArgThr: 5.073 ± 1.406
5.073ArgVal: 5.073 ± 0.101
0.634ArgTrp: 0.634 ± 0.339
1.268ArgTyr: 1.268 ± 0.678
0.0ArgXaa: 0.0 ± 0.0
Ser
5.073SerAla: 5.073 ± 0.101
0.0SerCys: 0.0 ± 0.0
2.536SerAsp: 2.536 ± 1.356
1.902SerGlu: 1.902 ± 1.017
1.268SerPhe: 1.268 ± 0.678
5.707SerGly: 5.707 ± 2.171
1.902SerHis: 1.902 ± 1.017
5.073SerIle: 5.073 ± 1.204
2.536SerLys: 2.536 ± 0.05
5.073SerLeu: 5.073 ± 1.406
1.902SerMet: 1.902 ± 1.594
5.073SerAsn: 5.073 ± 0.101
6.341SerPro: 6.341 ± 0.779
1.902SerGln: 1.902 ± 0.288
5.073SerArg: 5.073 ± 0.101
5.073SerSer: 5.073 ± 3.815
1.902SerThr: 1.902 ± 1.594
5.707SerVal: 5.707 ± 4.781
1.268SerTrp: 1.268 ± 0.627
1.902SerTyr: 1.902 ± 0.288
0.0SerXaa: 0.0 ± 0.0
Thr
5.073ThrAla: 5.073 ± 0.101
2.536ThrCys: 2.536 ± 0.05
2.536ThrAsp: 2.536 ± 1.356
4.439ThrGlu: 4.439 ± 0.238
5.073ThrPhe: 5.073 ± 0.101
4.439ThrGly: 4.439 ± 2.849
1.268ThrHis: 1.268 ± 0.678
5.707ThrIle: 5.707 ± 0.865
3.805ThrLys: 3.805 ± 2.034
2.536ThrLeu: 2.536 ± 0.05
0.634ThrMet: 0.634 ± 0.339
1.268ThrAsn: 1.268 ± 1.933
4.439ThrPro: 4.439 ± 1.067
1.268ThrGln: 1.268 ± 0.678
4.439ThrArg: 4.439 ± 1.067
3.805ThrSer: 3.805 ± 0.728
5.707ThrThr: 5.707 ± 2.171
7.609ThrVal: 7.609 ± 5.07
1.902ThrTrp: 1.902 ± 1.017
1.902ThrTyr: 1.902 ± 0.288
0.0ThrXaa: 0.0 ± 0.0
Val
3.171ValAla: 3.171 ± 0.389
1.268ValCys: 1.268 ± 0.627
3.171ValAsp: 3.171 ± 1.695
3.171ValGlu: 3.171 ± 0.389
2.536ValPhe: 2.536 ± 0.05
4.439ValGly: 4.439 ± 1.543
0.634ValHis: 0.634 ± 0.966
2.536ValIle: 2.536 ± 0.05
5.073ValLys: 5.073 ± 2.51
5.073ValLeu: 5.073 ± 0.101
1.268ValMet: 1.268 ± 0.678
3.171ValAsn: 3.171 ± 0.389
5.073ValPro: 5.073 ± 5.12
2.536ValGln: 2.536 ± 1.255
3.805ValArg: 3.805 ± 2.034
7.609ValSer: 7.609 ± 1.154
3.171ValThr: 3.171 ± 0.389
6.341ValVal: 6.341 ± 0.526
0.634ValTrp: 0.634 ± 0.339
1.902ValTyr: 1.902 ± 0.288
0.0ValXaa: 0.0 ± 0.0
Trp
2.536TrpAla: 2.536 ± 0.05
0.634TrpCys: 0.634 ± 0.339
1.902TrpAsp: 1.902 ± 0.288
0.0TrpGlu: 0.0 ± 0.0
1.902TrpPhe: 1.902 ± 1.017
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.634TrpIle: 0.634 ± 0.339
1.268TrpLys: 1.268 ± 0.627
1.268TrpLeu: 1.268 ± 0.627
0.0TrpMet: 0.0 ± 0.0
1.902TrpAsn: 1.902 ± 0.288
0.634TrpPro: 0.634 ± 0.339
0.0TrpGln: 0.0 ± 0.0
0.634TrpArg: 0.634 ± 0.339
0.0TrpSer: 0.0 ± 0.0
0.634TrpThr: 0.634 ± 0.339
0.0TrpVal: 0.0 ± 0.0
0.634TrpTrp: 0.634 ± 0.339
2.536TrpTyr: 2.536 ± 1.356
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.902TyrAla: 1.902 ± 1.017
0.634TyrCys: 0.634 ± 0.339
1.268TyrAsp: 1.268 ± 0.678
1.902TyrGlu: 1.902 ± 0.288
1.902TyrPhe: 1.902 ± 0.288
3.805TyrGly: 3.805 ± 0.577
1.268TyrHis: 1.268 ± 0.678
1.268TyrIle: 1.268 ± 0.678
1.268TyrLys: 1.268 ± 0.678
2.536TyrLeu: 2.536 ± 1.356
1.902TyrMet: 1.902 ± 1.017
1.268TyrAsn: 1.268 ± 0.627
1.268TyrPro: 1.268 ± 0.627
0.634TyrGln: 0.634 ± 0.966
0.634TyrArg: 0.634 ± 0.339
3.171TyrSer: 3.171 ± 0.916
4.439TyrThr: 4.439 ± 1.543
1.902TyrVal: 1.902 ± 1.017
1.268TyrTrp: 1.268 ± 0.678
0.634TyrTyr: 0.634 ± 0.966
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1578 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski