Amino acid dipeptide frequency for Beihai weivirus-like virus 4

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
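The expectation above can be checked with a few lines of Python. This is only a minimal sketch: the function names `expected_per_mille` and `classify` are made up for illustration, and the simple greater-than/less-than rule is an assumption, since the page does not state the exact threshold used for the red and blue marking.

```python
from itertools import product

# Three-letter codes of the 20 standard amino acids, in the table's column order.
STANDARD = [
    "Ala", "Cys", "Asp", "Glu", "Phe", "Gly", "His", "Ile", "Lys", "Leu",
    "Met", "Asn", "Pro", "Gln", "Arg", "Ser", "Thr", "Val", "Trp", "Tyr",
]


def expected_per_mille(alphabet_size: int = 20) -> float:
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / (alphabet_size ** 2)


def classify(freq_per_mille: float, expected: float) -> str:
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "expected"


n_standard = len(list(product(STANDARD, repeat=2)))            # 400 ordered pairs
n_with_xaa = len(list(product(STANDARD + ["Xaa"], repeat=2)))  # 441 ordered pairs
expected = expected_per_mille(20)                              # 2.5 per mille = 0.25%

# Two values taken from the table: AlaAla (15.707) and AlaCys (0.873).
print(classify(15.707, expected), classify(0.873, expected))
```

With the 20-letter alphabet the expectation is 2.5‰, so AlaAla would count as overrepresented and AlaCys as underrepresented under this assumed rule.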

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
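As an illustration of how per mille values of this kind can be obtained, the sketch below counts adjacent residue pairs in a set of protein sequences. It is a sketch under stated assumptions, not the database's own code: the function name `dipeptide_per_mille` and the toy sequences are hypothetical, one-letter codes are used for brevity, and it is assumed that pairs are counted only within a protein and normalised by the total number of pairs.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all given sequences.

    Assumption: pairs are counted within each sequence only (never across
    protein boundaries) and normalised by the total number of pairs.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy input, not the real proteome.
freqs = dipeptide_per_mille(["AAC", "AA"])
for pair, value in sorted(freqs.items()):
    # Multiply by 10^-3 to turn per mille back into a plain fraction.
    print(pair, round(value, 3), round(value * 1e-3, 6))
```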

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
15.707AlaAla: 15.707 ± 2.394
0.873AlaCys: 0.873 ± 0.49
4.363AlaAsp: 4.363 ± 2.371
3.49AlaGlu: 3.49 ± 1.961
2.618AlaPhe: 2.618 ± 0.137
7.853AlaGly: 7.853 ± 1.197
0.0AlaHis: 0.0 ± 0.0
3.49AlaIle: 3.49 ± 1.961
4.363AlaLys: 4.363 ± 0.764
10.471AlaLeu: 10.471 ± 1.06
4.363AlaMet: 4.363 ± 0.653
5.236AlaAsn: 5.236 ± 3.488
5.236AlaPro: 5.236 ± 1.334
3.49AlaGln: 3.49 ± 0.353
6.108AlaArg: 6.108 ± 3.431
11.344AlaSer: 11.344 ± 3.271
7.853AlaThr: 7.853 ± 3.624
6.108AlaVal: 6.108 ± 0.217
2.618AlaTrp: 2.618 ± 0.137
2.618AlaTyr: 2.618 ± 0.137
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
1.745CysGlu: 1.745 ± 0.98
1.745CysPhe: 1.745 ± 0.98
0.873CysGly: 0.873 ± 0.49
0.873CysHis: 0.873 ± 0.49
0.873CysIle: 0.873 ± 1.117
0.0CysLys: 0.0 ± 0.0
1.745CysLeu: 1.745 ± 0.98
0.873CysMet: 0.873 ± 0.49
0.873CysAsn: 0.873 ± 0.49
1.745CysPro: 1.745 ± 0.627
1.745CysGln: 1.745 ± 0.98
0.0CysArg: 0.0 ± 0.0
0.0CysSer: 0.0 ± 0.0
3.49CysThr: 3.49 ± 1.961
0.0CysVal: 0.0 ± 0.0
0.873CysTrp: 0.873 ± 1.117
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
5.236AspAla: 5.236 ± 0.273
0.0AspCys: 0.0 ± 0.0
2.618AspAsp: 2.618 ± 1.47
3.49AspGlu: 3.49 ± 1.961
3.49AspPhe: 3.49 ± 0.353
6.108AspGly: 6.108 ± 1.824
2.618AspHis: 2.618 ± 0.137
0.873AspIle: 0.873 ± 1.117
2.618AspLys: 2.618 ± 1.744
4.363AspLeu: 4.363 ± 0.764
0.873AspMet: 0.873 ± 1.117
1.745AspAsn: 1.745 ± 2.234
0.873AspPro: 0.873 ± 0.49
0.0AspGln: 0.0 ± 0.0
4.363AspArg: 4.363 ± 2.451
4.363AspSer: 4.363 ± 2.371
2.618AspThr: 2.618 ± 0.137
3.49AspVal: 3.49 ± 0.353
0.0AspTrp: 0.0 ± 0.0
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
5.236GluAla: 5.236 ± 1.334
1.745GluCys: 1.745 ± 0.98
2.618GluAsp: 2.618 ± 1.47
6.108GluGlu: 6.108 ± 3.431
1.745GluPhe: 1.745 ± 0.98
5.236GluGly: 5.236 ± 1.334
3.49GluHis: 3.49 ± 0.353
3.49GluIle: 3.49 ± 0.353
0.873GluLys: 0.873 ± 0.49
6.108GluLeu: 6.108 ± 0.217
0.873GluMet: 0.873 ± 0.49
2.618GluAsn: 2.618 ± 1.47
4.363GluPro: 4.363 ± 2.451
2.618GluGln: 2.618 ± 1.47
5.236GluArg: 5.236 ± 1.334
2.618GluSer: 2.618 ± 1.47
2.618GluThr: 2.618 ± 1.47
3.49GluVal: 3.49 ± 0.353
0.0GluTrp: 0.0 ± 0.0
3.49GluTyr: 3.49 ± 1.254
0.0GluXaa: 0.0 ± 0.0
Phe
1.745PheAla: 1.745 ± 0.98
0.873PheCys: 0.873 ± 0.49
0.873PheAsp: 0.873 ± 1.117
4.363PheGlu: 4.363 ± 2.451
1.745PhePhe: 1.745 ± 0.627
1.745PheGly: 1.745 ± 0.627
0.0PheHis: 0.0 ± 0.0
0.0PheIle: 0.0 ± 0.0
2.618PheLys: 2.618 ± 0.137
2.618PheLeu: 2.618 ± 1.47
0.0PheMet: 0.0 ± 0.0
2.618PheAsn: 2.618 ± 0.137
0.0PhePro: 0.0 ± 0.0
0.873PheGln: 0.873 ± 1.117
2.618PheArg: 2.618 ± 1.47
2.618PheSer: 2.618 ± 1.47
1.745PheThr: 1.745 ± 0.627
2.618PheVal: 2.618 ± 0.137
0.873PheTrp: 0.873 ± 0.49
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
4.363GlyAla: 4.363 ± 2.371
2.618GlyCys: 2.618 ± 1.47
6.108GlyAsp: 6.108 ± 1.824
6.108GlyGlu: 6.108 ± 0.217
0.873GlyPhe: 0.873 ± 0.49
4.363GlyGly: 4.363 ± 2.371
2.618GlyHis: 2.618 ± 0.137
2.618GlyIle: 2.618 ± 1.47
3.49GlyLys: 3.49 ± 0.353
5.236GlyLeu: 5.236 ± 1.334
0.0GlyMet: 0.0 ± 0.618
0.873GlyAsn: 0.873 ± 1.117
2.618GlyPro: 2.618 ± 0.137
3.49GlyGln: 3.49 ± 1.961
5.236GlyArg: 5.236 ± 1.881
6.108GlySer: 6.108 ± 2.998
3.49GlyThr: 3.49 ± 2.861
10.471GlyVal: 10.471 ± 2.154
2.618GlyTrp: 2.618 ± 0.137
2.618GlyTyr: 2.618 ± 1.47
0.0GlyXaa: 0.0 ± 0.0
His
2.618HisAla: 2.618 ± 0.137
0.0HisCys: 0.0 ± 0.0
0.873HisAsp: 0.873 ± 0.49
0.873HisGlu: 0.873 ± 1.117
0.873HisPhe: 0.873 ± 0.49
1.745HisGly: 1.745 ± 0.627
1.745HisHis: 1.745 ± 2.234
1.745HisIle: 1.745 ± 0.98
0.0HisLys: 0.0 ± 0.0
0.873HisLeu: 0.873 ± 1.117
1.745HisMet: 1.745 ± 0.98
0.873HisAsn: 0.873 ± 1.117
3.49HisPro: 3.49 ± 2.861
0.0HisGln: 0.0 ± 0.0
2.618HisArg: 2.618 ± 1.47
0.873HisSer: 0.873 ± 0.49
0.0HisThr: 0.0 ± 0.0
3.49HisVal: 3.49 ± 1.254
0.873HisTrp: 0.873 ± 0.49
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
3.49IleAla: 3.49 ± 1.254
1.745IleCys: 1.745 ± 0.627
1.745IleAsp: 1.745 ± 0.627
2.618IleGlu: 2.618 ± 1.47
1.745IlePhe: 1.745 ± 0.98
6.108IleGly: 6.108 ± 0.217
0.0IleHis: 0.0 ± 0.0
1.745IleIle: 1.745 ± 2.234
1.745IleLys: 1.745 ± 0.98
1.745IleLeu: 1.745 ± 0.98
2.618IleMet: 2.618 ± 0.137
1.745IleAsn: 1.745 ± 2.234
0.873IlePro: 0.873 ± 0.49
0.0IleGln: 0.0 ± 0.0
3.49IleArg: 3.49 ± 0.353
0.873IleSer: 0.873 ± 1.117
2.618IleThr: 2.618 ± 0.137
1.745IleVal: 1.745 ± 0.627
0.873IleTrp: 0.873 ± 0.49
0.873IleTyr: 0.873 ± 1.117
0.0IleXaa: 0.0 ± 0.0
Lys
8.726LysAla: 8.726 ± 1.687
0.873LysCys: 0.873 ± 0.49
1.745LysAsp: 1.745 ± 0.98
1.745LysGlu: 1.745 ± 0.98
0.0LysPhe: 0.0 ± 0.0
1.745LysGly: 1.745 ± 0.627
1.745LysHis: 1.745 ± 0.627
1.745LysIle: 1.745 ± 0.98
2.618LysLys: 2.618 ± 1.47
5.236LysLeu: 5.236 ± 1.881
0.873LysMet: 0.873 ± 0.49
0.873LysAsn: 0.873 ± 0.49
4.363LysPro: 4.363 ± 0.844
0.0LysGln: 0.0 ± 0.0
2.618LysArg: 2.618 ± 0.137
3.49LysSer: 3.49 ± 2.861
1.745LysThr: 1.745 ± 0.98
0.873LysVal: 0.873 ± 0.49
1.745LysTrp: 1.745 ± 0.627
0.873LysTyr: 0.873 ± 0.49
0.0LysXaa: 0.0 ± 0.0
Leu
10.471LeuAla: 10.471 ± 0.547
2.618LeuCys: 2.618 ± 0.137
3.49LeuAsp: 3.49 ± 0.353
8.726LeuGlu: 8.726 ± 0.08
1.745LeuPhe: 1.745 ± 0.98
6.108LeuGly: 6.108 ± 2.998
1.745LeuHis: 1.745 ± 0.627
2.618LeuIle: 2.618 ± 0.137
0.873LeuLys: 0.873 ± 0.49
5.236LeuLeu: 5.236 ± 1.881
1.745LeuMet: 1.745 ± 0.98
2.618LeuAsn: 2.618 ± 1.47
4.363LeuPro: 4.363 ± 0.844
0.0LeuGln: 0.0 ± 0.0
7.853LeuArg: 7.853 ± 1.197
6.981LeuSer: 6.981 ± 0.9
2.618LeuThr: 2.618 ± 1.744
5.236LeuVal: 5.236 ± 1.334
1.745LeuTrp: 1.745 ± 0.98
0.873LeuTyr: 0.873 ± 0.49
0.0LeuXaa: 0.0 ± 0.0
Met
2.618MetAla: 2.618 ± 1.47
0.0MetCys: 0.0 ± 0.0
2.618MetAsp: 2.618 ± 0.137
1.745MetGlu: 1.745 ± 0.98
0.0MetPhe: 0.0 ± 0.0
0.0MetGly: 0.0 ± 0.0
0.0MetHis: 0.0 ± 0.0
0.873MetIle: 0.873 ± 1.117
1.745MetLys: 1.745 ± 0.98
3.49MetLeu: 3.49 ± 1.254
0.873MetMet: 0.873 ± 0.49
1.745MetAsn: 1.745 ± 2.234
0.873MetPro: 0.873 ± 0.49
1.745MetGln: 1.745 ± 0.98
0.873MetArg: 0.873 ± 0.49
5.236MetSer: 5.236 ± 0.273
2.618MetThr: 2.618 ± 0.137
2.618MetVal: 2.618 ± 0.137
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
0.0AsnCys: 0.0 ± 0.0
1.745AsnAsp: 1.745 ± 0.98
1.745AsnGlu: 1.745 ± 0.98
0.873AsnPhe: 0.873 ± 0.49
2.618AsnGly: 2.618 ± 1.744
0.873AsnHis: 0.873 ± 0.49
2.618AsnIle: 2.618 ± 3.351
2.618AsnLys: 2.618 ± 0.137
3.49AsnLeu: 3.49 ± 0.353
0.873AsnMet: 0.873 ± 0.49
0.873AsnAsn: 0.873 ± 1.117
3.49AsnPro: 3.49 ± 1.254
0.0AsnGln: 0.0 ± 0.0
1.745AsnArg: 1.745 ± 2.234
3.49AsnSer: 3.49 ± 1.254
1.745AsnThr: 1.745 ± 2.234
3.49AsnVal: 3.49 ± 0.353
0.873AsnTrp: 0.873 ± 0.49
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
6.108ProAla: 6.108 ± 1.39
0.873ProCys: 0.873 ± 0.49
3.49ProAsp: 3.49 ± 0.353
4.363ProGlu: 4.363 ± 2.451
0.873ProPhe: 0.873 ± 0.49
1.745ProGly: 1.745 ± 0.98
0.873ProHis: 0.873 ± 1.117
2.618ProIle: 2.618 ± 1.744
0.0ProLys: 0.0 ± 0.0
1.745ProLeu: 1.745 ± 0.627
0.873ProMet: 0.873 ± 0.49
2.618ProAsn: 2.618 ± 0.137
10.471ProPro: 10.471 ± 5.882
5.236ProGln: 5.236 ± 1.881
3.49ProArg: 3.49 ± 1.254
2.618ProSer: 2.618 ± 0.137
3.49ProThr: 3.49 ± 0.353
5.236ProVal: 5.236 ± 1.334
0.0ProTrp: 0.0 ± 0.0
0.873ProTyr: 0.873 ± 1.117
0.0ProXaa: 0.0 ± 0.0
Gln
5.236GlnAla: 5.236 ± 1.334
0.0GlnCys: 0.0 ± 0.0
0.0GlnAsp: 0.0 ± 0.0
0.873GlnGlu: 0.873 ± 0.49
1.745GlnPhe: 1.745 ± 0.627
4.363GlnGly: 4.363 ± 0.844
0.0GlnHis: 0.0 ± 0.0
1.745GlnIle: 1.745 ± 0.627
1.745GlnLys: 1.745 ± 0.98
2.618GlnLeu: 2.618 ± 0.137
0.873GlnMet: 0.873 ± 1.117
0.0GlnAsn: 0.0 ± 0.0
0.873GlnPro: 0.873 ± 1.117
0.873GlnGln: 0.873 ± 0.49
2.618GlnArg: 2.618 ± 1.744
2.618GlnSer: 2.618 ± 0.137
0.873GlnThr: 0.873 ± 0.49
3.49GlnVal: 3.49 ± 1.961
0.0GlnTrp: 0.0 ± 0.0
0.873GlnTyr: 0.873 ± 0.49
0.0GlnXaa: 0.0 ± 0.0
Arg
9.599ArgAla: 9.599 ± 2.177
1.745ArgCys: 1.745 ± 0.98
2.618ArgAsp: 2.618 ± 0.137
6.981ArgGlu: 6.981 ± 2.314
3.49ArgPhe: 3.49 ± 1.254
5.236ArgGly: 5.236 ± 1.881
3.49ArgHis: 3.49 ± 0.353
3.49ArgIle: 3.49 ± 0.353
4.363ArgLys: 4.363 ± 0.764
2.618ArgLeu: 2.618 ± 1.47
2.618ArgMet: 2.618 ± 0.137
2.618ArgAsn: 2.618 ± 0.137
1.745ArgPro: 1.745 ± 0.627
3.49ArgGln: 3.49 ± 0.353
4.363ArgArg: 4.363 ± 2.371
2.618ArgSer: 2.618 ± 1.47
3.49ArgThr: 3.49 ± 1.254
6.981ArgVal: 6.981 ± 0.9
1.745ArgTrp: 1.745 ± 0.98
0.0ArgTyr: 0.0 ± 0.0
0.0ArgXaa: 0.0 ± 0.0
Ser
8.726SerAla: 8.726 ± 6.349
1.745SerCys: 1.745 ± 0.98
6.108SerAsp: 6.108 ± 2.998
2.618SerGlu: 2.618 ± 1.744
0.873SerPhe: 0.873 ± 0.49
5.236SerGly: 5.236 ± 1.334
0.873SerHis: 0.873 ± 1.117
1.745SerIle: 1.745 ± 0.98
4.363SerLys: 4.363 ± 0.844
9.599SerLeu: 9.599 ± 2.644
1.745SerMet: 1.745 ± 0.98
1.745SerAsn: 1.745 ± 0.98
4.363SerPro: 4.363 ± 0.764
1.745SerGln: 1.745 ± 0.98
4.363SerArg: 4.363 ± 2.371
5.236SerSer: 5.236 ± 5.095
2.618SerThr: 2.618 ± 0.137
6.108SerVal: 6.108 ± 3.431
2.618SerTrp: 2.618 ± 0.137
1.745SerTyr: 1.745 ± 0.627
0.0SerXaa: 0.0 ± 0.0
Thr
3.49ThrAla: 3.49 ± 1.961
0.873ThrCys: 0.873 ± 0.49
3.49ThrAsp: 3.49 ± 0.353
0.873ThrGlu: 0.873 ± 0.49
3.49ThrPhe: 3.49 ± 1.254
6.108ThrGly: 6.108 ± 2.998
1.745ThrHis: 1.745 ± 0.627
3.49ThrIle: 3.49 ± 1.254
5.236ThrLys: 5.236 ± 1.881
3.49ThrLeu: 3.49 ± 0.353
2.618ThrMet: 2.618 ± 0.137
0.0ThrAsn: 0.0 ± 0.0
2.618ThrPro: 2.618 ± 0.137
1.745ThrGln: 1.745 ± 2.234
2.618ThrArg: 2.618 ± 1.744
3.49ThrSer: 3.49 ± 1.961
3.49ThrThr: 3.49 ± 0.353
1.745ThrVal: 1.745 ± 2.234
0.873ThrTrp: 0.873 ± 1.117
0.873ThrTyr: 0.873 ± 0.49
0.0ThrXaa: 0.0 ± 0.0
Val
9.599ValAla: 9.599 ± 0.57
0.873ValCys: 0.873 ± 1.117
0.0ValAsp: 0.0 ± 0.0
6.108ValGlu: 6.108 ± 1.824
2.618ValPhe: 2.618 ± 1.47
8.726ValGly: 8.726 ± 0.08
0.873ValHis: 0.873 ± 1.117
1.745ValIle: 1.745 ± 0.98
2.618ValLys: 2.618 ± 1.47
5.236ValLeu: 5.236 ± 1.334
2.618ValMet: 2.618 ± 1.744
1.745ValAsn: 1.745 ± 0.98
2.618ValPro: 2.618 ± 1.744
2.618ValGln: 2.618 ± 1.744
8.726ValArg: 8.726 ± 1.687
6.108ValSer: 6.108 ± 0.217
2.618ValThr: 2.618 ± 0.137
8.726ValVal: 8.726 ± 1.687
0.0ValTrp: 0.0 ± 0.0
3.49ValTyr: 3.49 ± 0.353
0.0ValXaa: 0.0 ± 0.0
Trp
2.618TrpAla: 2.618 ± 0.137
0.0TrpCys: 0.0 ± 0.0
3.49TrpAsp: 3.49 ± 2.861
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.873TrpGly: 0.873 ± 1.117
0.873TrpHis: 0.873 ± 0.49
0.0TrpIle: 0.0 ± 0.0
0.873TrpLys: 0.873 ± 0.49
0.873TrpLeu: 0.873 ± 0.49
0.0TrpMet: 0.0 ± 0.0
0.873TrpAsn: 0.873 ± 0.49
0.873TrpPro: 0.873 ± 0.49
0.0TrpGln: 0.0 ± 0.0
2.618TrpArg: 2.618 ± 1.47
2.618TrpSer: 2.618 ± 0.137
0.0TrpThr: 0.0 ± 0.0
1.745TrpVal: 1.745 ± 0.98
1.745TrpTrp: 1.745 ± 0.98
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.618TyrAla: 2.618 ± 0.137
0.0TyrCys: 0.0 ± 0.0
1.745TyrAsp: 1.745 ± 0.98
0.0TyrGlu: 0.0 ± 0.0
0.0TyrPhe: 0.0 ± 0.0
0.0TyrGly: 0.0 ± 0.0
0.873TyrHis: 0.873 ± 0.49
0.873TyrIle: 0.873 ± 0.49
0.873TyrLys: 0.873 ± 0.49
0.873TyrLeu: 0.873 ± 0.49
1.745TyrMet: 1.745 ± 0.627
0.873TyrAsn: 0.873 ± 1.117
1.745TyrPro: 1.745 ± 0.627
1.745TyrGln: 1.745 ± 0.98
1.745TyrArg: 1.745 ± 0.627
0.873TyrSer: 0.873 ± 0.49
2.618TyrThr: 2.618 ± 1.744
0.0TyrVal: 0.0 ± 0.0
0.0TyrTrp: 0.0 ± 0.0
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1147 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
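A protein-level bootstrap can be sketched as follows. This is only one plausible reading of the note, under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the standard deviation of the replicates is reported as the error. The function names, the fixed seed, and the toy sequences are hypothetical; the database's actual procedure may differ.

```python
import random
import statistics


def pair_per_mille(sequences, pair):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    count = 0
    total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == pair:
                count += 1
    return 1000.0 * count / total if total else 0.0


def bootstrap_error(sequences, pair, rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = [rng.choice(sequences) for _ in sequences]
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Hypothetical toy proteins, not the real proteome.
toy = ["AAAA", "CCCC"]
print(round(pair_per_mille(toy, "AA"), 3), "±", round(bootstrap_error(toy, "AA"), 3))
```

With only two proteins, as in this proteome, each replicate can take only a handful of distinct values, so errors estimated this way are coarse.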

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski