Amino acid dipepetide frequency for Beihai picorna-like virus 5

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
8.775AlaAla: 8.775 ± 1.472
1.053AlaCys: 1.053 ± 0.542
3.51AlaAsp: 3.51 ± 0.628
4.914AlaGlu: 4.914 ± 0.703
1.053AlaPhe: 1.053 ± 0.067
4.212AlaGly: 4.212 ± 2.701
2.457AlaHis: 2.457 ± 0.047
3.861AlaIle: 3.861 ± 0.161
4.563AlaLys: 4.563 ± 1.13
7.02AlaLeu: 7.02 ± 1.178
1.755AlaMet: 1.755 ± 0.402
2.808AlaAsn: 2.808 ± 0.989
5.265AlaPro: 5.265 ± 0.942
1.755AlaGln: 1.755 ± 0.294
2.808AlaArg: 2.808 ± 0.836
5.265AlaSer: 5.265 ± 2.159
5.265AlaThr: 5.265 ± 0.942
3.159AlaVal: 3.159 ± 0.408
2.106AlaTrp: 2.106 ± 0.742
2.457AlaTyr: 2.457 ± 0.561
0.0AlaXaa: 0.0 ± 0.0
Cys
1.053CysAla: 1.053 ± 0.542
0.351CysCys: 0.351 ± 0.181
1.404CysAsp: 1.404 ± 0.114
0.702CysGlu: 0.702 ± 0.361
0.351CysPhe: 0.351 ± 0.428
1.755CysGly: 1.755 ± 0.294
0.0CysHis: 0.0 ± 0.0
0.351CysIle: 0.351 ± 0.181
1.053CysLys: 1.053 ± 0.542
0.351CysLeu: 0.351 ± 0.181
0.702CysMet: 0.702 ± 0.361
1.404CysAsn: 1.404 ± 0.722
1.755CysPro: 1.755 ± 0.314
0.351CysGln: 0.351 ± 0.181
0.702CysArg: 0.702 ± 0.361
1.404CysSer: 1.404 ± 1.103
1.053CysThr: 1.053 ± 0.542
0.351CysVal: 0.351 ± 0.181
0.0CysTrp: 0.0 ± 0.0
1.053CysTyr: 1.053 ± 1.284
0.0CysXaa: 0.0 ± 0.0
Asp
4.212AspAla: 4.212 ± 0.267
1.053AspCys: 1.053 ± 0.067
5.265AspAsp: 5.265 ± 1.492
5.967AspGlu: 5.967 ± 1.853
3.51AspPhe: 3.51 ± 0.628
3.159AspGly: 3.159 ± 0.2
1.755AspHis: 1.755 ± 0.294
2.106AspIle: 2.106 ± 1.083
2.808AspLys: 2.808 ± 1.444
4.563AspLeu: 4.563 ± 0.695
2.106AspMet: 2.106 ± 0.742
1.755AspAsn: 1.755 ± 1.531
2.457AspPro: 2.457 ± 1.778
1.053AspGln: 1.053 ± 0.067
2.808AspArg: 2.808 ± 0.228
2.457AspSer: 2.457 ± 0.655
1.755AspThr: 1.755 ± 0.903
2.457AspVal: 2.457 ± 0.047
1.404AspTrp: 1.404 ± 0.114
5.265AspTyr: 5.265 ± 1.492
0.0AspXaa: 0.0 ± 0.0
Glu
5.616GluAla: 5.616 ± 1.672
1.755GluCys: 1.755 ± 0.294
2.808GluAsp: 2.808 ± 0.381
8.775GluGlu: 8.775 ± 2.08
3.861GluPhe: 3.861 ± 0.769
3.861GluGly: 3.861 ± 0.447
3.159GluHis: 3.159 ± 1.625
4.212GluIle: 4.212 ± 0.95
5.265GluLys: 5.265 ± 2.1
3.51GluLeu: 3.51 ± 0.02
2.808GluMet: 2.808 ± 0.381
3.861GluAsn: 3.861 ± 0.447
3.159GluPro: 3.159 ± 0.809
2.106GluGln: 2.106 ± 1.083
2.106GluArg: 2.106 ± 0.133
3.159GluSer: 3.159 ± 1.417
4.563GluThr: 4.563 ± 0.522
5.265GluVal: 5.265 ± 0.883
3.159GluTrp: 3.159 ± 0.408
4.563GluTyr: 4.563 ± 0.522
0.0GluXaa: 0.0 ± 0.0
Phe
2.808PheAla: 2.808 ± 0.381
0.702PheCys: 0.702 ± 0.361
2.106PheAsp: 2.106 ± 1.083
2.808PheGlu: 2.808 ± 0.228
3.159PhePhe: 3.159 ± 1.017
3.51PheGly: 3.51 ± 0.628
2.106PheHis: 2.106 ± 0.475
1.755PheIle: 1.755 ± 0.294
4.212PheLys: 4.212 ± 0.95
6.318PheLeu: 6.318 ± 0.4
1.404PheMet: 1.404 ± 0.114
2.106PheAsn: 2.106 ± 0.475
1.755PhePro: 1.755 ± 0.922
2.808PheGln: 2.808 ± 0.228
2.106PheArg: 2.106 ± 1.083
3.861PheSer: 3.861 ± 0.161
2.808PheThr: 2.808 ± 1.444
2.808PheVal: 2.808 ± 1.598
0.702PheTrp: 0.702 ± 0.247
1.755PheTyr: 1.755 ± 0.314
0.0PheXaa: 0.0 ± 0.0
Gly
3.51GlyAla: 3.51 ± 0.628
1.755GlyCys: 1.755 ± 0.314
5.967GlyAsp: 5.967 ± 0.027
4.563GlyGlu: 4.563 ± 3.128
3.51GlyPhe: 3.51 ± 1.197
4.563GlyGly: 4.563 ± 1.912
1.755GlyHis: 1.755 ± 0.314
3.51GlyIle: 3.51 ± 1.197
3.51GlyLys: 3.51 ± 1.197
5.967GlyLeu: 5.967 ± 1.189
1.053GlyMet: 1.053 ± 0.542
1.755GlyAsn: 1.755 ± 0.314
3.159GlyPro: 3.159 ± 0.809
2.457GlyGln: 2.457 ± 1.17
3.159GlyArg: 3.159 ± 1.417
2.808GlySer: 2.808 ± 1.598
3.861GlyThr: 3.861 ± 1.664
3.861GlyVal: 3.861 ± 0.447
0.351GlyTrp: 0.351 ± 0.428
4.212GlyTyr: 4.212 ± 0.267
0.0GlyXaa: 0.0 ± 0.0
His
1.755HisAla: 1.755 ± 0.314
0.0HisCys: 0.0 ± 0.0
1.404HisAsp: 1.404 ± 0.722
1.755HisGlu: 1.755 ± 0.294
2.106HisPhe: 2.106 ± 0.475
1.755HisGly: 1.755 ± 0.903
0.351HisHis: 0.351 ± 0.181
3.159HisIle: 3.159 ± 1.017
0.351HisLys: 0.351 ± 0.181
1.755HisLeu: 1.755 ± 0.922
0.702HisMet: 0.702 ± 0.361
1.404HisAsn: 1.404 ± 0.495
2.808HisPro: 2.808 ± 0.381
1.755HisGln: 1.755 ± 0.922
1.404HisArg: 1.404 ± 1.103
2.808HisSer: 2.808 ± 0.989
0.351HisThr: 0.351 ± 0.181
3.159HisVal: 3.159 ± 0.2
0.0HisTrp: 0.0 ± 0.0
0.351HisTyr: 0.351 ± 0.181
0.0HisXaa: 0.0 ± 0.0
Ile
5.967IleAla: 5.967 ± 0.636
0.702IleCys: 0.702 ± 0.361
4.914IleAsp: 4.914 ± 0.703
3.159IleGlu: 3.159 ± 1.625
2.106IlePhe: 2.106 ± 0.133
4.914IleGly: 4.914 ± 0.703
0.702IleHis: 0.702 ± 0.361
2.457IleIle: 2.457 ± 0.047
2.457IleLys: 2.457 ± 0.047
5.265IleLeu: 5.265 ± 2.1
0.702IleMet: 0.702 ± 0.361
2.106IleAsn: 2.106 ± 0.742
1.755IlePro: 1.755 ± 0.314
1.755IleGln: 1.755 ± 0.903
3.861IleArg: 3.861 ± 1.378
3.159IleSer: 3.159 ± 0.2
2.457IleThr: 2.457 ± 0.047
4.212IleVal: 4.212 ± 1.484
0.0IleTrp: 0.0 ± 0.0
1.755IleTyr: 1.755 ± 0.294
0.0IleXaa: 0.0 ± 0.0
Lys
3.51LysAla: 3.51 ± 1.197
1.755LysCys: 1.755 ± 0.294
3.159LysAsp: 3.159 ± 1.625
3.159LysGlu: 3.159 ± 1.017
3.861LysPhe: 3.861 ± 1.378
0.702LysGly: 0.702 ± 0.361
1.404LysHis: 1.404 ± 0.722
3.159LysIle: 3.159 ± 1.017
2.106LysLys: 2.106 ± 1.083
6.318LysLeu: 6.318 ± 1.425
1.755LysMet: 1.755 ± 0.294
2.457LysAsn: 2.457 ± 0.655
1.755LysPro: 1.755 ± 0.314
2.106LysGln: 2.106 ± 0.742
2.106LysArg: 2.106 ± 0.475
3.861LysSer: 3.861 ± 1.378
1.755LysThr: 1.755 ± 0.294
2.106LysVal: 2.106 ± 0.133
0.351LysTrp: 0.351 ± 0.181
3.51LysTyr: 3.51 ± 1.197
0.0LysXaa: 0.0 ± 0.0
Leu
8.424LeuAla: 8.424 ± 0.534
1.053LeuCys: 1.053 ± 0.675
5.265LeuAsp: 5.265 ± 0.275
5.616LeuGlu: 5.616 ± 1.064
4.563LeuPhe: 4.563 ± 1.13
3.159LeuGly: 3.159 ± 0.809
2.808LeuHis: 2.808 ± 0.989
2.457LeuIle: 2.457 ± 1.264
4.563LeuLys: 4.563 ± 1.13
5.616LeuLeu: 5.616 ± 1.064
1.053LeuMet: 1.053 ± 0.542
4.563LeuAsn: 4.563 ± 1.303
5.616LeuPro: 5.616 ± 1.064
3.159LeuGln: 3.159 ± 0.408
7.722LeuArg: 7.722 ± 0.287
7.02LeuSer: 7.02 ± 0.039
3.51LeuThr: 3.51 ± 0.02
3.159LeuVal: 3.159 ± 1.017
1.404LeuTrp: 1.404 ± 0.722
1.755LeuTyr: 1.755 ± 0.922
0.0LeuXaa: 0.0 ± 0.0
Met
2.457MetAla: 2.457 ± 0.561
0.702MetCys: 0.702 ± 0.361
1.404MetAsp: 1.404 ± 0.114
2.457MetGlu: 2.457 ± 0.655
1.755MetPhe: 1.755 ± 0.294
0.351MetGly: 0.351 ± 0.428
0.351MetHis: 0.351 ± 0.181
2.808MetIle: 2.808 ± 0.228
2.106MetLys: 2.106 ± 1.083
2.106MetLeu: 2.106 ± 0.133
1.404MetMet: 1.404 ± 0.722
0.351MetAsn: 0.351 ± 0.181
1.404MetPro: 1.404 ± 0.722
1.404MetGln: 1.404 ± 0.114
1.755MetArg: 1.755 ± 0.294
1.755MetSer: 1.755 ± 0.314
1.755MetThr: 1.755 ± 0.314
1.404MetVal: 1.404 ± 0.495
0.351MetTrp: 0.351 ± 0.181
0.702MetTyr: 0.702 ± 0.247
0.0MetXaa: 0.0 ± 0.0
Asn
2.106AsnAla: 2.106 ± 0.133
1.053AsnCys: 1.053 ± 0.542
1.755AsnAsp: 1.755 ± 0.922
2.457AsnGlu: 2.457 ± 1.17
1.755AsnPhe: 1.755 ± 0.294
4.914AsnGly: 4.914 ± 2.339
1.404AsnHis: 1.404 ± 0.495
2.808AsnIle: 2.808 ± 0.989
1.053AsnLys: 1.053 ± 0.067
1.755AsnLeu: 1.755 ± 0.922
0.702AsnMet: 0.702 ± 0.247
1.404AsnAsn: 1.404 ± 0.114
3.159AsnPro: 3.159 ± 0.408
0.702AsnGln: 0.702 ± 0.361
2.808AsnArg: 2.808 ± 0.381
2.808AsnSer: 2.808 ± 0.836
2.457AsnThr: 2.457 ± 0.047
0.702AsnVal: 0.702 ± 0.247
1.755AsnTrp: 1.755 ± 0.314
1.755AsnTyr: 1.755 ± 0.294
0.0AsnXaa: 0.0 ± 0.0
Pro
2.808ProAla: 2.808 ± 0.381
0.351ProCys: 0.351 ± 0.428
3.51ProAsp: 3.51 ± 0.628
2.457ProGlu: 2.457 ± 0.655
3.861ProPhe: 3.861 ± 1.056
2.808ProGly: 2.808 ± 0.381
3.159ProHis: 3.159 ± 0.2
2.808ProIle: 2.808 ± 0.381
2.106ProLys: 2.106 ± 1.083
4.914ProLeu: 4.914 ± 0.094
1.053ProMet: 1.053 ± 0.067
2.106ProAsn: 2.106 ± 0.475
1.755ProPro: 1.755 ± 0.294
1.404ProGln: 1.404 ± 0.495
1.404ProArg: 1.404 ± 0.495
1.755ProSer: 1.755 ± 0.294
4.563ProThr: 4.563 ± 2.52
3.51ProVal: 3.51 ± 0.628
0.351ProTrp: 0.351 ± 0.428
2.808ProTyr: 2.808 ± 0.989
0.0ProXaa: 0.0 ± 0.0
Gln
3.159GlnAla: 3.159 ± 0.408
0.702GlnCys: 0.702 ± 0.361
1.053GlnAsp: 1.053 ± 0.542
2.457GlnGlu: 2.457 ± 0.047
0.702GlnPhe: 0.702 ± 0.247
2.457GlnGly: 2.457 ± 0.655
0.702GlnHis: 0.702 ± 0.247
2.808GlnIle: 2.808 ± 0.836
1.755GlnLys: 1.755 ± 0.294
2.808GlnLeu: 2.808 ± 0.228
0.702GlnMet: 0.702 ± 0.361
0.702GlnAsn: 0.702 ± 0.361
1.755GlnPro: 1.755 ± 1.531
1.404GlnGln: 1.404 ± 1.103
0.702GlnArg: 0.702 ± 0.361
1.755GlnSer: 1.755 ± 1.531
1.404GlnThr: 1.404 ± 0.722
2.457GlnVal: 2.457 ± 0.561
0.0GlnTrp: 0.0 ± 0.0
0.351GlnTyr: 0.351 ± 0.181
0.0GlnXaa: 0.0 ± 0.0
Arg
2.457ArgAla: 2.457 ± 0.561
0.351ArgCys: 0.351 ± 0.181
2.808ArgAsp: 2.808 ± 0.228
4.563ArgGlu: 4.563 ± 2.347
4.563ArgPhe: 4.563 ± 1.303
4.212ArgGly: 4.212 ± 0.267
1.755ArgHis: 1.755 ± 0.922
4.212ArgIle: 4.212 ± 0.341
1.404ArgLys: 1.404 ± 0.722
3.159ArgLeu: 3.159 ± 0.809
2.457ArgMet: 2.457 ± 0.047
0.702ArgAsn: 0.702 ± 0.247
2.808ArgPro: 2.808 ± 1.598
0.351ArgGln: 0.351 ± 0.181
4.212ArgArg: 4.212 ± 0.875
4.563ArgSer: 4.563 ± 0.522
2.457ArgThr: 2.457 ± 0.047
3.51ArgVal: 3.51 ± 0.02
0.702ArgTrp: 0.702 ± 0.361
1.755ArgTyr: 1.755 ± 0.294
0.0ArgXaa: 0.0 ± 0.0
Ser
4.563SerAla: 4.563 ± 0.695
1.404SerCys: 1.404 ± 0.114
3.51SerAsp: 3.51 ± 0.02
6.318SerGlu: 6.318 ± 0.816
2.457SerPhe: 2.457 ± 1.17
5.265SerGly: 5.265 ± 1.55
0.351SerHis: 0.351 ± 0.428
3.861SerIle: 3.861 ± 0.161
3.159SerLys: 3.159 ± 1.417
6.318SerLeu: 6.318 ± 2.033
1.755SerMet: 1.755 ± 0.294
3.51SerAsn: 3.51 ± 0.02
2.457SerPro: 2.457 ± 0.561
1.404SerGln: 1.404 ± 0.495
2.457SerArg: 2.457 ± 0.561
3.159SerSer: 3.159 ± 1.017
2.457SerThr: 2.457 ± 1.17
5.616SerVal: 5.616 ± 0.153
1.053SerTrp: 1.053 ± 0.067
2.808SerTyr: 2.808 ± 1.598
0.0SerXaa: 0.0 ± 0.0
Thr
3.51ThrAla: 3.51 ± 1.236
0.351ThrCys: 0.351 ± 0.181
2.808ThrAsp: 2.808 ± 0.989
5.265ThrGlu: 5.265 ± 0.275
2.106ThrPhe: 2.106 ± 1.083
4.914ThrGly: 4.914 ± 0.514
1.755ThrHis: 1.755 ± 0.922
3.159ThrIle: 3.159 ± 0.809
3.159ThrLys: 3.159 ± 1.017
3.861ThrLeu: 3.861 ± 1.056
2.457ThrMet: 2.457 ± 0.047
1.053ThrAsn: 1.053 ± 0.067
2.457ThrPro: 2.457 ± 0.047
1.755ThrGln: 1.755 ± 0.294
2.106ThrArg: 2.106 ± 0.475
3.159ThrSer: 3.159 ± 0.2
4.212ThrThr: 4.212 ± 3.309
5.265ThrVal: 5.265 ± 0.275
0.0ThrTrp: 0.0 ± 0.0
2.457ThrTyr: 2.457 ± 1.17
0.0ThrXaa: 0.0 ± 0.0
Val
4.563ValAla: 4.563 ± 2.52
0.702ValCys: 0.702 ± 0.247
2.106ValAsp: 2.106 ± 0.133
5.967ValGlu: 5.967 ± 0.581
2.106ValPhe: 2.106 ± 0.133
4.212ValGly: 4.212 ± 1.484
2.106ValHis: 2.106 ± 0.475
1.755ValIle: 1.755 ± 0.294
3.159ValLys: 3.159 ± 0.408
3.861ValLeu: 3.861 ± 0.769
1.053ValMet: 1.053 ± 0.542
2.808ValAsn: 2.808 ± 0.989
2.106ValPro: 2.106 ± 0.475
1.404ValGln: 1.404 ± 0.722
5.616ValArg: 5.616 ± 0.762
4.914ValSer: 4.914 ± 0.094
3.51ValThr: 3.51 ± 0.02
4.914ValVal: 4.914 ± 0.703
1.053ValTrp: 1.053 ± 0.542
2.457ValTyr: 2.457 ± 0.561
0.0ValXaa: 0.0 ± 0.0
Trp
0.702TrpAla: 0.702 ± 0.361
0.0TrpCys: 0.0 ± 0.0
0.702TrpAsp: 0.702 ± 0.361
0.702TrpGlu: 0.702 ± 0.361
1.053TrpPhe: 1.053 ± 0.067
1.053TrpGly: 1.053 ± 0.542
0.0TrpHis: 0.0 ± 0.0
0.351TrpIle: 0.351 ± 0.428
1.053TrpLys: 1.053 ± 0.542
2.106TrpLeu: 2.106 ± 0.133
1.404TrpMet: 1.404 ± 0.495
0.702TrpAsn: 0.702 ± 0.247
0.351TrpPro: 0.351 ± 0.181
0.351TrpGln: 0.351 ± 0.181
2.106TrpArg: 2.106 ± 0.133
0.351TrpSer: 0.351 ± 0.181
1.053TrpThr: 1.053 ± 1.284
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.053TrpTyr: 1.053 ± 0.542
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.457TyrAla: 2.457 ± 0.561
0.351TyrCys: 0.351 ± 0.428
2.457TyrAsp: 2.457 ± 0.047
3.51TyrGlu: 3.51 ± 0.628
3.159TyrPhe: 3.159 ± 1.017
3.51TyrGly: 3.51 ± 1.236
1.053TyrHis: 1.053 ± 0.675
2.808TyrIle: 2.808 ± 0.228
1.053TyrLys: 1.053 ± 0.067
4.914TyrLeu: 4.914 ± 1.311
1.404TyrMet: 1.404 ± 0.305
1.755TyrAsn: 1.755 ± 0.922
1.755TyrPro: 1.755 ± 0.903
0.351TyrGln: 0.351 ± 0.181
1.053TyrArg: 1.053 ± 0.675
3.861TyrSer: 3.861 ± 1.056
4.563TyrThr: 4.563 ± 0.086
2.457TyrVal: 2.457 ± 0.047
0.351TyrTrp: 0.351 ± 0.181
2.106TyrTyr: 2.106 ± 0.133
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2850 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski