Amino acid dipepetide frequency for Beihai picorna-like virus 41

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.717AlaAla: 5.717 ± 0.374
0.817AlaCys: 0.817 ± 0.966
4.083AlaAsp: 4.083 ± 1.261
2.45AlaGlu: 2.45 ± 0.757
4.492AlaPhe: 4.492 ± 0.317
4.083AlaGly: 4.083 ± 0.165
0.408AlaHis: 0.408 ± 0.231
2.858AlaIle: 2.858 ± 0.187
3.675AlaLys: 3.675 ± 0.779
7.758AlaLeu: 7.758 ± 0.613
4.9AlaMet: 4.9 ± 0.412
2.858AlaAsn: 2.858 ± 0.901
2.45AlaPro: 2.45 ± 0.67
2.858AlaGln: 2.858 ± 0.901
2.858AlaArg: 2.858 ± 0.526
4.9AlaSer: 4.9 ± 2.053
3.267AlaThr: 3.267 ± 1.845
2.45AlaVal: 2.45 ± 0.043
0.817AlaTrp: 0.817 ± 0.252
3.675AlaTyr: 3.675 ± 0.779
0.0AlaXaa: 0.0 ± 0.0
Cys
1.633CysAla: 1.633 ± 0.209
0.817CysCys: 0.817 ± 0.966
2.858CysAsp: 2.858 ± 1.953
1.225CysGlu: 1.225 ± 0.692
0.817CysPhe: 0.817 ± 0.966
2.45CysGly: 2.45 ± 0.67
0.408CysHis: 0.408 ± 0.483
1.225CysIle: 1.225 ± 0.735
2.042CysLys: 2.042 ± 0.439
1.225CysLeu: 1.225 ± 0.022
0.0CysMet: 0.0 ± 0.0
1.633CysAsn: 1.633 ± 0.209
0.817CysPro: 0.817 ± 0.252
1.225CysGln: 1.225 ± 0.735
1.225CysArg: 1.225 ± 0.022
3.267CysSer: 3.267 ± 0.296
0.817CysThr: 0.817 ± 0.252
1.633CysVal: 1.633 ± 0.209
0.408CysTrp: 0.408 ± 0.231
0.817CysTyr: 0.817 ± 0.461
0.0CysXaa: 0.0 ± 0.0
Asp
3.267AspAla: 3.267 ± 0.418
0.817AspCys: 0.817 ± 0.966
4.9AspAsp: 4.9 ± 2.227
8.167AspGlu: 8.167 ± 0.331
3.267AspPhe: 3.267 ± 0.296
4.083AspGly: 4.083 ± 0.165
0.0AspHis: 0.0 ± 0.0
4.9AspIle: 4.9 ± 0.627
2.45AspLys: 2.45 ± 0.757
5.717AspLeu: 5.717 ± 1.088
1.633AspMet: 1.633 ± 0.505
2.45AspAsn: 2.45 ± 0.043
1.633AspPro: 1.633 ± 0.209
3.267AspGln: 3.267 ± 0.296
2.858AspArg: 2.858 ± 0.901
5.308AspSer: 5.308 ± 0.144
1.633AspThr: 1.633 ± 0.505
4.083AspVal: 4.083 ± 0.879
0.408AspTrp: 0.408 ± 0.231
1.633AspTyr: 1.633 ± 0.209
0.0AspXaa: 0.0 ± 0.0
Glu
4.083GluAla: 4.083 ± 1.975
2.042GluCys: 2.042 ± 1.153
2.858GluAsp: 2.858 ± 0.187
4.492GluGlu: 4.492 ± 0.317
4.9GluPhe: 4.9 ± 1.514
2.042GluGly: 2.042 ± 1.153
1.225GluHis: 1.225 ± 0.735
4.083GluIle: 4.083 ± 0.548
9.392GluLys: 9.392 ± 1.736
4.492GluLeu: 4.492 ± 0.396
3.675GluMet: 3.675 ± 0.065
1.633GluAsn: 1.633 ± 0.505
1.225GluPro: 1.225 ± 0.735
2.45GluGln: 2.45 ± 0.67
3.267GluArg: 3.267 ± 0.418
3.267GluSer: 3.267 ± 1.845
4.9GluThr: 4.9 ± 0.087
5.717GluVal: 5.717 ± 0.374
0.817GluTrp: 0.817 ± 0.461
3.267GluTyr: 3.267 ± 0.418
0.0GluXaa: 0.0 ± 0.0
Phe
4.083PheAla: 4.083 ± 1.592
0.408PheCys: 0.408 ± 0.231
4.492PheAsp: 4.492 ± 0.317
2.042PheGlu: 2.042 ± 0.274
2.858PhePhe: 2.858 ± 0.187
2.858PheGly: 2.858 ± 0.901
1.225PheHis: 1.225 ± 0.022
1.225PheIle: 1.225 ± 0.022
2.042PheLys: 2.042 ± 0.274
3.675PheLeu: 3.675 ± 0.065
0.408PheMet: 0.408 ± 0.231
0.817PheAsn: 0.817 ± 0.252
2.042PhePro: 2.042 ± 0.274
1.633PheGln: 1.633 ± 0.505
4.083PheArg: 4.083 ± 2.688
3.675PheSer: 3.675 ± 1.362
2.858PheThr: 2.858 ± 1.953
3.267PheVal: 3.267 ± 0.418
1.225PheTrp: 1.225 ± 0.735
1.633PheTyr: 1.633 ± 0.505
0.0PheXaa: 0.0 ± 0.0
Gly
4.492GlyAla: 4.492 ± 1.031
2.45GlyCys: 2.45 ± 1.383
2.45GlyAsp: 2.45 ± 0.67
3.267GlyGlu: 3.267 ± 0.296
2.45GlyPhe: 2.45 ± 1.383
2.45GlyGly: 2.45 ± 0.67
0.0GlyHis: 0.0 ± 0.0
4.083GlyIle: 4.083 ± 0.548
2.858GlyLys: 2.858 ± 1.24
4.083GlyLeu: 4.083 ± 0.879
1.225GlyMet: 1.225 ± 0.588
1.633GlyAsn: 1.633 ± 0.209
1.225GlyPro: 1.225 ± 0.022
1.225GlyGln: 1.225 ± 0.022
0.817GlyArg: 0.817 ± 0.252
1.633GlySer: 1.633 ± 0.209
3.675GlyThr: 3.675 ± 1.362
3.675GlyVal: 3.675 ± 0.065
0.0GlyTrp: 0.0 ± 0.0
2.858GlyTyr: 2.858 ± 0.187
0.0GlyXaa: 0.0 ± 0.0
His
1.633HisAla: 1.633 ± 0.505
1.225HisCys: 1.225 ± 0.735
1.633HisAsp: 1.633 ± 1.218
0.0HisGlu: 0.0 ± 0.0
0.408HisPhe: 0.408 ± 0.483
1.225HisGly: 1.225 ± 0.735
0.817HisHis: 0.817 ± 0.966
0.817HisIle: 0.817 ± 0.461
1.225HisLys: 1.225 ± 0.692
2.042HisLeu: 2.042 ± 0.274
0.408HisMet: 0.408 ± 0.231
0.0HisAsn: 0.0 ± 0.0
1.633HisPro: 1.633 ± 0.209
0.817HisGln: 0.817 ± 0.252
1.633HisArg: 1.633 ± 0.505
1.633HisSer: 1.633 ± 0.505
0.408HisThr: 0.408 ± 0.231
0.817HisVal: 0.817 ± 0.461
0.408HisTrp: 0.408 ± 0.231
0.817HisTyr: 0.817 ± 0.252
0.0HisXaa: 0.0 ± 0.0
Ile
4.492IleAla: 4.492 ± 0.396
3.267IleCys: 3.267 ± 1.009
2.45IleAsp: 2.45 ± 1.47
4.083IleGlu: 4.083 ± 0.165
2.45IlePhe: 2.45 ± 2.184
2.042IleGly: 2.042 ± 1.701
0.408IleHis: 0.408 ± 0.483
2.858IleIle: 2.858 ± 1.614
5.308IleLys: 5.308 ± 0.144
5.308IleLeu: 5.308 ± 0.857
1.225IleMet: 1.225 ± 0.022
2.858IleAsn: 2.858 ± 0.526
2.45IlePro: 2.45 ± 0.67
2.042IleGln: 2.042 ± 1.153
4.083IleArg: 4.083 ± 1.261
5.308IleSer: 5.308 ± 0.57
2.042IleThr: 2.042 ± 0.274
3.675IleVal: 3.675 ± 1.362
0.0IleTrp: 0.0 ± 0.0
3.267IleTyr: 3.267 ± 0.418
0.0IleXaa: 0.0 ± 0.0
Lys
6.125LysAla: 6.125 ± 1.318
1.633LysCys: 1.633 ± 0.505
2.858LysAsp: 2.858 ± 0.187
6.125LysGlu: 6.125 ± 1.318
3.675LysPhe: 3.675 ± 1.362
3.675LysGly: 3.675 ± 0.648
2.45LysHis: 2.45 ± 0.757
6.125LysIle: 6.125 ± 1.535
7.35LysLys: 7.35 ± 0.583
6.125LysLeu: 6.125 ± 0.109
1.225LysMet: 1.225 ± 0.022
1.633LysAsn: 1.633 ± 0.209
2.042LysPro: 2.042 ± 0.439
2.042LysGln: 2.042 ± 0.274
4.9LysArg: 4.9 ± 0.087
4.083LysSer: 4.083 ± 1.592
3.675LysThr: 3.675 ± 0.065
4.492LysVal: 4.492 ± 0.317
0.408LysTrp: 0.408 ± 0.231
1.633LysTyr: 1.633 ± 0.209
0.0LysXaa: 0.0 ± 0.0
Leu
6.125LeuAla: 6.125 ± 0.605
2.45LeuCys: 2.45 ± 0.043
8.575LeuAsp: 8.575 ± 0.561
7.35LeuGlu: 7.35 ± 1.557
4.083LeuPhe: 4.083 ± 0.548
1.225LeuGly: 1.225 ± 0.692
0.817LeuHis: 0.817 ± 0.461
4.9LeuIle: 4.9 ± 0.087
4.083LeuLys: 4.083 ± 1.592
6.125LeuLeu: 6.125 ± 1.535
2.858LeuMet: 2.858 ± 0.187
4.083LeuAsn: 4.083 ± 0.165
2.45LeuPro: 2.45 ± 2.184
2.042LeuGln: 2.042 ± 0.987
7.758LeuArg: 7.758 ± 0.814
6.942LeuSer: 6.942 ± 1.066
5.308LeuThr: 5.308 ± 1.283
4.9LeuVal: 4.9 ± 0.8
1.225LeuTrp: 1.225 ± 0.692
4.9LeuTyr: 4.9 ± 0.087
0.0LeuXaa: 0.0 ± 0.0
Met
1.633MetAla: 1.633 ± 0.922
0.817MetCys: 0.817 ± 0.461
2.042MetAsp: 2.042 ± 1.153
0.408MetGlu: 0.408 ± 0.231
0.817MetPhe: 0.817 ± 0.461
0.408MetGly: 0.408 ± 0.483
0.0MetHis: 0.0 ± 0.0
1.225MetIle: 1.225 ± 0.022
2.45MetLys: 2.45 ± 0.757
3.267MetLeu: 3.267 ± 1.722
0.817MetMet: 0.817 ± 0.461
0.817MetAsn: 0.817 ± 0.966
1.225MetPro: 1.225 ± 0.692
0.817MetGln: 0.817 ± 0.252
2.042MetArg: 2.042 ± 0.439
0.817MetSer: 0.817 ± 0.252
1.225MetThr: 1.225 ± 0.022
1.633MetVal: 1.633 ± 0.209
0.0MetTrp: 0.0 ± 0.0
1.225MetTyr: 1.225 ± 0.022
0.0MetXaa: 0.0 ± 0.0
Asn
2.042AsnAla: 2.042 ± 1.153
0.817AsnCys: 0.817 ± 0.966
2.45AsnAsp: 2.45 ± 1.383
4.083AsnGlu: 4.083 ± 1.592
1.225AsnPhe: 1.225 ± 0.692
2.042AsnGly: 2.042 ± 0.987
2.042AsnHis: 2.042 ± 0.987
4.083AsnIle: 4.083 ± 0.879
2.042AsnLys: 2.042 ± 1.153
2.45AsnLeu: 2.45 ± 1.47
0.408AsnMet: 0.408 ± 0.231
0.817AsnAsn: 0.817 ± 0.252
2.858AsnPro: 2.858 ± 0.526
0.817AsnGln: 0.817 ± 0.966
1.633AsnArg: 1.633 ± 0.209
3.675AsnSer: 3.675 ± 0.065
1.225AsnThr: 1.225 ± 0.022
2.45AsnVal: 2.45 ± 0.043
0.817AsnTrp: 0.817 ± 0.252
1.633AsnTyr: 1.633 ± 0.209
0.0AsnXaa: 0.0 ± 0.0
Pro
1.225ProAla: 1.225 ± 0.022
0.408ProCys: 0.408 ± 0.483
1.633ProAsp: 1.633 ± 0.209
4.083ProGlu: 4.083 ± 1.261
1.633ProPhe: 1.633 ± 0.505
1.225ProGly: 1.225 ± 0.022
0.0ProHis: 0.0 ± 0.0
1.633ProIle: 1.633 ± 0.209
1.633ProLys: 1.633 ± 0.505
3.675ProLeu: 3.675 ± 0.779
0.817ProMet: 0.817 ± 0.252
4.083ProAsn: 4.083 ± 0.165
2.45ProPro: 2.45 ± 2.184
0.817ProGln: 0.817 ± 0.461
4.083ProArg: 4.083 ± 0.165
2.858ProSer: 2.858 ± 0.901
3.267ProThr: 3.267 ± 0.296
1.633ProVal: 1.633 ± 1.218
0.817ProTrp: 0.817 ± 0.966
2.45ProTyr: 2.45 ± 0.67
0.0ProXaa: 0.0 ± 0.0
Gln
1.225GlnAla: 1.225 ± 0.022
0.817GlnCys: 0.817 ± 0.461
3.267GlnAsp: 3.267 ± 0.418
2.042GlnGlu: 2.042 ± 1.153
1.633GlnPhe: 1.633 ± 0.922
0.817GlnGly: 0.817 ± 0.252
0.408GlnHis: 0.408 ± 0.231
2.45GlnIle: 2.45 ± 0.043
4.083GlnLys: 4.083 ± 0.879
3.267GlnLeu: 3.267 ± 0.296
0.408GlnMet: 0.408 ± 0.483
1.633GlnAsn: 1.633 ± 0.209
1.633GlnPro: 1.633 ± 0.505
2.042GlnGln: 2.042 ± 0.439
2.858GlnArg: 2.858 ± 0.526
2.042GlnSer: 2.042 ± 0.439
2.45GlnThr: 2.45 ± 0.757
2.042GlnVal: 2.042 ± 0.274
0.408GlnTrp: 0.408 ± 0.483
0.817GlnTyr: 0.817 ± 0.461
0.0GlnXaa: 0.0 ± 0.0
Arg
3.675ArgAla: 3.675 ± 0.065
1.225ArgCys: 1.225 ± 0.022
4.083ArgAsp: 4.083 ± 0.879
5.717ArgGlu: 5.717 ± 0.374
5.308ArgPhe: 5.308 ± 0.144
1.225ArgGly: 1.225 ± 0.022
2.858ArgHis: 2.858 ± 0.901
2.858ArgIle: 2.858 ± 0.187
4.492ArgLys: 4.492 ± 0.317
5.308ArgLeu: 5.308 ± 1.283
1.225ArgMet: 1.225 ± 0.022
1.633ArgAsn: 1.633 ± 0.209
2.858ArgPro: 2.858 ± 0.526
2.45ArgGln: 2.45 ± 0.67
2.45ArgArg: 2.45 ± 0.043
3.675ArgSer: 3.675 ± 0.648
2.042ArgThr: 2.042 ± 0.274
4.083ArgVal: 4.083 ± 0.548
0.408ArgTrp: 0.408 ± 0.231
4.083ArgTyr: 4.083 ± 1.261
0.0ArgXaa: 0.0 ± 0.0
Ser
4.9SerAla: 4.9 ± 0.8
2.042SerCys: 2.042 ± 1.153
3.675SerAsp: 3.675 ± 0.648
6.125SerGlu: 6.125 ± 0.605
2.858SerPhe: 2.858 ± 0.187
5.717SerGly: 5.717 ± 1.088
2.45SerHis: 2.45 ± 0.043
3.675SerIle: 3.675 ± 0.648
5.717SerLys: 5.717 ± 0.374
7.35SerLeu: 7.35 ± 2.01
0.408SerMet: 0.408 ± 0.231
3.675SerAsn: 3.675 ± 1.362
2.858SerPro: 2.858 ± 0.187
2.042SerGln: 2.042 ± 0.439
2.042SerArg: 2.042 ± 1.153
6.533SerSer: 6.533 ± 1.549
3.675SerThr: 3.675 ± 0.648
2.45SerVal: 2.45 ± 0.67
0.817SerTrp: 0.817 ± 0.252
2.042SerTyr: 2.042 ± 0.987
0.0SerXaa: 0.0 ± 0.0
Thr
3.267ThrAla: 3.267 ± 1.722
2.042ThrCys: 2.042 ± 0.987
1.633ThrAsp: 1.633 ± 0.505
2.45ThrGlu: 2.45 ± 1.383
0.408ThrPhe: 0.408 ± 0.483
1.225ThrGly: 1.225 ± 0.022
0.817ThrHis: 0.817 ± 0.966
2.042ThrIle: 2.042 ± 0.274
4.083ThrLys: 4.083 ± 0.879
7.758ThrLeu: 7.758 ± 0.1
0.817ThrMet: 0.817 ± 0.461
2.45ThrAsn: 2.45 ± 0.757
3.267ThrPro: 3.267 ± 0.296
2.858ThrGln: 2.858 ± 0.901
2.858ThrArg: 2.858 ± 1.24
4.9ThrSer: 4.9 ± 0.087
2.858ThrThr: 2.858 ± 0.187
2.45ThrVal: 2.45 ± 1.383
1.225ThrTrp: 1.225 ± 0.022
2.042ThrTyr: 2.042 ± 0.274
0.0ThrXaa: 0.0 ± 0.0
Val
4.492ValAla: 4.492 ± 0.396
0.817ValCys: 0.817 ± 0.252
4.083ValAsp: 4.083 ± 1.592
3.675ValGlu: 3.675 ± 1.492
0.817ValPhe: 0.817 ± 0.252
3.267ValGly: 3.267 ± 1.131
1.633ValHis: 1.633 ± 0.922
4.9ValIle: 4.9 ± 1.514
2.858ValLys: 2.858 ± 0.187
4.492ValLeu: 4.492 ± 0.396
0.0ValMet: 0.0 ± 0.0
3.675ValAsn: 3.675 ± 0.065
3.675ValPro: 3.675 ± 0.779
1.225ValGln: 1.225 ± 0.692
6.533ValArg: 6.533 ± 2.262
2.858ValSer: 2.858 ± 0.187
3.267ValThr: 3.267 ± 0.296
2.858ValVal: 2.858 ± 0.526
0.817ValTrp: 0.817 ± 0.461
2.042ValTyr: 2.042 ± 0.274
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.0TrpCys: 0.0 ± 0.0
0.817TrpAsp: 0.817 ± 0.461
0.0TrpGlu: 0.0 ± 0.0
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.817TrpHis: 0.817 ± 0.966
0.408TrpIle: 0.408 ± 0.483
1.633TrpLys: 1.633 ± 0.922
0.817TrpLeu: 0.817 ± 0.252
0.408TrpMet: 0.408 ± 0.483
0.408TrpAsn: 0.408 ± 0.231
0.0TrpPro: 0.0 ± 0.0
0.817TrpGln: 0.817 ± 0.252
1.633TrpArg: 1.633 ± 0.209
1.225TrpSer: 1.225 ± 0.692
0.408TrpThr: 0.408 ± 0.483
0.408TrpVal: 0.408 ± 0.231
0.0TrpTrp: 0.0 ± 0.0
1.225TrpTyr: 1.225 ± 0.022
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.675TyrAla: 3.675 ± 0.065
1.633TyrCys: 1.633 ± 0.505
1.633TyrAsp: 1.633 ± 0.209
2.042TyrGlu: 2.042 ± 0.439
2.45TyrPhe: 2.45 ± 0.757
4.492TyrGly: 4.492 ± 1.109
0.817TyrHis: 0.817 ± 0.252
3.267TyrIle: 3.267 ± 1.722
2.858TyrLys: 2.858 ± 0.526
3.267TyrLeu: 3.267 ± 0.418
0.408TyrMet: 0.408 ± 0.483
0.817TyrAsn: 0.817 ± 0.461
1.633TyrPro: 1.633 ± 0.505
2.858TyrGln: 2.858 ± 0.187
2.042TyrArg: 2.042 ± 0.439
2.45TyrSer: 2.45 ± 0.043
2.45TyrThr: 2.45 ± 0.043
3.267TyrVal: 3.267 ± 0.418
0.0TyrTrp: 0.0 ± 0.0
5.717TyrTyr: 5.717 ± 3.906
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2450 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski