Amino acid dipeptide frequency for Beihai sobemo-like virus 21

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions; for example, 3.906‰ corresponds to a fraction of 0.003906.
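As an illustration of how such a table could be computed, the following is a minimal sketch in Python. It assumes that dipeptides are counted as overlapping pairs within each protein and that any residue outside the 20 standard one-letter codes is mapped to X (Xaa); the function name `dipeptide_per_mille` and the example sequences are hypothetical and are not taken from Proteome-pI.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: overlapping pairs are counted within each protein only,
    so no dipeptide spans the boundary between two proteins.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(aa if aa in AMINO_ACIDS else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Expected frequency if all 400 standard dipeptides were equally likely.
uniform_per_mille = 1000.0 / len(AMINO_ACIDS) ** 2  # 2.5 per mille (0.25%)

# Toy input (hypothetical sequences, not the viral proteome).
freqs = dipeptide_per_mille(["MAAS", "ASGB"])
over_represented = sorted(dp for dp, f in freqs.items() if f > uniform_per_mille)
```

Comparing each value against `uniform_per_mille` mirrors the red/blue marking described above: values above 2.5‰ are more frequent than the uniform expectation, and values below it are underrepresented.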

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
3.906AlaAla: 3.906 ± 0.131
1.116AlaCys: 1.116 ± 0.063
3.906AlaAsp: 3.906 ± 0.838
6.138AlaGlu: 6.138 ± 0.004
2.232AlaPhe: 2.232 ± 0.127
4.464AlaGly: 4.464 ± 0.96
1.116AlaHis: 1.116 ± 0.77
1.116AlaIle: 1.116 ± 0.643
3.906AlaLys: 3.906 ± 1.544
4.464AlaLeu: 4.464 ± 0.254
1.674AlaMet: 1.674 ± 0.448
2.232AlaAsn: 2.232 ± 1.286
2.79AlaPro: 2.79 ± 0.512
2.79AlaGln: 2.79 ± 0.195
4.464AlaArg: 4.464 ± 0.254
8.371AlaSer: 8.371 ± 0.584
4.464AlaThr: 4.464 ± 0.254
5.022AlaVal: 5.022 ± 0.774
2.232AlaTrp: 2.232 ± 0.833
2.232AlaTyr: 2.232 ± 0.58
0.0AlaXaa: 0.0 ± 0.0
Cys
2.232CysAla: 2.232 ± 0.833
0.558CysCys: 0.558 ± 0.322
1.674CysAsp: 1.674 ± 0.448
2.232CysGlu: 2.232 ± 1.54
0.558CysPhe: 0.558 ± 0.385
2.232CysGly: 2.232 ± 0.833
0.0CysHis: 0.0 ± 0.0
0.558CysIle: 0.558 ± 0.385
0.558CysLys: 0.558 ± 0.385
2.232CysLeu: 2.232 ± 0.127
0.558CysMet: 0.558 ± 0.385
0.0CysAsn: 0.0 ± 0.0
1.674CysPro: 1.674 ± 1.155
0.0CysGln: 0.0 ± 0.0
2.232CysArg: 2.232 ± 0.833
2.232CysSer: 2.232 ± 0.833
0.558CysThr: 0.558 ± 0.322
2.232CysVal: 2.232 ± 0.833
0.0CysTrp: 0.0 ± 0.0
2.232CysTyr: 2.232 ± 0.127
0.0CysXaa: 0.0 ± 0.0
Asp
2.232AspAla: 2.232 ± 0.127
1.674AspCys: 1.674 ± 1.155
4.464AspAsp: 4.464 ± 0.453
2.79AspGlu: 2.79 ± 1.218
3.906AspPhe: 3.906 ± 0.838
2.79AspGly: 2.79 ± 0.512
1.674AspHis: 1.674 ± 0.258
0.558AspIle: 0.558 ± 0.385
3.348AspLys: 3.348 ± 0.897
5.58AspLeu: 5.58 ± 1.802
0.558AspMet: 0.558 ± 0.385
2.79AspAsn: 2.79 ± 0.512
3.906AspPro: 3.906 ± 0.575
4.464AspGln: 4.464 ± 1.667
2.79AspArg: 2.79 ± 1.608
2.232AspSer: 2.232 ± 0.127
4.464AspThr: 4.464 ± 0.254
3.348AspVal: 3.348 ± 0.19
1.674AspTrp: 1.674 ± 0.258
2.232AspTyr: 2.232 ± 0.58
0.0AspXaa: 0.0 ± 0.0
Glu
4.464GluAla: 4.464 ± 0.254
2.232GluCys: 2.232 ± 0.127
3.906GluAsp: 3.906 ± 2.251
4.464GluGlu: 4.464 ± 0.453
2.232GluPhe: 2.232 ± 0.127
3.906GluGly: 3.906 ± 0.131
0.0GluHis: 0.0 ± 0.0
4.464GluIle: 4.464 ± 1.159
5.022GluLys: 5.022 ± 1.481
5.022GluLeu: 5.022 ± 0.639
0.0GluMet: 0.0 ± 0.0
1.116GluAsn: 1.116 ± 0.77
1.674GluPro: 1.674 ± 0.448
2.79GluGln: 2.79 ± 0.195
5.022GluArg: 5.022 ± 0.639
7.812GluSer: 7.812 ± 0.262
6.138GluThr: 6.138 ± 1.417
3.348GluVal: 3.348 ± 0.897
0.558GluTrp: 0.558 ± 0.322
1.674GluTyr: 1.674 ± 0.258
0.0GluXaa: 0.0 ± 0.0
Phe
1.674PheAla: 1.674 ± 0.448
0.558PheCys: 0.558 ± 0.385
1.674PheAsp: 1.674 ± 0.448
3.348PheGlu: 3.348 ± 0.516
1.116PhePhe: 1.116 ± 0.063
3.906PheGly: 3.906 ± 0.131
0.558PheHis: 0.558 ± 0.385
2.232PheIle: 2.232 ± 1.286
1.116PheLys: 1.116 ± 0.77
5.58PheLeu: 5.58 ± 1.096
0.558PheMet: 0.558 ± 0.322
1.116PheAsn: 1.116 ± 0.77
1.116PhePro: 1.116 ± 0.77
1.674PheGln: 1.674 ± 0.448
2.232PheArg: 2.232 ± 0.127
2.79PheSer: 2.79 ± 0.195
2.232PheThr: 2.232 ± 0.58
0.558PheVal: 0.558 ± 0.322
1.674PheTrp: 1.674 ± 0.448
0.558PheTyr: 0.558 ± 0.322
0.0PheXaa: 0.0 ± 0.0
Gly
5.022GlyAla: 5.022 ± 0.639
2.79GlyCys: 2.79 ± 1.218
5.58GlyAsp: 5.58 ± 0.389
7.812GlyGlu: 7.812 ± 0.969
2.79GlyPhe: 2.79 ± 0.512
3.906GlyGly: 3.906 ± 1.988
1.116GlyHis: 1.116 ± 0.063
4.464GlyIle: 4.464 ± 0.96
5.58GlyLys: 5.58 ± 1.024
3.348GlyLeu: 3.348 ± 0.19
1.116GlyMet: 1.116 ± 0.337
2.79GlyAsn: 2.79 ± 0.195
2.232GlyPro: 2.232 ± 0.127
3.348GlyGln: 3.348 ± 0.516
3.906GlyArg: 3.906 ± 1.988
4.464GlySer: 4.464 ± 0.453
3.348GlyThr: 3.348 ± 0.897
2.232GlyVal: 2.232 ± 0.833
2.232GlyTrp: 2.232 ± 0.833
3.348GlyTyr: 3.348 ± 0.19
0.0GlyXaa: 0.0 ± 0.0
His
2.232HisAla: 2.232 ± 0.58
1.116HisCys: 1.116 ± 0.063
0.558HisAsp: 0.558 ± 0.385
2.232HisGlu: 2.232 ± 0.833
0.558HisPhe: 0.558 ± 0.385
0.558HisGly: 0.558 ± 0.322
0.0HisHis: 0.0 ± 0.0
0.558HisIle: 0.558 ± 0.322
1.674HisLys: 1.674 ± 0.448
1.674HisLeu: 1.674 ± 0.965
0.0HisMet: 0.0 ± 0.0
0.558HisAsn: 0.558 ± 0.385
0.558HisPro: 0.558 ± 0.322
1.116HisGln: 1.116 ± 0.643
2.232HisArg: 2.232 ± 0.127
1.116HisSer: 1.116 ± 0.063
1.116HisThr: 1.116 ± 0.063
0.558HisVal: 0.558 ± 0.322
0.558HisTrp: 0.558 ± 0.322
0.558HisTyr: 0.558 ± 0.322
0.0HisXaa: 0.0 ± 0.0
Ile
5.022IleAla: 5.022 ± 0.068
1.674IleCys: 1.674 ± 1.155
1.674IleAsp: 1.674 ± 0.258
2.232IleGlu: 2.232 ± 0.58
1.674IlePhe: 1.674 ± 0.448
3.906IleGly: 3.906 ± 1.282
1.116IleHis: 1.116 ± 0.643
2.232IleIle: 2.232 ± 0.58
1.674IleLys: 1.674 ± 0.258
3.348IleLeu: 3.348 ± 0.516
1.116IleMet: 1.116 ± 0.643
1.116IleAsn: 1.116 ± 0.063
1.674IlePro: 1.674 ± 0.448
2.232IleGln: 2.232 ± 1.286
1.116IleArg: 1.116 ± 0.063
2.79IleSer: 2.79 ± 0.195
1.674IleThr: 1.674 ± 0.448
2.232IleVal: 2.232 ± 0.127
0.558IleTrp: 0.558 ± 0.385
0.558IleTyr: 0.558 ± 0.385
0.0IleXaa: 0.0 ± 0.0
Lys
3.348LysAla: 3.348 ± 0.19
1.116LysCys: 1.116 ± 0.77
1.116LysAsp: 1.116 ± 0.77
2.232LysGlu: 2.232 ± 0.127
2.79LysPhe: 2.79 ± 0.512
2.232LysGly: 2.232 ± 1.286
0.558LysHis: 0.558 ± 0.322
2.79LysIle: 2.79 ± 1.218
5.58LysLys: 5.58 ± 0.389
6.696LysLeu: 6.696 ± 1.739
1.116LysMet: 1.116 ± 0.643
1.674LysAsn: 1.674 ± 0.448
5.022LysPro: 5.022 ± 1.345
4.464LysGln: 4.464 ± 1.159
2.79LysArg: 2.79 ± 0.195
6.138LysSer: 6.138 ± 1.417
2.79LysThr: 2.79 ± 0.195
0.558LysVal: 0.558 ± 0.322
0.558LysTrp: 0.558 ± 0.385
1.674LysTyr: 1.674 ± 0.258
0.0LysXaa: 0.0 ± 0.0
Leu
5.58LeuAla: 5.58 ± 1.096
1.116LeuCys: 1.116 ± 0.063
6.696LeuAsp: 6.696 ± 1.087
5.58LeuGlu: 5.58 ± 1.802
2.232LeuPhe: 2.232 ± 0.833
5.58LeuGly: 5.58 ± 0.317
1.116LeuHis: 1.116 ± 0.063
2.232LeuIle: 2.232 ± 0.58
2.232LeuLys: 2.232 ± 0.833
7.254LeuLeu: 7.254 ± 2.06
1.674LeuMet: 1.674 ± 0.448
2.79LeuAsn: 2.79 ± 1.608
5.022LeuPro: 5.022 ± 0.068
4.464LeuGln: 4.464 ± 0.453
5.58LeuArg: 5.58 ± 0.317
6.138LeuSer: 6.138 ± 0.711
6.696LeuThr: 6.696 ± 3.152
6.138LeuVal: 6.138 ± 0.702
2.232LeuTrp: 2.232 ± 0.127
2.232LeuTyr: 2.232 ± 0.127
0.0LeuXaa: 0.0 ± 0.0
Met
1.116MetAla: 1.116 ± 0.063
0.0MetCys: 0.0 ± 0.0
1.674MetAsp: 1.674 ± 0.258
0.558MetGlu: 0.558 ± 0.322
0.558MetPhe: 0.558 ± 0.322
1.674MetGly: 1.674 ± 1.155
0.558MetHis: 0.558 ± 0.322
1.116MetIle: 1.116 ± 0.063
1.674MetLys: 1.674 ± 0.965
2.232MetLeu: 2.232 ± 0.127
0.0MetMet: 0.0 ± 0.0
1.116MetAsn: 1.116 ± 0.063
0.0MetPro: 0.0 ± 0.0
0.558MetGln: 0.558 ± 0.322
0.558MetArg: 0.558 ± 0.322
1.674MetSer: 1.674 ± 0.258
0.0MetThr: 0.0 ± 0.0
3.348MetVal: 3.348 ± 0.19
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
2.79AsnAla: 2.79 ± 0.512
0.558AsnCys: 0.558 ± 0.385
1.674AsnAsp: 1.674 ± 0.258
1.116AsnGlu: 1.116 ± 0.063
1.674AsnPhe: 1.674 ± 0.965
3.348AsnGly: 3.348 ± 0.19
1.116AsnHis: 1.116 ± 0.77
0.0AsnIle: 0.0 ± 0.0
3.906AsnLys: 3.906 ± 2.251
2.79AsnLeu: 2.79 ± 0.512
0.558AsnMet: 0.558 ± 0.271
0.0AsnAsn: 0.0 ± 0.0
3.348AsnPro: 3.348 ± 0.897
1.674AsnGln: 1.674 ± 0.448
1.116AsnArg: 1.116 ± 0.77
3.906AsnSer: 3.906 ± 0.575
0.558AsnThr: 0.558 ± 0.385
3.906AsnVal: 3.906 ± 0.838
0.0AsnTrp: 0.0 ± 0.0
1.674AsnTyr: 1.674 ± 0.258
0.0AsnXaa: 0.0 ± 0.0
Pro
5.58ProAla: 5.58 ± 0.317
0.558ProCys: 0.558 ± 0.322
3.348ProAsp: 3.348 ± 1.603
3.906ProGlu: 3.906 ± 0.131
1.674ProPhe: 1.674 ± 0.258
3.906ProGly: 3.906 ± 2.695
2.232ProHis: 2.232 ± 0.127
2.79ProIle: 2.79 ± 0.195
2.79ProLys: 2.79 ± 0.195
3.906ProLeu: 3.906 ± 0.131
1.116ProMet: 1.116 ± 0.063
1.674ProAsn: 1.674 ± 1.155
3.906ProPro: 3.906 ± 2.251
2.232ProGln: 2.232 ± 0.127
1.674ProArg: 1.674 ± 0.258
4.464ProSer: 4.464 ± 0.254
6.696ProThr: 6.696 ± 0.381
3.906ProVal: 3.906 ± 0.838
0.558ProTrp: 0.558 ± 0.322
0.558ProTyr: 0.558 ± 0.385
0.0ProXaa: 0.0 ± 0.0
Gln
5.58GlnAla: 5.58 ± 1.802
1.116GlnCys: 1.116 ± 0.77
0.558GlnAsp: 0.558 ± 0.322
2.79GlnGlu: 2.79 ± 0.901
1.116GlnPhe: 1.116 ± 0.77
2.232GlnGly: 2.232 ± 0.127
0.0GlnHis: 0.0 ± 0.0
2.232GlnIle: 2.232 ± 0.127
2.79GlnLys: 2.79 ± 1.608
5.022GlnLeu: 5.022 ± 0.774
2.232GlnMet: 2.232 ± 0.58
1.674GlnAsn: 1.674 ± 0.965
2.79GlnPro: 2.79 ± 1.608
3.906GlnGln: 3.906 ± 0.131
3.348GlnArg: 3.348 ± 0.897
1.674GlnSer: 1.674 ± 0.258
3.906GlnThr: 3.906 ± 1.282
3.348GlnVal: 3.348 ± 0.516
2.232GlnTrp: 2.232 ± 0.833
0.558GlnTyr: 0.558 ± 0.322
0.0GlnXaa: 0.0 ± 0.0
Arg
1.116ArgAla: 1.116 ± 0.643
2.79ArgCys: 2.79 ± 1.218
4.464ArgAsp: 4.464 ± 0.96
3.348ArgGlu: 3.348 ± 0.516
2.232ArgPhe: 2.232 ± 1.286
6.138ArgGly: 6.138 ± 2.115
1.116ArgHis: 1.116 ± 0.063
0.558ArgIle: 0.558 ± 0.385
3.348ArgLys: 3.348 ± 1.603
6.696ArgLeu: 6.696 ± 1.087
1.116ArgMet: 1.116 ± 0.063
2.79ArgAsn: 2.79 ± 0.512
5.022ArgPro: 5.022 ± 0.639
3.348ArgGln: 3.348 ± 0.516
6.138ArgArg: 6.138 ± 1.409
3.906ArgSer: 3.906 ± 0.131
3.348ArgThr: 3.348 ± 0.897
2.79ArgVal: 2.79 ± 0.195
0.558ArgTrp: 0.558 ± 0.322
1.674ArgTyr: 1.674 ± 0.258
0.0ArgXaa: 0.0 ± 0.0
Ser
6.138SerAla: 6.138 ± 0.711
2.232SerCys: 2.232 ± 0.833
1.674SerAsp: 1.674 ± 0.258
6.138SerGlu: 6.138 ± 1.409
2.79SerPhe: 2.79 ± 0.195
9.487SerGly: 9.487 ± 0.52
2.79SerHis: 2.79 ± 0.901
3.906SerIle: 3.906 ± 0.575
3.348SerLys: 3.348 ± 0.19
5.022SerLeu: 5.022 ± 2.187
1.116SerMet: 1.116 ± 0.063
3.906SerAsn: 3.906 ± 0.838
3.906SerPro: 3.906 ± 0.838
2.79SerGln: 2.79 ± 0.901
6.138SerArg: 6.138 ± 0.711
5.58SerSer: 5.58 ± 0.317
5.58SerThr: 5.58 ± 1.024
4.464SerVal: 4.464 ± 1.159
1.674SerTrp: 1.674 ± 0.965
3.348SerTyr: 3.348 ± 0.897
0.0SerXaa: 0.0 ± 0.0
Thr
3.348ThrAla: 3.348 ± 0.19
1.674ThrCys: 1.674 ± 0.448
5.58ThrAsp: 5.58 ± 0.389
3.906ThrGlu: 3.906 ± 1.544
2.79ThrPhe: 2.79 ± 0.901
6.696ThrGly: 6.696 ± 0.326
2.79ThrHis: 2.79 ± 0.901
3.906ThrIle: 3.906 ± 1.282
1.674ThrLys: 1.674 ± 0.258
3.906ThrLeu: 3.906 ± 1.988
2.232ThrMet: 2.232 ± 0.58
1.116ThrAsn: 1.116 ± 0.77
4.464ThrPro: 4.464 ± 0.254
2.79ThrGln: 2.79 ± 1.608
3.906ThrArg: 3.906 ± 0.575
7.254ThrSer: 7.254 ± 0.647
5.58ThrThr: 5.58 ± 0.317
3.906ThrVal: 3.906 ± 0.575
0.0ThrTrp: 0.0 ± 0.0
0.558ThrTyr: 0.558 ± 0.385
0.0ThrXaa: 0.0 ± 0.0
Val
3.906ValAla: 3.906 ± 0.131
1.116ValCys: 1.116 ± 0.77
4.464ValAsp: 4.464 ± 1.667
2.79ValGlu: 2.79 ± 0.195
1.674ValPhe: 1.674 ± 1.155
3.348ValGly: 3.348 ± 0.516
0.558ValHis: 0.558 ± 0.322
3.348ValIle: 3.348 ± 0.516
2.79ValLys: 2.79 ± 0.901
3.906ValLeu: 3.906 ± 0.131
0.558ValMet: 0.558 ± 0.322
4.464ValAsn: 4.464 ± 0.254
5.022ValPro: 5.022 ± 0.774
2.79ValGln: 2.79 ± 0.512
5.58ValArg: 5.58 ± 1.024
5.022ValSer: 5.022 ± 0.774
4.464ValThr: 4.464 ± 1.866
1.674ValVal: 1.674 ± 0.965
0.558ValTrp: 0.558 ± 0.322
0.558ValTyr: 0.558 ± 0.322
0.0ValXaa: 0.0 ± 0.0
Trp
1.116TrpAla: 1.116 ± 0.643
0.0TrpCys: 0.0 ± 0.0
1.674TrpAsp: 1.674 ± 0.448
1.116TrpGlu: 1.116 ± 0.643
0.558TrpPhe: 0.558 ± 0.385
1.116TrpGly: 1.116 ± 0.77
0.558TrpHis: 0.558 ± 0.322
0.558TrpIle: 0.558 ± 0.322
1.116TrpLys: 1.116 ± 0.77
0.558TrpLeu: 0.558 ± 0.385
0.558TrpMet: 0.558 ± 0.322
0.558TrpAsn: 0.558 ± 0.322
1.116TrpPro: 1.116 ± 0.063
0.558TrpGln: 0.558 ± 0.322
1.116TrpArg: 1.116 ± 0.77
1.116TrpSer: 1.116 ± 0.643
1.674TrpThr: 1.674 ± 1.155
1.674TrpVal: 1.674 ± 0.258
0.0TrpTrp: 0.0 ± 0.0
1.674TrpTyr: 1.674 ± 0.448
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.232TyrAla: 2.232 ± 0.127
0.558TyrCys: 0.558 ± 0.322
1.116TyrAsp: 1.116 ± 0.063
1.116TyrGlu: 1.116 ± 0.643
1.116TyrPhe: 1.116 ± 0.643
0.558TyrGly: 0.558 ± 0.322
0.558TyrHis: 0.558 ± 0.385
0.0TyrIle: 0.0 ± 0.0
1.116TyrLys: 1.116 ± 0.77
3.348TyrLeu: 3.348 ± 0.516
0.0TyrMet: 0.0 ± 0.0
2.232TyrAsn: 2.232 ± 0.127
2.232TyrPro: 2.232 ± 0.833
1.116TyrGln: 1.116 ± 0.063
0.558TyrArg: 0.558 ± 0.385
3.348TyrSer: 3.348 ± 0.516
2.79TyrThr: 2.79 ± 0.195
3.348TyrVal: 3.348 ± 0.516
0.558TyrTrp: 0.558 ± 0.385
0.558TyrTyr: 0.558 ± 0.322
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1793 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
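The page does not describe the bootstrap procedure beyond the note above, so the following Python sketch is only one plausible reading: proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the standard deviation of those estimates is reported as the error. The function names, the use of the population standard deviation, and the fixed seed are assumptions made for illustration.

```python
import random
import statistics
from collections import Counter


def dipeptide_fraction(proteins, dipeptide):
    """Fraction of all overlapping within-protein pairs equal to `dipeptide`."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_resamples=100, seed=0):
    """Protein-level bootstrap error of a dipeptide frequency, in per mille.

    Assumption: whole proteins (not residues) are drawn with replacement,
    and the spread is summarized by the population standard deviation.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(1000.0 * dipeptide_fraction(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy input (hypothetical sequences): two very different proteins.
error = bootstrap_error(["AAAA", "CCCC"], "AA")
```

With only two proteins, as in this proteome, each resample can contain only a few distinct combinations of proteins, so the resulting error estimates are coarse.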

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded as a CSV file.
See this proteome in UniProt.
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license.

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski