Amino acid dipeptide frequency for Beihai sobemo-like virus 14

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each one should be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
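The counting described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function names, the one-letter alphabet, the mapping of any other letter to X, and the simple over/under rule against the uniform expectation are all assumptions made for the example.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else is
# treated as X (Xaa). This mapping is an assumption of the sketch.
STANDARD = "ACDEFGHIKLMNPQRSTVWY"

# Uniform expectation in per mille: 1000/400 for 20 letters,
# 1000/441 when Xaa is counted as a 21st letter.
UNIFORM_20 = 1000.0 / 400
UNIFORM_21 = 1000.0 / 441


def dipeptide_permille(proteins):
    """Return observed dipeptide frequencies (per mille) for a list of sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs within one protein; pairs never span two proteins.
        for a, b in zip(seq, seq[1:]):
            a = a if a in STANDARD else "X"
            b = b if b in STANDARD else "X"
            counts[a + b] += 1
    total = sum(counts.values())
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(freq, expected=UNIFORM_20):
    """Label a frequency as over- or underrepresented relative to `expected`."""
    if freq > expected:
        return "over"
    if freq < expected:
        return "under"
    return "expected"
```

For instance, classify(6.524) gives "over" and classify(0.932) gives "under" when compared with the 2.5‰ uniform expectation.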

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
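As a small worked example of the unit conversion, the helpers below (hypothetical names, not part of the database) turn a per mille value from the table into a plain fraction or a percentage.

```python
def permille_to_fraction(value):
    """Convert a per mille value to a fraction (multiply by 10^-3)."""
    return value * 1e-3


def permille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0
```

With these, the AlaAla entry of 6.524‰ corresponds to a fraction of about 0.006524, and the uniform expectation of 2.5‰ corresponds to 0.25%.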

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
6.524AlaAla: 6.524 ± 0.173
0.0AlaCys: 0.0 ± 0.0
5.592AlaAsp: 5.592 ± 0.328
3.728AlaGlu: 3.728 ± 0.337
3.728AlaPhe: 3.728 ± 0.337
4.66AlaGly: 4.66 ± 2.508
1.864AlaHis: 1.864 ± 0.666
2.796AlaIle: 2.796 ± 1.833
6.524AlaLys: 6.524 ± 1.842
2.796AlaLeu: 2.796 ± 0.164
1.864AlaMet: 1.864 ± 0.666
2.796AlaAsn: 2.796 ± 1.505
3.728AlaPro: 3.728 ± 0.337
0.932AlaGln: 0.932 ± 0.502
3.728AlaArg: 3.728 ± 2.006
8.388AlaSer: 8.388 ± 1.177
2.796AlaThr: 2.796 ± 0.164
5.592AlaVal: 5.592 ± 0.328
0.932AlaTrp: 0.932 ± 0.502
1.864AlaTyr: 1.864 ± 2.334
0.0AlaXaa: 0.0 ± 0.0
Cys
1.864CysAla: 1.864 ± 0.666
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
2.796CysGlu: 2.796 ± 0.164
2.796CysPhe: 2.796 ± 1.833
0.0CysGly: 0.0 ± 0.0
0.0CysHis: 0.0 ± 0.0
0.932CysIle: 0.932 ± 0.502
1.864CysLys: 1.864 ± 0.666
0.932CysLeu: 0.932 ± 0.502
0.932CysMet: 0.932 ± 0.418
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
1.864CysGln: 1.864 ± 2.334
0.932CysArg: 0.932 ± 0.502
3.728CysSer: 3.728 ± 0.337
0.932CysThr: 0.932 ± 1.167
1.864CysVal: 1.864 ± 1.003
0.0CysTrp: 0.0 ± 0.0
0.932CysTyr: 0.932 ± 1.167
0.0CysXaa: 0.0 ± 0.0
Asp
1.864AspAla: 1.864 ± 1.003
1.864AspCys: 1.864 ± 0.666
11.184AspAsp: 11.184 ± 1.012
10.252AspGlu: 10.252 ± 2.18
2.796AspPhe: 2.796 ± 0.164
3.728AspGly: 3.728 ± 2.006
0.932AspHis: 0.932 ± 1.167
3.728AspIle: 3.728 ± 2.006
0.932AspLys: 0.932 ± 1.167
2.796AspLeu: 2.796 ± 0.164
1.864AspMet: 1.864 ± 0.666
2.796AspAsn: 2.796 ± 1.505
1.864AspPro: 1.864 ± 0.666
1.864AspGln: 1.864 ± 0.666
0.932AspArg: 0.932 ± 0.502
4.66AspSer: 4.66 ± 2.508
2.796AspThr: 2.796 ± 1.833
4.66AspVal: 4.66 ± 0.839
1.864AspTrp: 1.864 ± 2.334
1.864AspTyr: 1.864 ± 1.003
0.0AspXaa: 0.0 ± 0.0
Glu
8.388GluAla: 8.388 ± 4.514
2.796GluCys: 2.796 ± 0.164
2.796GluAsp: 2.796 ± 0.164
6.524GluGlu: 6.524 ± 4.833
1.864GluPhe: 1.864 ± 1.003
7.456GluGly: 7.456 ± 2.663
0.932GluHis: 0.932 ± 0.502
4.66GluIle: 4.66 ± 0.839
3.728GluLys: 3.728 ± 0.337
5.592GluLeu: 5.592 ± 1.997
1.864GluMet: 1.864 ± 1.003
2.796GluAsn: 2.796 ± 0.164
3.728GluPro: 3.728 ± 1.331
3.728GluGln: 3.728 ± 1.331
3.728GluArg: 3.728 ± 2.006
2.796GluSer: 2.796 ± 1.505
3.728GluThr: 3.728 ± 1.331
4.66GluVal: 4.66 ± 2.508
0.932GluTrp: 0.932 ± 1.167
2.796GluTyr: 2.796 ± 0.164
0.0GluXaa: 0.0 ± 0.0
Phe
2.796PheAla: 2.796 ± 1.833
0.932PheCys: 0.932 ± 0.502
1.864PheAsp: 1.864 ± 0.666
0.932PheGlu: 0.932 ± 0.502
0.0PhePhe: 0.0 ± 0.0
1.864PheGly: 1.864 ± 0.666
1.864PheHis: 1.864 ± 0.666
0.932PheIle: 0.932 ± 1.167
1.864PheLys: 1.864 ± 1.003
6.524PheLeu: 6.524 ± 0.173
0.932PheMet: 0.932 ± 1.167
0.932PheAsn: 0.932 ± 1.167
0.932PhePro: 0.932 ± 1.167
0.932PheGln: 0.932 ± 0.502
0.932PheArg: 0.932 ± 0.502
1.864PheSer: 1.864 ± 2.334
2.796PheThr: 2.796 ± 1.505
1.864PheVal: 1.864 ± 2.334
1.864PheTrp: 1.864 ± 1.003
1.864PheTyr: 1.864 ± 1.003
0.0PheXaa: 0.0 ± 0.0
Gly
4.66GlyAla: 4.66 ± 0.83
2.796GlyCys: 2.796 ± 1.505
2.796GlyAsp: 2.796 ± 0.164
1.864GlyGlu: 1.864 ± 1.003
2.796GlyPhe: 2.796 ± 1.833
3.728GlyGly: 3.728 ± 4.669
0.0GlyHis: 0.0 ± 0.0
4.66GlyIle: 4.66 ± 0.83
8.388GlyLys: 8.388 ± 1.177
5.592GlyLeu: 5.592 ± 1.341
0.932GlyMet: 0.932 ± 0.502
1.864GlyAsn: 1.864 ± 0.666
3.728GlyPro: 3.728 ± 2.006
1.864GlyGln: 1.864 ± 0.666
6.524GlyArg: 6.524 ± 1.495
2.796GlySer: 2.796 ± 1.505
1.864GlyThr: 1.864 ± 1.003
3.728GlyVal: 3.728 ± 0.337
2.796GlyTrp: 2.796 ± 1.833
3.728GlyTyr: 3.728 ± 0.337
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
1.864HisAsp: 1.864 ± 0.666
0.0HisGlu: 0.0 ± 0.0
0.932HisPhe: 0.932 ± 1.167
1.864HisGly: 1.864 ± 1.003
2.796HisHis: 2.796 ± 0.164
0.932HisIle: 0.932 ± 1.167
0.932HisLys: 0.932 ± 1.167
2.796HisLeu: 2.796 ± 0.164
0.0HisMet: 0.0 ± 0.0
0.0HisAsn: 0.0 ± 0.0
0.932HisPro: 0.932 ± 1.167
0.932HisGln: 0.932 ± 0.502
0.0HisArg: 0.0 ± 0.0
1.864HisSer: 1.864 ± 1.003
0.932HisThr: 0.932 ± 0.502
0.932HisVal: 0.932 ± 0.502
0.0HisTrp: 0.0 ± 0.0
0.932HisTyr: 0.932 ± 1.167
0.0HisXaa: 0.0 ± 0.0
Ile
2.796IleAla: 2.796 ± 1.505
0.0IleCys: 0.0 ± 0.0
4.66IleAsp: 4.66 ± 2.508
1.864IleGlu: 1.864 ± 1.003
1.864IlePhe: 1.864 ± 1.003
1.864IleGly: 1.864 ± 0.666
0.0IleHis: 0.0 ± 0.0
0.932IleIle: 0.932 ± 0.502
7.456IleLys: 7.456 ± 0.675
6.524IleLeu: 6.524 ± 3.164
1.864IleMet: 1.864 ± 0.666
1.864IleAsn: 1.864 ± 1.003
0.932IlePro: 0.932 ± 0.502
3.728IleGln: 3.728 ± 1.331
0.0IleArg: 0.0 ± 0.0
1.864IleSer: 1.864 ± 0.666
1.864IleThr: 1.864 ± 0.666
3.728IleVal: 3.728 ± 0.337
0.0IleTrp: 0.0 ± 0.0
1.864IleTyr: 1.864 ± 0.666
0.0IleXaa: 0.0 ± 0.0
Lys
3.728LysAla: 3.728 ± 1.331
0.932LysCys: 0.932 ± 0.502
3.728LysAsp: 3.728 ± 2.006
6.524LysGlu: 6.524 ± 0.173
0.932LysPhe: 0.932 ± 0.502
3.728LysGly: 3.728 ± 2.006
0.0LysHis: 0.0 ± 0.0
1.864LysIle: 1.864 ± 0.666
4.66LysLys: 4.66 ± 2.508
3.728LysLeu: 3.728 ± 0.337
0.932LysMet: 0.932 ± 0.502
1.864LysAsn: 1.864 ± 1.003
1.864LysPro: 1.864 ± 1.003
3.728LysGln: 3.728 ± 3.0
5.592LysArg: 5.592 ± 1.341
7.456LysSer: 7.456 ± 2.344
1.864LysThr: 1.864 ± 1.003
5.592LysVal: 5.592 ± 1.341
0.932LysTrp: 0.932 ± 1.167
1.864LysTyr: 1.864 ± 0.666
0.0LysXaa: 0.0 ± 0.0
Leu
5.592LeuAla: 5.592 ± 3.666
0.0LeuCys: 0.0 ± 0.0
3.728LeuAsp: 3.728 ± 0.337
4.66LeuGlu: 4.66 ± 0.83
2.796LeuPhe: 2.796 ± 3.502
5.592LeuGly: 5.592 ± 1.341
1.864LeuHis: 1.864 ± 2.334
5.592LeuIle: 5.592 ± 1.341
2.796LeuLys: 2.796 ± 1.833
5.592LeuLeu: 5.592 ± 1.997
1.864LeuMet: 1.864 ± 0.666
3.728LeuAsn: 3.728 ± 1.331
4.66LeuPro: 4.66 ± 2.508
2.796LeuGln: 2.796 ± 0.164
10.252LeuArg: 10.252 ± 2.827
7.456LeuSer: 7.456 ± 0.994
3.728LeuThr: 3.728 ± 0.337
3.728LeuVal: 3.728 ± 0.337
0.932LeuTrp: 0.932 ± 1.167
7.456LeuTyr: 7.456 ± 0.675
0.0LeuXaa: 0.0 ± 0.0
Met
0.932MetAla: 0.932 ± 1.167
0.0MetCys: 0.0 ± 0.0
0.0MetAsp: 0.0 ± 0.0
2.796MetGlu: 2.796 ± 1.833
0.0MetPhe: 0.0 ± 0.0
3.728MetGly: 3.728 ± 3.0
0.0MetHis: 0.0 ± 0.0
1.864MetIle: 1.864 ± 0.666
0.0MetLys: 0.0 ± 0.0
2.796MetLeu: 2.796 ± 1.833
0.0MetMet: 0.0 ± 0.0
0.0MetAsn: 0.0 ± 0.0
3.728MetPro: 3.728 ± 1.331
1.864MetGln: 1.864 ± 0.666
1.864MetArg: 1.864 ± 0.666
1.864MetSer: 1.864 ± 1.003
1.864MetThr: 1.864 ± 1.003
1.864MetVal: 1.864 ± 0.666
1.864MetTrp: 1.864 ± 0.666
0.932MetTyr: 0.932 ± 0.502
0.0MetXaa: 0.0 ± 0.0
Asn
0.0AsnAla: 0.0 ± 0.0
1.864AsnCys: 1.864 ± 0.666
1.864AsnAsp: 1.864 ± 2.334
1.864AsnGlu: 1.864 ± 0.666
1.864AsnPhe: 1.864 ± 0.666
0.932AsnGly: 0.932 ± 0.502
0.932AsnHis: 0.932 ± 1.167
0.932AsnIle: 0.932 ± 0.502
0.932AsnLys: 0.932 ± 0.502
2.796AsnLeu: 2.796 ± 1.505
1.864AsnMet: 1.864 ± 2.334
0.932AsnAsn: 0.932 ± 0.502
1.864AsnPro: 1.864 ± 0.666
2.796AsnGln: 2.796 ± 1.505
1.864AsnArg: 1.864 ± 0.666
0.0AsnSer: 0.0 ± 0.0
3.728AsnThr: 3.728 ± 2.006
0.932AsnVal: 0.932 ± 0.502
0.932AsnTrp: 0.932 ± 0.502
0.932AsnTyr: 0.932 ± 0.502
0.0AsnXaa: 0.0 ± 0.0
Pro
2.796ProAla: 2.796 ± 0.164
0.0ProCys: 0.0 ± 0.0
2.796ProAsp: 2.796 ± 1.505
5.592ProGlu: 5.592 ± 1.997
0.0ProPhe: 0.0 ± 0.0
2.796ProGly: 2.796 ± 0.164
0.932ProHis: 0.932 ± 1.167
1.864ProIle: 1.864 ± 0.666
0.932ProLys: 0.932 ± 0.502
4.66ProLeu: 4.66 ± 0.839
2.796ProMet: 2.796 ± 0.164
0.0ProAsn: 0.0 ± 0.0
3.728ProPro: 3.728 ± 1.331
0.932ProGln: 0.932 ± 0.502
3.728ProArg: 3.728 ± 2.006
3.728ProSer: 3.728 ± 0.337
0.0ProThr: 0.0 ± 0.0
2.796ProVal: 2.796 ± 0.164
1.864ProTrp: 1.864 ± 0.666
0.932ProTyr: 0.932 ± 0.502
0.0ProXaa: 0.0 ± 0.0
Gln
1.864GlnAla: 1.864 ± 1.003
0.932GlnCys: 0.932 ± 1.167
0.0GlnAsp: 0.0 ± 0.0
5.592GlnGlu: 5.592 ± 1.997
0.932GlnPhe: 0.932 ± 0.502
0.0GlnGly: 0.0 ± 0.0
0.0GlnHis: 0.0 ± 0.0
1.864GlnIle: 1.864 ± 0.666
6.524GlnLys: 6.524 ± 0.173
5.592GlnLeu: 5.592 ± 1.997
0.932GlnMet: 0.932 ± 0.502
0.0GlnAsn: 0.0 ± 0.0
3.728GlnPro: 3.728 ± 1.331
1.864GlnGln: 1.864 ± 1.003
3.728GlnArg: 3.728 ± 0.337
3.728GlnSer: 3.728 ± 0.337
0.932GlnThr: 0.932 ± 0.502
2.796GlnVal: 2.796 ± 1.505
0.932GlnTrp: 0.932 ± 1.167
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
6.524ArgAla: 6.524 ± 3.511
1.864ArgCys: 1.864 ± 1.003
5.592ArgAsp: 5.592 ± 3.009
3.728ArgGlu: 3.728 ± 2.006
1.864ArgPhe: 1.864 ± 0.666
4.66ArgGly: 4.66 ± 0.839
0.0ArgHis: 0.0 ± 0.0
1.864ArgIle: 1.864 ± 1.003
1.864ArgLys: 1.864 ± 1.003
8.388ArgLeu: 8.388 ± 5.499
0.0ArgMet: 0.0 ± 0.0
3.728ArgAsn: 3.728 ± 1.331
0.932ArgPro: 0.932 ± 0.502
0.0ArgGln: 0.0 ± 0.0
5.592ArgArg: 5.592 ± 3.009
4.66ArgSer: 4.66 ± 2.508
3.728ArgThr: 3.728 ± 1.331
3.728ArgVal: 3.728 ± 0.337
1.864ArgTrp: 1.864 ± 0.666
0.932ArgTyr: 0.932 ± 0.502
0.0ArgXaa: 0.0 ± 0.0
Ser
4.66SerAla: 4.66 ± 2.508
0.932SerCys: 0.932 ± 1.167
4.66SerAsp: 4.66 ± 2.508
6.524SerGlu: 6.524 ± 1.842
5.592SerPhe: 5.592 ± 1.341
8.388SerGly: 8.388 ± 1.177
1.864SerHis: 1.864 ± 1.003
3.728SerIle: 3.728 ± 2.006
3.728SerLys: 3.728 ± 2.006
4.66SerLeu: 4.66 ± 2.508
2.796SerMet: 2.796 ± 1.833
0.932SerAsn: 0.932 ± 1.167
0.932SerPro: 0.932 ± 0.502
3.728SerGln: 3.728 ± 2.006
5.592SerArg: 5.592 ± 3.009
10.252SerSer: 10.252 ± 3.849
3.728SerThr: 3.728 ± 0.337
4.66SerVal: 4.66 ± 0.839
1.864SerTrp: 1.864 ± 2.334
2.796SerTyr: 2.796 ± 0.164
0.0SerXaa: 0.0 ± 0.0
Thr
3.728ThrAla: 3.728 ± 2.006
1.864ThrCys: 1.864 ± 0.666
5.592ThrAsp: 5.592 ± 1.997
0.932ThrGlu: 0.932 ± 0.502
0.932ThrPhe: 0.932 ± 0.502
5.592ThrGly: 5.592 ± 0.328
2.796ThrHis: 2.796 ± 1.505
4.66ThrIle: 4.66 ± 0.83
0.932ThrLys: 0.932 ± 0.502
3.728ThrLeu: 3.728 ± 1.331
0.932ThrMet: 0.932 ± 1.167
2.796ThrAsn: 2.796 ± 1.833
2.796ThrPro: 2.796 ± 1.505
1.864ThrGln: 1.864 ± 1.003
0.0ThrArg: 0.0 ± 0.0
3.728ThrSer: 3.728 ± 2.006
0.932ThrThr: 0.932 ± 0.502
3.728ThrVal: 3.728 ± 0.337
0.932ThrTrp: 0.932 ± 0.502
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
7.456ValAla: 7.456 ± 0.994
2.796ValCys: 2.796 ± 0.164
4.66ValAsp: 4.66 ± 0.839
6.524ValGlu: 6.524 ± 0.173
1.864ValPhe: 1.864 ± 2.334
1.864ValGly: 1.864 ± 0.666
0.0ValHis: 0.0 ± 0.0
1.864ValIle: 1.864 ± 1.003
3.728ValLys: 3.728 ± 2.006
3.728ValLeu: 3.728 ± 0.337
0.932ValMet: 0.932 ± 0.502
0.932ValAsn: 0.932 ± 0.502
1.864ValPro: 1.864 ± 0.666
3.728ValGln: 3.728 ± 0.337
1.864ValArg: 1.864 ± 1.003
8.388ValSer: 8.388 ± 2.845
4.66ValThr: 4.66 ± 0.839
2.796ValVal: 2.796 ± 0.164
0.932ValTrp: 0.932 ± 0.502
1.864ValTyr: 1.864 ± 0.666
0.0ValXaa: 0.0 ± 0.0
Trp
0.932TrpAla: 0.932 ± 0.502
0.932TrpCys: 0.932 ± 1.167
0.932TrpAsp: 0.932 ± 1.167
0.932TrpGlu: 0.932 ± 0.502
0.932TrpPhe: 0.932 ± 0.502
2.796TrpGly: 2.796 ± 1.833
0.0TrpHis: 0.0 ± 0.0
0.0TrpIle: 0.0 ± 0.0
1.864TrpLys: 1.864 ± 0.666
3.728TrpLeu: 3.728 ± 4.669
0.932TrpMet: 0.932 ± 0.337
0.932TrpAsn: 0.932 ± 0.502
0.0TrpPro: 0.0 ± 0.0
0.932TrpGln: 0.932 ± 0.502
1.864TrpArg: 1.864 ± 0.666
0.932TrpSer: 0.932 ± 0.502
1.864TrpThr: 1.864 ± 2.334
0.932TrpVal: 0.932 ± 1.167
0.932TrpTrp: 0.932 ± 0.502
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.728TyrAla: 3.728 ± 1.331
1.864TyrCys: 1.864 ± 2.334
0.932TyrAsp: 0.932 ± 1.167
1.864TyrGlu: 1.864 ± 1.003
0.0TyrPhe: 0.0 ± 0.0
1.864TyrGly: 1.864 ± 1.003
1.864TyrHis: 1.864 ± 1.003
0.0TyrIle: 0.0 ± 0.0
1.864TyrLys: 1.864 ± 1.003
1.864TyrLeu: 1.864 ± 1.003
2.796TyrMet: 2.796 ± 3.502
0.932TyrAsn: 0.932 ± 0.502
0.932TyrPro: 0.932 ± 0.502
1.864TyrGln: 1.864 ± 0.666
2.796TyrArg: 2.796 ± 0.164
1.864TyrSer: 1.864 ± 1.003
3.728TyrThr: 3.728 ± 2.006
1.864TyrVal: 1.864 ± 0.666
0.932TyrTrp: 0.932 ± 1.167
0.932TyrTyr: 0.932 ± 1.167
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1074 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
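A protein-level bootstrap can be sketched as follows. The page does not say which estimator is reported after the ± sign, so this example assumes it is the standard deviation of the frequency across resampled protein sets; the function names and the fixed seed are likewise assumptions made for illustration.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide over a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Spread of the dipeptide frequency when whole proteins are resampled."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    # Assumed estimator: population standard deviation of the replicates.
    return statistics.pstdev(estimates)
```

Because whole proteins are resampled, a proteome with only two proteins yields very few distinct resamples, so the resulting error values take only a limited set of levels.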

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski