Amino acid dipeptide frequency for Beihai sobemo-like virus 17

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
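As an illustration of what a dipeptide frequency is, here is a minimal Python sketch. It assumes that overlapping pairs are counted within each protein and that each count is divided by the total number of pairs; the page does not say how its denominator is defined, so treat that as an assumption. The function name is hypothetical, and the sketch uses one-letter codes where the table uses three-letter codes.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Assumption: overlapping adjacent pairs are counted inside each protein
    (never across protein boundaries) and normalized by the total pair count.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


if __name__ == "__main__":
    # Toy input, not the real proteome.
    for pair, freq in sorted(dipeptide_per_mille(["MAAC", "AC"]).items()):
        print(pair, round(freq, 3))
```

The order of the two residues matters: "AC" and "CA" are counted as different dipeptides, which is why there are 20 × 20 combinations rather than 210.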

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each of the 400 dipeptides would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
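The uniform baseline and the red/blue marking can be sketched as follows. This is only an illustration: the function names are hypothetical, and the exact rule the page uses (which baseline, whether there is a tolerance band) is not stated, so a plain above/below comparison is assumed here.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform baseline in per mille: every ordered pair is equally likely."""
    return 1000.0 / (alphabet_size ** 2)


def mark(value, baseline):
    """Assumed rule: strictly above the baseline is red, strictly below is blue."""
    if value > baseline:
        return "red"
    if value < baseline:
        return "blue"
    return "none"


if __name__ == "__main__":
    base = expected_per_mille(20)    # 400 dipeptides
    print(base)                      # 2.5
    print(expected_per_mille(21))    # 441 dipeptides when Xaa is included
    print(mark(5.435, base))         # AlaAla from the table
    print(mark(0.906, base))         # AlaGly from the table
```

With the 20-letter alphabet the baseline is 2.5‰, so under this assumed rule AlaAla (5.435‰) would be marked red and AlaGly (0.906‰) blue.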

All values are presented in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
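For readers who prefer fractions or percentages, the conversion is simple arithmetic. The helper names below are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def per_mille_to_percent(value):
    """Convert a per mille value to percent (divide by 10)."""
    return value / 10.0


if __name__ == "__main__":
    print(per_mille_to_fraction(5.435))  # AlaAla as a fraction
    print(per_mille_to_percent(2.5))     # uniform baseline in percent
```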

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
5.435AlaAla: 5.435 ± 2.805
1.812AlaCys: 1.812 ± 2.035
4.529AlaAsp: 4.529 ± 2.118
3.623AlaGlu: 3.623 ± 1.87
4.529AlaPhe: 4.529 ± 0.852
0.906AlaGly: 0.906 ± 0.467
2.717AlaHis: 2.717 ± 1.402
3.623AlaIle: 3.623 ± 4.07
4.529AlaLys: 4.529 ± 0.633
1.812AlaLeu: 1.812 ± 0.935
3.623AlaMet: 3.623 ± 1.1
1.812AlaAsn: 1.812 ± 0.935
7.246AlaPro: 7.246 ± 0.769
1.812AlaGln: 1.812 ± 0.935
0.906AlaArg: 0.906 ± 0.467
6.341AlaSer: 6.341 ± 3.272
3.623AlaThr: 3.623 ± 1.1
7.246AlaVal: 7.246 ± 0.769
1.812AlaTrp: 1.812 ± 0.935
2.717AlaTyr: 2.717 ± 0.083
0.0AlaXaa: 0.0 ± 0.0
Cys
0.906CysAla: 0.906 ± 1.018
0.0CysCys: 0.0 ± 0.0
0.0CysAsp: 0.0 ± 0.0
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
1.812CysGly: 1.812 ± 2.035
1.812CysHis: 1.812 ± 2.035
0.0CysIle: 0.0 ± 0.0
0.0CysLys: 0.0 ± 0.0
0.0CysLeu: 0.0 ± 0.0
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
2.717CysPro: 2.717 ± 1.568
1.812CysGln: 1.812 ± 0.55
0.906CysArg: 0.906 ± 0.467
1.812CysSer: 1.812 ± 0.935
0.906CysThr: 0.906 ± 0.467
0.0CysVal: 0.0 ± 0.0
0.906CysTrp: 0.906 ± 0.467
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.623AspAla: 3.623 ± 0.385
0.0AspCys: 0.0 ± 0.0
4.529AspAsp: 4.529 ± 2.118
3.623AspGlu: 3.623 ± 1.87
3.623AspPhe: 3.623 ± 1.87
6.341AspGly: 6.341 ± 0.302
0.906AspHis: 0.906 ± 0.467
0.0AspIle: 0.0 ± 0.0
5.435AspLys: 5.435 ± 1.65
4.529AspLeu: 4.529 ± 2.118
2.717AspMet: 2.717 ± 1.402
2.717AspAsn: 2.717 ± 1.402
2.717AspPro: 2.717 ± 1.568
1.812AspGln: 1.812 ± 0.55
2.717AspArg: 2.717 ± 0.083
2.717AspSer: 2.717 ± 0.083
4.529AspThr: 4.529 ± 2.118
0.906AspVal: 0.906 ± 0.467
0.906AspTrp: 0.906 ± 1.018
0.906AspTyr: 0.906 ± 0.467
0.0AspXaa: 0.0 ± 0.0
Glu
6.341GluAla: 6.341 ± 3.272
0.0GluCys: 0.0 ± 0.0
2.717GluAsp: 2.717 ± 1.402
3.623GluGlu: 3.623 ± 0.385
1.812GluPhe: 1.812 ± 0.55
3.623GluGly: 3.623 ± 1.1
1.812GluHis: 1.812 ± 0.55
0.906GluIle: 0.906 ± 1.018
1.812GluLys: 1.812 ± 0.935
5.435GluLeu: 5.435 ± 0.165
0.906GluMet: 0.906 ± 0.467
2.717GluAsn: 2.717 ± 1.402
6.341GluPro: 6.341 ± 0.302
3.623GluGln: 3.623 ± 1.87
2.717GluArg: 2.717 ± 1.402
5.435GluSer: 5.435 ± 2.805
6.341GluThr: 6.341 ± 1.787
7.246GluVal: 7.246 ± 0.716
0.0GluTrp: 0.0 ± 0.0
0.906GluTyr: 0.906 ± 0.467
0.0GluXaa: 0.0 ± 0.0
Phe
1.812PheAla: 1.812 ± 0.55
0.906PheCys: 0.906 ± 1.018
1.812PheAsp: 1.812 ± 0.55
6.341PheGlu: 6.341 ± 3.272
3.623PhePhe: 3.623 ± 0.385
3.623PheGly: 3.623 ± 1.1
0.906PheHis: 0.906 ± 0.467
4.529PheIle: 4.529 ± 0.852
1.812PheLys: 1.812 ± 2.035
3.623PheLeu: 3.623 ± 1.87
0.906PheMet: 0.906 ± 1.018
0.906PheAsn: 0.906 ± 1.018
0.906PhePro: 0.906 ± 1.018
1.812PheGln: 1.812 ± 0.935
4.529PheArg: 4.529 ± 0.633
4.529PheSer: 4.529 ± 2.337
2.717PheThr: 2.717 ± 1.402
1.812PheVal: 1.812 ± 0.55
0.0PheTrp: 0.0 ± 0.0
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.717GlyAla: 2.717 ± 1.402
0.906GlyCys: 0.906 ± 1.018
3.623GlyAsp: 3.623 ± 1.1
2.717GlyGlu: 2.717 ± 1.402
5.435GlyPhe: 5.435 ± 0.165
5.435GlyGly: 5.435 ± 1.32
0.0GlyHis: 0.0 ± 0.0
3.623GlyIle: 3.623 ± 1.1
1.812GlyLys: 1.812 ± 0.55
2.717GlyLeu: 2.717 ± 0.083
1.812GlyMet: 1.812 ± 0.541
2.717GlyAsn: 2.717 ± 0.083
4.529GlyPro: 4.529 ± 0.633
3.623GlyGln: 3.623 ± 1.87
3.623GlyArg: 3.623 ± 1.1
2.717GlySer: 2.717 ± 1.568
2.717GlyThr: 2.717 ± 0.083
7.246GlyVal: 7.246 ± 3.686
1.812GlyTrp: 1.812 ± 2.035
3.623GlyTyr: 3.623 ± 2.585
0.0GlyXaa: 0.0 ± 0.0
His
0.906HisAla: 0.906 ± 1.018
0.0HisCys: 0.0 ± 0.0
0.906HisAsp: 0.906 ± 0.467
0.906HisGlu: 0.906 ± 1.018
0.0HisPhe: 0.0 ± 0.0
0.0HisGly: 0.0 ± 0.0
1.812HisHis: 1.812 ± 0.55
1.812HisIle: 1.812 ± 2.035
1.812HisLys: 1.812 ± 0.55
2.717HisLeu: 2.717 ± 0.083
0.0HisMet: 0.0 ± 0.0
0.906HisAsn: 0.906 ± 0.467
1.812HisPro: 1.812 ± 0.55
0.906HisGln: 0.906 ± 0.467
0.906HisArg: 0.906 ± 0.467
2.717HisSer: 2.717 ± 0.083
0.906HisThr: 0.906 ± 0.467
1.812HisVal: 1.812 ± 0.935
0.0HisTrp: 0.0 ± 0.0
0.0HisTyr: 0.0 ± 0.0
0.0HisXaa: 0.0 ± 0.0
Ile
0.906IleAla: 0.906 ± 1.018
0.0IleCys: 0.0 ± 0.0
4.529IleAsp: 4.529 ± 0.633
1.812IleGlu: 1.812 ± 0.935
2.717IlePhe: 2.717 ± 0.083
2.717IleGly: 2.717 ± 0.083
0.0IleHis: 0.0 ± 0.0
0.906IleIle: 0.906 ± 1.018
3.623IleLys: 3.623 ± 1.1
7.246IleLeu: 7.246 ± 2.201
0.906IleMet: 0.906 ± 1.018
1.812IleAsn: 1.812 ± 0.935
0.0IlePro: 0.0 ± 0.0
2.717IleGln: 2.717 ± 0.083
1.812IleArg: 1.812 ± 2.035
3.623IleSer: 3.623 ± 0.385
0.906IleThr: 0.906 ± 0.467
2.717IleVal: 2.717 ± 1.568
0.0IleTrp: 0.0 ± 0.0
0.906IleTyr: 0.906 ± 0.467
0.0IleXaa: 0.0 ± 0.0
Lys
3.623LysAla: 3.623 ± 0.385
0.0LysCys: 0.0 ± 0.0
2.717LysAsp: 2.717 ± 0.083
5.435LysGlu: 5.435 ± 0.165
1.812LysPhe: 1.812 ± 2.035
0.906LysGly: 0.906 ± 0.467
1.812LysHis: 1.812 ± 0.55
1.812LysIle: 1.812 ± 2.035
6.341LysLys: 6.341 ± 0.302
3.623LysLeu: 3.623 ± 1.1
1.812LysMet: 1.812 ± 0.55
3.623LysAsn: 3.623 ± 1.1
5.435LysPro: 5.435 ± 0.165
3.623LysGln: 3.623 ± 2.585
2.717LysArg: 2.717 ± 1.402
5.435LysSer: 5.435 ± 0.165
1.812LysThr: 1.812 ± 0.55
5.435LysVal: 5.435 ± 1.32
0.0LysTrp: 0.0 ± 0.0
1.812LysTyr: 1.812 ± 0.935
0.0LysXaa: 0.0 ± 0.0
Leu
8.152LeuAla: 8.152 ± 2.722
1.812LeuCys: 1.812 ± 0.935
1.812LeuAsp: 1.812 ± 0.55
6.341LeuGlu: 6.341 ± 0.302
4.529LeuPhe: 4.529 ± 0.852
8.152LeuGly: 8.152 ± 4.703
1.812LeuHis: 1.812 ± 0.55
2.717LeuIle: 2.717 ± 1.402
4.529LeuLys: 4.529 ± 0.633
6.341LeuLeu: 6.341 ± 0.302
0.906LeuMet: 0.906 ± 0.467
4.529LeuAsn: 4.529 ± 0.633
2.717LeuPro: 2.717 ± 1.568
4.529LeuGln: 4.529 ± 0.852
9.058LeuArg: 9.058 ± 0.219
7.246LeuSer: 7.246 ± 0.769
1.812LeuThr: 1.812 ± 0.935
2.717LeuVal: 2.717 ± 1.402
2.717LeuTrp: 2.717 ± 1.568
2.717LeuTyr: 2.717 ± 3.053
0.0LeuXaa: 0.0 ± 0.0
Met
2.717MetAla: 2.717 ± 0.083
1.812MetCys: 1.812 ± 0.55
0.906MetAsp: 0.906 ± 1.018
2.717MetGlu: 2.717 ± 1.402
0.906MetPhe: 0.906 ± 0.467
1.812MetGly: 1.812 ± 0.55
0.906MetHis: 0.906 ± 0.467
0.906MetIle: 0.906 ± 0.467
1.812MetLys: 1.812 ± 0.935
2.717MetLeu: 2.717 ± 1.568
1.812MetMet: 1.812 ± 0.935
0.0MetAsn: 0.0 ± 0.0
3.623MetPro: 3.623 ± 1.1
0.906MetGln: 0.906 ± 1.018
1.812MetArg: 1.812 ± 0.55
0.0MetSer: 0.0 ± 0.0
0.906MetThr: 0.906 ± 0.467
0.906MetVal: 0.906 ± 1.018
0.0MetTrp: 0.0 ± 0.0
0.0MetTyr: 0.0 ± 0.0
0.0MetXaa: 0.0 ± 0.0
Asn
0.906AsnAla: 0.906 ± 0.467
0.0AsnCys: 0.0 ± 0.0
0.906AsnAsp: 0.906 ± 1.018
4.529AsnGlu: 4.529 ± 0.633
2.717AsnPhe: 2.717 ± 0.083
3.623AsnGly: 3.623 ± 1.1
0.906AsnHis: 0.906 ± 1.018
0.906AsnIle: 0.906 ± 0.467
0.906AsnLys: 0.906 ± 0.467
7.246AsnLeu: 7.246 ± 3.739
0.906AsnMet: 0.906 ± 0.467
0.906AsnAsn: 0.906 ± 0.467
1.812AsnPro: 1.812 ± 0.55
2.717AsnGln: 2.717 ± 1.402
0.906AsnArg: 0.906 ± 1.018
3.623AsnSer: 3.623 ± 0.385
1.812AsnThr: 1.812 ± 0.55
0.906AsnVal: 0.906 ± 1.018
1.812AsnTrp: 1.812 ± 0.935
4.529AsnTyr: 4.529 ± 2.337
0.0AsnXaa: 0.0 ± 0.0
Pro
2.717ProAla: 2.717 ± 1.568
0.0ProCys: 0.0 ± 0.0
4.529ProAsp: 4.529 ± 0.852
0.906ProGlu: 0.906 ± 1.018
2.717ProPhe: 2.717 ± 0.083
3.623ProGly: 3.623 ± 4.07
0.906ProHis: 0.906 ± 1.018
1.812ProIle: 1.812 ± 0.55
5.435ProLys: 5.435 ± 3.135
1.812ProLeu: 1.812 ± 0.55
1.812ProMet: 1.812 ± 0.55
3.623ProAsn: 3.623 ± 0.385
4.529ProPro: 4.529 ± 0.852
4.529ProGln: 4.529 ± 2.337
5.435ProArg: 5.435 ± 2.805
5.435ProSer: 5.435 ± 0.165
1.812ProThr: 1.812 ± 0.935
4.529ProVal: 4.529 ± 0.633
0.906ProTrp: 0.906 ± 1.018
0.906ProTyr: 0.906 ± 0.467
0.0ProXaa: 0.0 ± 0.0
Gln
2.717GlnAla: 2.717 ± 1.402
1.812GlnCys: 1.812 ± 0.935
3.623GlnAsp: 3.623 ± 0.385
6.341GlnGlu: 6.341 ± 0.302
1.812GlnPhe: 1.812 ± 0.55
3.623GlnGly: 3.623 ± 0.385
0.0GlnHis: 0.0 ± 0.0
1.812GlnIle: 1.812 ± 0.55
4.529GlnLys: 4.529 ± 0.852
5.435GlnLeu: 5.435 ± 0.165
1.812GlnMet: 1.812 ± 0.853
2.717GlnAsn: 2.717 ± 0.083
0.906GlnPro: 0.906 ± 0.467
1.812GlnGln: 1.812 ± 0.935
0.906GlnArg: 0.906 ± 0.467
2.717GlnSer: 2.717 ± 0.083
3.623GlnThr: 3.623 ± 0.385
5.435GlnVal: 5.435 ± 2.805
1.812GlnTrp: 1.812 ± 0.935
0.0GlnTyr: 0.0 ± 0.0
0.0GlnXaa: 0.0 ± 0.0
Arg
6.341ArgAla: 6.341 ± 0.302
0.0ArgCys: 0.0 ± 0.0
4.529ArgAsp: 4.529 ± 0.852
0.906ArgGlu: 0.906 ± 1.018
1.812ArgPhe: 1.812 ± 0.55
3.623ArgGly: 3.623 ± 0.385
1.812ArgHis: 1.812 ± 0.935
1.812ArgIle: 1.812 ± 0.55
5.435ArgLys: 5.435 ± 0.165
2.717ArgLeu: 2.717 ± 1.402
0.0ArgMet: 0.0 ± 0.0
4.529ArgAsn: 4.529 ± 2.337
0.906ArgPro: 0.906 ± 0.467
1.812ArgGln: 1.812 ± 0.935
2.717ArgArg: 2.717 ± 0.083
3.623ArgSer: 3.623 ± 0.385
2.717ArgThr: 2.717 ± 0.083
6.341ArgVal: 6.341 ± 0.302
1.812ArgTrp: 1.812 ± 0.55
0.906ArgTyr: 0.906 ± 0.467
0.0ArgXaa: 0.0 ± 0.0
Ser
5.435SerAla: 5.435 ± 0.165
0.906SerCys: 0.906 ± 1.018
5.435SerAsp: 5.435 ± 2.805
3.623SerGlu: 3.623 ± 1.87
1.812SerPhe: 1.812 ± 0.935
5.435SerGly: 5.435 ± 1.65
0.0SerHis: 0.0 ± 0.0
2.717SerIle: 2.717 ± 1.402
1.812SerLys: 1.812 ± 0.935
6.341SerLeu: 6.341 ± 0.302
2.717SerMet: 2.717 ± 0.083
2.717SerAsn: 2.717 ± 1.402
2.717SerPro: 2.717 ± 0.083
6.341SerGln: 6.341 ± 1.787
4.529SerArg: 4.529 ± 0.852
3.623SerSer: 3.623 ± 0.385
4.529SerThr: 4.529 ± 0.852
1.812SerVal: 1.812 ± 0.935
1.812SerTrp: 1.812 ± 0.55
6.341SerTyr: 6.341 ± 1.183
0.0SerXaa: 0.0 ± 0.0
Thr
4.529ThrAla: 4.529 ± 0.633
0.906ThrCys: 0.906 ± 1.018
2.717ThrAsp: 2.717 ± 0.083
2.717ThrGlu: 2.717 ± 1.568
2.717ThrPhe: 2.717 ± 1.402
4.529ThrGly: 4.529 ± 0.852
0.906ThrHis: 0.906 ± 0.467
3.623ThrIle: 3.623 ± 2.585
2.717ThrLys: 2.717 ± 0.083
6.341ThrLeu: 6.341 ± 1.183
0.906ThrMet: 0.906 ± 0.467
1.812ThrAsn: 1.812 ± 0.55
3.623ThrPro: 3.623 ± 0.385
2.717ThrGln: 2.717 ± 1.402
3.623ThrArg: 3.623 ± 1.87
3.623ThrSer: 3.623 ± 1.87
0.0ThrThr: 0.0 ± 0.0
0.906ThrVal: 0.906 ± 1.018
0.0ThrTrp: 0.0 ± 0.0
0.906ThrTyr: 0.906 ± 0.467
0.0ThrXaa: 0.0 ± 0.0
Val
4.529ValAla: 4.529 ± 2.118
0.906ValCys: 0.906 ± 0.467
1.812ValAsp: 1.812 ± 0.935
2.717ValGlu: 2.717 ± 1.402
3.623ValPhe: 3.623 ± 1.1
3.623ValGly: 3.623 ± 1.87
0.0ValHis: 0.0 ± 0.0
4.529ValIle: 4.529 ± 2.337
4.529ValLys: 4.529 ± 0.852
9.058ValLeu: 9.058 ± 1.266
0.906ValMet: 0.906 ± 1.018
2.717ValAsn: 2.717 ± 1.568
2.717ValPro: 2.717 ± 0.083
3.623ValGln: 3.623 ± 0.385
2.717ValArg: 2.717 ± 0.083
4.529ValSer: 4.529 ± 2.118
5.435ValThr: 5.435 ± 1.65
6.341ValVal: 6.341 ± 0.302
2.717ValTrp: 2.717 ± 1.402
2.717ValTyr: 2.717 ± 0.083
0.0ValXaa: 0.0 ± 0.0
Trp
2.717TrpAla: 2.717 ± 0.083
0.906TrpCys: 0.906 ± 0.467
1.812TrpAsp: 1.812 ± 2.035
2.717TrpGlu: 2.717 ± 1.402
0.0TrpPhe: 0.0 ± 0.0
0.0TrpGly: 0.0 ± 0.0
0.0TrpHis: 0.0 ± 0.0
0.906TrpIle: 0.906 ± 0.467
0.906TrpLys: 0.906 ± 1.018
3.623TrpLeu: 3.623 ± 0.385
1.812TrpMet: 1.812 ± 0.55
0.0TrpAsn: 0.0 ± 0.0
0.906TrpPro: 0.906 ± 0.467
0.906TrpGln: 0.906 ± 1.018
0.0TrpArg: 0.0 ± 0.0
0.0TrpSer: 0.0 ± 0.0
1.812TrpThr: 1.812 ± 0.55
0.906TrpVal: 0.906 ± 1.018
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.623TyrAla: 3.623 ± 0.385
0.906TyrCys: 0.906 ± 1.018
1.812TyrAsp: 1.812 ± 0.935
2.717TyrGlu: 2.717 ± 1.402
0.906TyrPhe: 0.906 ± 0.467
0.906TyrGly: 0.906 ± 0.467
1.812TyrHis: 1.812 ± 0.55
0.906TyrIle: 0.906 ± 0.467
0.0TyrLys: 0.0 ± 0.0
1.812TyrLeu: 1.812 ± 0.55
0.0TyrMet: 0.0 ± 0.0
1.812TyrAsn: 1.812 ± 0.55
1.812TyrPro: 1.812 ± 0.55
1.812TyrGln: 1.812 ± 2.035
1.812TyrArg: 1.812 ± 0.935
0.906TyrSer: 0.906 ± 0.467
0.906TyrThr: 0.906 ± 1.018
4.529TyrVal: 4.529 ± 0.852
0.906TyrTrp: 0.906 ± 1.018
4.529TyrTyr: 4.529 ± 0.852
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1105 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
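A protein-level bootstrap of this kind could look roughly like the sketch below. The page gives no details of its procedure, so everything here is an assumption: whole proteins are resampled with replacement, the frequencies are recomputed for each of the 100 resamples, and the standard deviation across resamples is reported as the error. All function names are hypothetical.

```python
import random
import statistics
from collections import Counter


def per_mille(sequences):
    """Dipeptide frequencies in per mille (overlapping pairs within each protein)."""
    counts = Counter(
        seq[i:i + 2] for seq in sequences for i in range(len(seq) - 1)
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def bootstrap_error(proteins, rounds=100, seed=0):
    """Estimate per-dipeptide error by resampling whole proteins with replacement.

    Assumption: the error is the population standard deviation of the
    frequency across the resampled proteomes.
    """
    rng = random.Random(seed)
    samples = []
    for _ in range(rounds):
        resampled = [rng.choice(proteins) for _ in proteins]
        samples.append(per_mille(resampled))
    pairs = {pair for sample in samples for pair in sample}
    return {
        pair: statistics.pstdev([sample.get(pair, 0.0) for sample in samples])
        for pair in pairs
    }


if __name__ == "__main__":
    # Toy sequences, not the real proteome.
    errors = bootstrap_error(["MAACK", "MKKLA"])
    for pair in sorted(errors):
        print(pair, round(errors[pair], 3))
```

With only 2 proteins, each resample can take just three distinct compositions (first protein twice, second protein twice, or one of each), so error estimates from such a small proteome should be read with caution.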

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski