Amino acid dipeptide frequency for Salmovirus WFRC1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with every amino acid equally frequent, each dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
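The expected value quoted above follows from simple arithmetic, sketched below. The function names and the "over"/"under" labels are illustrative assumptions; the exact rule the page uses to colour cells is not stated here, so comparing against the uniform expectation is only one plausible reading.

```python
def uniform_expected_per_mille(n_residue_types=20):
    """Expected per mille frequency of one dipeptide if all ordered
    pairs were equally likely (20 types -> 400 pairs, 21 -> 441)."""
    return 1000.0 / (n_residue_types ** 2)


def classify(value_per_mille, n_residue_types=20):
    """Label a table value relative to the uniform expectation.

    Assumption: a plain comparison with the expected value; the
    original page may apply a different threshold.
    """
    expected = uniform_expected_per_mille(n_residue_types)
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "expected"


print(uniform_expected_per_mille())    # 20 amino acids, 400 dipeptides
print(uniform_expected_per_mille(21))  # with Xaa, 441 dipeptides
print(classify(8.136), classify(0.74))
```

With 20 amino acids the expectation is 2.5‰ (0.25%); with Xaa included it drops to roughly 2.27‰.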

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
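As a rough illustration of how such per mille values can be obtained, the sketch below counts overlapping adjacent residue pairs in a list of one-letter protein sequences. It is not the Proteome-pI implementation: the function name, the choice to count pairs only within each protein, and the mapping of every non-standard letter to X (Xaa) are assumptions made for this example.

```python
from collections import Counter

STANDARD = set("ACDEFGHIKLMNPQRSTVWY")  # the 20 standard amino acids


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard letters becomes X (Xaa).
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        # Overlapping adjacent pairs, counted within a single protein only.
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not the real proteome: "B" is non-standard and is mapped to X.
freqs = dipeptide_per_mille(["MKKLA", "AAB"])
for pair, value in sorted(freqs.items()):
    print(f"{pair}: {value:.3f} ‰ = {value * 1e-3:.6f} as a fraction")
```

The last line shows the conversion mentioned above: multiplying a per mille value by 10^-3 gives the plain fraction.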

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.136AlaAla: 8.136 ± 2.829
2.219AlaCys: 2.219 ± 0.204
1.479AlaAsp: 1.479 ± 0.222
3.698AlaGlu: 3.698 ± 1.056
2.219AlaPhe: 2.219 ± 0.204
1.479AlaGly: 1.479 ± 0.222
0.74AlaHis: 0.74 ± 0.647
2.959AlaIle: 2.959 ± 1.704
2.219AlaLys: 2.219 ± 0.204
7.396AlaLeu: 7.396 ± 0.034
2.219AlaMet: 2.219 ± 0.869
3.698AlaAsn: 3.698 ± 1.056
4.438AlaPro: 4.438 ± 1.738
2.219AlaGln: 2.219 ± 0.204
3.698AlaArg: 3.698 ± 1.056
8.136AlaSer: 8.136 ± 1.755
3.698AlaThr: 3.698 ± 1.056
6.657AlaVal: 6.657 ± 1.534
0.74AlaTrp: 0.74 ± 0.426
2.959AlaTyr: 2.959 ± 0.443
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
1.479CysCys: 1.479 ± 0.852
2.959CysAsp: 2.959 ± 0.443
3.698CysGlu: 3.698 ± 1.091
0.74CysPhe: 0.74 ± 0.647
2.219CysGly: 2.219 ± 0.204
0.0CysHis: 0.0 ± 0.0
3.698CysIle: 3.698 ± 1.091
1.479CysLys: 1.479 ± 0.852
0.74CysLeu: 0.74 ± 0.647
0.0CysMet: 0.0 ± 0.0
0.0CysAsn: 0.0 ± 0.0
0.0CysPro: 0.0 ± 0.0
2.219CysGln: 2.219 ± 1.942
2.219CysArg: 2.219 ± 0.204
2.959CysSer: 2.959 ± 1.517
0.74CysThr: 0.74 ± 0.647
0.74CysVal: 0.74 ± 0.426
0.74CysTrp: 0.74 ± 0.426
1.479CysTyr: 1.479 ± 0.852
0.0CysXaa: 0.0 ± 0.0
Asp
5.178AspAla: 5.178 ± 0.835
1.479AspCys: 1.479 ± 1.295
2.219AspAsp: 2.219 ± 0.204
2.219AspGlu: 2.219 ± 0.869
2.219AspPhe: 2.219 ± 1.278
3.698AspGly: 3.698 ± 2.164
1.479AspHis: 1.479 ± 0.852
1.479AspIle: 1.479 ± 0.222
3.698AspLys: 3.698 ± 2.164
5.917AspLeu: 5.917 ± 0.187
0.0AspMet: 0.0 ± 0.0
0.74AspAsn: 0.74 ± 0.426
4.438AspPro: 4.438 ± 1.482
2.959AspGln: 2.959 ± 0.63
2.219AspArg: 2.219 ± 0.204
5.917AspSer: 5.917 ± 2.334
0.74AspThr: 0.74 ± 0.647
5.178AspVal: 5.178 ± 0.835
0.0AspTrp: 0.0 ± 0.0
0.0AspTyr: 0.0 ± 0.0
0.0AspXaa: 0.0 ± 0.0
Glu
2.959GluAla: 2.959 ± 0.443
1.479GluCys: 1.479 ± 0.222
4.438GluAsp: 4.438 ± 1.482
0.0GluGlu: 0.0 ± 0.0
5.178GluPhe: 5.178 ± 1.908
5.917GluGly: 5.917 ± 3.033
0.74GluHis: 0.74 ± 0.647
2.959GluIle: 2.959 ± 0.63
3.698GluLys: 3.698 ± 1.056
2.959GluLeu: 2.959 ± 0.63
0.74GluMet: 0.74 ± 0.426
0.74GluAsn: 0.74 ± 0.647
1.479GluPro: 1.479 ± 0.222
1.479GluGln: 1.479 ± 0.222
3.698GluArg: 3.698 ± 1.091
5.178GluSer: 5.178 ± 1.312
2.959GluThr: 2.959 ± 0.63
2.959GluVal: 2.959 ± 0.443
0.74GluTrp: 0.74 ± 0.426
0.74GluTyr: 0.74 ± 0.426
0.0GluXaa: 0.0 ± 0.0
Phe
2.959PheAla: 2.959 ± 0.443
0.0PheCys: 0.0 ± 0.0
0.74PheAsp: 0.74 ± 0.426
2.219PheGlu: 2.219 ± 1.278
4.438PhePhe: 4.438 ± 0.409
7.396PheGly: 7.396 ± 1.108
0.0PheHis: 0.0 ± 0.0
2.959PheIle: 2.959 ± 0.443
1.479PheLys: 1.479 ± 0.222
3.698PheLeu: 3.698 ± 1.056
0.74PheMet: 0.74 ± 0.426
0.0PheAsn: 0.0 ± 0.0
1.479PhePro: 1.479 ± 0.222
0.74PheGln: 0.74 ± 0.426
2.219PheArg: 2.219 ± 1.278
2.959PheSer: 2.959 ± 0.63
5.178PheThr: 5.178 ± 0.835
5.178PheVal: 5.178 ± 1.908
0.0PheTrp: 0.0 ± 0.0
1.479PheTyr: 1.479 ± 0.852
0.0PheXaa: 0.0 ± 0.0
Gly
6.657GlyAla: 6.657 ± 2.76
3.698GlyCys: 3.698 ± 2.164
5.178GlyAsp: 5.178 ± 1.312
2.959GlyGlu: 2.959 ± 1.517
7.396GlyPhe: 7.396 ± 1.108
4.438GlyGly: 4.438 ± 0.409
1.479GlyHis: 1.479 ± 1.295
1.479GlyIle: 1.479 ± 0.222
2.959GlyLys: 2.959 ± 0.443
7.396GlyLeu: 7.396 ± 4.328
2.959GlyMet: 2.959 ± 0.443
1.479GlyAsn: 1.479 ± 0.852
2.959GlyPro: 2.959 ± 0.443
3.698GlyGln: 3.698 ± 1.056
2.219GlyArg: 2.219 ± 1.278
8.876GlySer: 8.876 ± 1.891
4.438GlyThr: 4.438 ± 2.812
6.657GlyVal: 6.657 ± 0.46
0.74GlyTrp: 0.74 ± 0.426
2.959GlyTyr: 2.959 ± 1.517
0.0GlyXaa: 0.0 ± 0.0
His
0.0HisAla: 0.0 ± 0.0
0.0HisCys: 0.0 ± 0.0
0.74HisAsp: 0.74 ± 0.647
0.0HisGlu: 0.0 ± 0.0
0.0HisPhe: 0.0 ± 0.0
2.219HisGly: 2.219 ± 0.204
1.479HisHis: 1.479 ± 0.222
0.74HisIle: 0.74 ± 0.647
0.0HisLys: 0.0 ± 0.0
1.479HisLeu: 1.479 ± 1.295
0.0HisMet: 0.0 ± 0.0
0.74HisAsn: 0.74 ± 0.647
0.74HisPro: 0.74 ± 0.426
0.0HisGln: 0.0 ± 0.0
1.479HisArg: 1.479 ± 0.852
1.479HisSer: 1.479 ± 0.222
2.959HisThr: 2.959 ± 1.517
1.479HisVal: 1.479 ± 0.222
0.0HisTrp: 0.0 ± 0.0
1.479HisTyr: 1.479 ± 1.295
0.0HisXaa: 0.0 ± 0.0
Ile
2.959IleAla: 2.959 ± 0.443
0.74IleCys: 0.74 ± 0.647
1.479IleAsp: 1.479 ± 0.222
2.219IleGlu: 2.219 ± 0.204
1.479IlePhe: 1.479 ± 0.852
2.219IleGly: 2.219 ± 1.278
0.74IleHis: 0.74 ± 0.426
2.219IleIle: 2.219 ± 0.204
1.479IleLys: 1.479 ± 0.222
5.178IleLeu: 5.178 ± 0.835
2.219IleMet: 2.219 ± 0.204
0.74IleAsn: 0.74 ± 0.647
1.479IlePro: 1.479 ± 0.852
0.74IleGln: 0.74 ± 0.426
0.74IleArg: 0.74 ± 0.426
5.178IleSer: 5.178 ± 1.312
4.438IleThr: 4.438 ± 0.665
1.479IleVal: 1.479 ± 0.222
0.0IleTrp: 0.0 ± 0.0
2.959IleTyr: 2.959 ± 0.443
0.0IleXaa: 0.0 ± 0.0
Lys
4.438LysAla: 4.438 ± 0.409
0.74LysCys: 0.74 ± 0.647
2.219LysAsp: 2.219 ± 0.204
2.959LysGlu: 2.959 ± 0.443
3.698LysPhe: 3.698 ± 0.017
1.479LysGly: 1.479 ± 0.222
0.0LysHis: 0.0 ± 0.0
2.219LysIle: 2.219 ± 0.204
9.615LysLys: 9.615 ± 3.05
0.74LysLeu: 0.74 ± 0.647
0.74LysMet: 0.74 ± 0.426
0.74LysAsn: 0.74 ± 0.647
4.438LysPro: 4.438 ± 0.665
2.219LysGln: 2.219 ± 0.204
7.396LysArg: 7.396 ± 0.034
4.438LysSer: 4.438 ± 0.409
2.959LysThr: 2.959 ± 0.443
2.959LysVal: 2.959 ± 0.443
2.219LysTrp: 2.219 ± 0.869
2.959LysTyr: 2.959 ± 1.517
0.0LysXaa: 0.0 ± 0.0
Leu
5.178LeuAla: 5.178 ± 0.239
3.698LeuCys: 3.698 ± 2.13
3.698LeuAsp: 3.698 ± 1.056
6.657LeuGlu: 6.657 ± 0.46
3.698LeuPhe: 3.698 ± 1.056
6.657LeuGly: 6.657 ± 1.534
2.219LeuHis: 2.219 ± 0.869
2.959LeuIle: 2.959 ± 0.443
4.438LeuLys: 4.438 ± 1.482
9.615LeuLeu: 9.615 ± 3.391
3.698LeuMet: 3.698 ± 1.056
1.479LeuAsn: 1.479 ± 0.852
2.959LeuPro: 2.959 ± 1.517
2.219LeuGln: 2.219 ± 0.869
2.219LeuArg: 2.219 ± 0.869
7.396LeuSer: 7.396 ± 1.039
2.219LeuThr: 2.219 ± 0.204
2.959LeuVal: 2.959 ± 0.443
2.959LeuTrp: 2.959 ± 0.443
0.74LeuTyr: 0.74 ± 0.426
0.0LeuXaa: 0.0 ± 0.0
Met
1.479MetAla: 1.479 ± 1.295
2.219MetCys: 2.219 ± 0.869
0.74MetAsp: 0.74 ± 0.426
1.479MetGlu: 1.479 ± 0.852
0.74MetPhe: 0.74 ± 0.426
2.959MetGly: 2.959 ± 1.517
0.0MetHis: 0.0 ± 0.0
3.698MetIle: 3.698 ± 2.13
1.479MetLys: 1.479 ± 0.222
0.74MetLeu: 0.74 ± 0.647
1.479MetMet: 1.479 ± 0.852
0.0MetAsn: 0.0 ± 0.0
2.219MetPro: 2.219 ± 1.278
0.74MetGln: 0.74 ± 0.647
0.0MetArg: 0.0 ± 0.0
5.178MetSer: 5.178 ± 0.835
2.219MetThr: 2.219 ± 0.869
0.74MetVal: 0.74 ± 0.426
0.74MetTrp: 0.74 ± 0.426
0.74MetTyr: 0.74 ± 0.426
0.0MetXaa: 0.0 ± 0.0
Asn
0.74AsnAla: 0.74 ± 0.426
0.74AsnCys: 0.74 ± 0.426
1.479AsnAsp: 1.479 ± 0.222
0.74AsnGlu: 0.74 ± 0.426
0.74AsnPhe: 0.74 ± 0.426
5.178AsnGly: 5.178 ± 1.312
0.0AsnHis: 0.0 ± 0.0
0.0AsnIle: 0.0 ± 0.0
1.479AsnLys: 1.479 ± 0.222
0.0AsnLeu: 0.0 ± 0.0
0.74AsnMet: 0.74 ± 0.426
0.74AsnAsn: 0.74 ± 0.426
2.219AsnPro: 2.219 ± 1.278
0.74AsnGln: 0.74 ± 0.426
2.219AsnArg: 2.219 ± 0.869
2.959AsnSer: 2.959 ± 0.63
2.959AsnThr: 2.959 ± 0.63
1.479AsnVal: 1.479 ± 0.222
0.0AsnTrp: 0.0 ± 0.0
0.0AsnTyr: 0.0 ± 0.0
0.0AsnXaa: 0.0 ± 0.0
Pro
2.959ProAla: 2.959 ± 1.704
0.74ProCys: 0.74 ± 0.647
1.479ProAsp: 1.479 ± 0.222
2.959ProGlu: 2.959 ± 0.63
1.479ProPhe: 1.479 ± 0.852
5.178ProGly: 5.178 ± 0.239
0.74ProHis: 0.74 ± 0.647
2.219ProIle: 2.219 ± 0.204
1.479ProLys: 1.479 ± 1.295
2.959ProLeu: 2.959 ± 0.443
2.959ProMet: 2.959 ± 0.71
2.219ProAsn: 2.219 ± 1.278
2.959ProPro: 2.959 ± 1.704
1.479ProGln: 1.479 ± 0.222
1.479ProArg: 1.479 ± 0.852
5.917ProSer: 5.917 ± 1.261
2.959ProThr: 2.959 ± 0.443
6.657ProVal: 6.657 ± 0.46
0.0ProTrp: 0.0 ± 0.0
2.959ProTyr: 2.959 ± 0.63
0.0ProXaa: 0.0 ± 0.0
Gln
1.479GlnAla: 1.479 ± 0.222
1.479GlnCys: 1.479 ± 0.222
2.219GlnAsp: 2.219 ± 0.204
0.0GlnGlu: 0.0 ± 0.0
0.74GlnPhe: 0.74 ± 0.426
2.959GlnGly: 2.959 ± 1.517
0.74GlnHis: 0.74 ± 0.647
0.74GlnIle: 0.74 ± 0.647
2.959GlnLys: 2.959 ± 0.443
0.0GlnLeu: 0.0 ± 0.0
0.0GlnMet: 0.0 ± 0.0
1.479GlnAsn: 1.479 ± 0.222
1.479GlnPro: 1.479 ± 1.295
1.479GlnGln: 1.479 ± 0.222
3.698GlnArg: 3.698 ± 0.017
2.959GlnSer: 2.959 ± 0.63
2.219GlnThr: 2.219 ± 1.278
1.479GlnVal: 1.479 ± 0.852
1.479GlnTrp: 1.479 ± 0.852
0.74GlnTyr: 0.74 ± 0.426
0.0GlnXaa: 0.0 ± 0.0
Arg
3.698ArgAla: 3.698 ± 1.091
1.479ArgCys: 1.479 ± 0.852
3.698ArgAsp: 3.698 ± 1.091
5.178ArgGlu: 5.178 ± 1.312
0.74ArgPhe: 0.74 ± 0.426
0.74ArgGly: 0.74 ± 0.426
0.0ArgHis: 0.0 ± 0.0
3.698ArgIle: 3.698 ± 0.017
4.438ArgLys: 4.438 ± 1.738
5.178ArgLeu: 5.178 ± 2.982
0.74ArgMet: 0.74 ± 0.426
0.74ArgAsn: 0.74 ± 0.426
2.959ArgPro: 2.959 ± 1.704
1.479ArgGln: 1.479 ± 0.222
5.178ArgArg: 5.178 ± 0.239
7.396ArgSer: 7.396 ± 1.039
1.479ArgThr: 1.479 ± 0.852
5.178ArgVal: 5.178 ± 0.239
0.0ArgTrp: 0.0 ± 0.0
2.959ArgTyr: 2.959 ± 0.443
0.0ArgXaa: 0.0 ± 0.0
Ser
6.657SerAla: 6.657 ± 2.607
2.219SerCys: 2.219 ± 0.204
4.438SerAsp: 4.438 ± 0.409
2.959SerGlu: 2.959 ± 1.704
4.438SerPhe: 4.438 ± 1.482
7.396SerGly: 7.396 ± 1.108
1.479SerHis: 1.479 ± 1.295
2.219SerIle: 2.219 ± 0.204
6.657SerLys: 6.657 ± 0.46
11.834SerLeu: 11.834 ± 2.522
4.438SerMet: 4.438 ± 0.409
2.959SerAsn: 2.959 ± 0.443
7.396SerPro: 7.396 ± 1.039
2.219SerGln: 2.219 ± 0.204
8.876SerArg: 8.876 ± 1.891
8.876SerSer: 8.876 ± 1.329
5.917SerThr: 5.917 ± 0.886
9.615SerVal: 9.615 ± 0.903
2.219SerTrp: 2.219 ± 1.278
1.479SerTyr: 1.479 ± 0.852
0.0SerXaa: 0.0 ± 0.0
Thr
4.438ThrAla: 4.438 ± 1.738
1.479ThrCys: 1.479 ± 1.295
5.178ThrAsp: 5.178 ± 0.239
4.438ThrGlu: 4.438 ± 1.738
4.438ThrPhe: 4.438 ± 0.665
7.396ThrGly: 7.396 ± 2.113
2.219ThrHis: 2.219 ± 0.204
2.219ThrIle: 2.219 ± 0.204
2.959ThrLys: 2.959 ± 0.63
2.219ThrLeu: 2.219 ± 0.204
0.0ThrMet: 0.0 ± 0.0
0.74ThrAsn: 0.74 ± 0.647
2.959ThrPro: 2.959 ± 0.443
0.74ThrGln: 0.74 ± 0.426
2.219ThrArg: 2.219 ± 0.869
4.438ThrSer: 4.438 ± 1.482
2.959ThrThr: 2.959 ± 0.63
4.438ThrVal: 4.438 ± 1.482
0.74ThrTrp: 0.74 ± 0.426
0.0ThrTyr: 0.0 ± 0.0
0.0ThrXaa: 0.0 ± 0.0
Val
8.136ValAla: 8.136 ± 1.465
1.479ValCys: 1.479 ± 0.222
4.438ValAsp: 4.438 ± 1.482
2.959ValGlu: 2.959 ± 0.63
0.74ValPhe: 0.74 ± 0.647
6.657ValGly: 6.657 ± 0.46
2.219ValHis: 2.219 ± 0.204
1.479ValIle: 1.479 ± 0.222
4.438ValLys: 4.438 ± 1.738
5.178ValLeu: 5.178 ± 0.835
4.438ValMet: 4.438 ± 0.283
3.698ValAsn: 3.698 ± 0.017
3.698ValPro: 3.698 ± 0.017
1.479ValGln: 1.479 ± 0.222
2.219ValArg: 2.219 ± 0.869
6.657ValSer: 6.657 ± 2.607
2.959ValThr: 2.959 ± 1.704
7.396ValVal: 7.396 ± 4.328
2.219ValTrp: 2.219 ± 0.204
2.219ValTyr: 2.219 ± 1.278
0.0ValXaa: 0.0 ± 0.0
Trp
1.479TrpAla: 1.479 ± 0.852
0.0TrpCys: 0.0 ± 0.0
0.74TrpAsp: 0.74 ± 0.426
0.74TrpGlu: 0.74 ± 0.426
0.74TrpPhe: 0.74 ± 0.426
0.74TrpGly: 0.74 ± 0.647
0.74TrpHis: 0.74 ± 0.647
0.74TrpIle: 0.74 ± 0.647
0.0TrpLys: 0.0 ± 0.0
0.74TrpLeu: 0.74 ± 0.426
0.74TrpMet: 0.74 ± 0.647
1.479TrpAsn: 1.479 ± 0.852
0.74TrpPro: 0.74 ± 0.426
1.479TrpGln: 1.479 ± 0.222
0.74TrpArg: 0.74 ± 0.426
2.959TrpSer: 2.959 ± 0.63
0.0TrpThr: 0.0 ± 0.0
0.0TrpVal: 0.0 ± 0.0
0.0TrpTrp: 0.0 ± 0.0
1.479TrpTyr: 1.479 ± 0.852
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.479TyrAla: 1.479 ± 0.222
0.74TyrCys: 0.74 ± 0.647
2.219TyrAsp: 2.219 ± 0.204
2.959TyrGlu: 2.959 ± 0.443
0.0TyrPhe: 0.0 ± 0.0
3.698TyrGly: 3.698 ± 1.056
0.0TyrHis: 0.0 ± 0.0
0.0TyrIle: 0.0 ± 0.0
2.219TyrLys: 2.219 ± 0.204
3.698TyrLeu: 3.698 ± 0.017
0.0TyrMet: 0.0 ± 0.0
0.74TyrAsn: 0.74 ± 0.426
0.74TyrPro: 0.74 ± 0.426
0.0TyrGln: 0.0 ± 0.0
2.219TyrArg: 2.219 ± 0.869
4.438TyrSer: 4.438 ± 0.409
2.219TyrThr: 2.219 ± 1.278
2.219TyrVal: 2.219 ± 0.204
0.74TyrTrp: 0.74 ± 0.647
1.479TyrTyr: 1.479 ± 0.852
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1353 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
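A protein-level bootstrap can be sketched as follows: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of those estimates is taken as the error. The function names, the fixed seed, and the use of the standard deviation as the reported error are assumptions for illustration; the database's exact procedure is not described here.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Per mille frequency of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_replicates=100, seed=0):
    """Spread of the frequency over protein-level bootstrap replicates.

    Assumption: the error is the standard deviation of the replicate
    estimates; a different summary statistic may be used in practice.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    estimates = []
    for _ in range(n_replicates):
        # Resample whole proteins with replacement, keeping the set size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(sample, pair))
    return statistics.pstdev(estimates)


# Toy sequences, not the real proteome.
toy = ["MKKLASSLKK", "AAGGSSLLVV", "MSLKRSVAAL"]
print(bootstrap_error(toy, "KK"))
```

With only two proteins, as in this proteome, each replicate can take just a few distinct compositions, so the resulting error estimates are necessarily coarse.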

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski