Amino acid dipeptide frequency for Beihai weivirus-like virus 12

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present with a frequency of 0.25% (1/400). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
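The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the code used by the database: the function name `dipeptide_frequencies`, the one-letter alphabet constant, and the toy sequences are assumptions introduced here, and the sketch assumes that dipeptides are counted as overlapping windows that never span two proteins, which the page does not state.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # 20 standard residues, one-letter codes


def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} over all given sequences."""
    counts = Counter()
    for seq in proteins:
        # Overlapping windows of length 2 (assumption: no pairs across proteins).
        for first, second in zip(seq, seq[1:]):
            # Map anything outside the 20 standard residues to X (Xaa).
            first = first if first in AMINO_ACIDS else "X"
            second = second if second in AMINO_ACIDS else "X"
            counts[first + second] += 1
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    # Report all 441 combinations, including those that never occur.
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Uniform expectation over the 400 standard dipeptides: 0.25% = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

# Toy sequences, invented for illustration only.
freqs = dipeptide_frequencies(["MAALAV", "MKAAX"])
over = sorted(dp for dp, f in freqs.items() if f > EXPECTED_PER_MILLE)
under = sorted(dp for dp, f in freqs.items() if f < EXPECTED_PER_MILLE)
```

Here `over` and `under` correspond to the dipeptides that would be marked in red and blue, respectively. With only a handful of residues, nearly every observed dipeptide exceeds 2.5‰, so the comparison is only meaningful for reasonably large proteomes.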

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 14.48 ± 1.586
AlaCys: 0.852 ± 0.468
AlaAsp: 3.407 ± 2.901
AlaGlu: 3.407 ± 1.309
AlaPhe: 1.704 ± 0.655
AlaGly: 6.814 ± 0.564
AlaHis: 2.555 ± 0.186
AlaIle: 6.814 ± 1.027
AlaLys: 1.704 ± 0.937
AlaLeu: 10.221 ± 3.928
AlaMet: 3.407 ± 1.894
AlaAsn: 2.555 ± 0.186
AlaPro: 7.666 ± 3.742
AlaGln: 1.704 ± 0.937
AlaArg: 9.37 ± 1.214
AlaSer: 5.111 ± 1.218
AlaThr: 4.259 ± 0.841
AlaVal: 8.518 ± 0.091
AlaTrp: 2.555 ± 1.405
AlaTyr: 3.407 ± 1.309
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.852 ± 0.468
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 0.852 ± 0.468
CysPhe: 0.852 ± 0.468
CysGly: 0.0 ± 0.0
CysHis: 2.555 ± 0.186
CysIle: 0.0 ± 0.0
CysLys: 0.852 ± 0.468
CysLeu: 1.704 ± 0.937
CysMet: 0.852 ± 0.468
CysAsn: 0.0 ± 0.0
CysPro: 1.704 ± 0.937
CysGln: 0.0 ± 0.0
CysArg: 1.704 ± 0.937
CysSer: 0.852 ± 1.123
CysThr: 1.704 ± 0.937
CysVal: 0.852 ± 0.468
CysTrp: 0.852 ± 0.468
CysTyr: 0.852 ± 0.468
CysXaa: 0.0 ± 0.0
Asp
AspAla: 7.666 ± 0.559
AspCys: 0.0 ± 0.0
AspAsp: 5.111 ± 2.81
AspGlu: 5.963 ± 1.687
AspPhe: 2.555 ± 1.778
AspGly: 5.111 ± 1.218
AspHis: 0.0 ± 0.0
AspIle: 3.407 ± 1.309
AspLys: 0.852 ± 0.468
AspLeu: 7.666 ± 0.559
AspMet: 0.0 ± 0.0
AspAsn: 1.704 ± 0.655
AspPro: 2.555 ± 0.186
AspGln: 1.704 ± 0.655
AspArg: 3.407 ± 1.873
AspSer: 3.407 ± 1.873
AspThr: 3.407 ± 1.309
AspVal: 5.963 ± 0.096
AspTrp: 0.852 ± 0.468
AspTyr: 0.852 ± 0.468
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.111 ± 2.81
GluCys: 0.852 ± 0.468
GluAsp: 2.555 ± 1.778
GluGlu: 1.704 ± 0.937
GluPhe: 0.852 ± 0.468
GluGly: 3.407 ± 1.309
GluHis: 1.704 ± 0.937
GluIle: 1.704 ± 0.937
GluLys: 0.852 ± 0.468
GluLeu: 2.555 ± 1.405
GluMet: 0.852 ± 0.468
GluAsn: 2.555 ± 1.405
GluPro: 1.704 ± 0.937
GluGln: 5.111 ± 2.81
GluArg: 3.407 ± 1.873
GluSer: 5.963 ± 0.096
GluThr: 2.555 ± 0.186
GluVal: 4.259 ± 2.341
GluTrp: 0.852 ± 1.123
GluTyr: 0.0 ± 0.0
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.555 ± 0.186
PheCys: 1.704 ± 0.937
PheAsp: 3.407 ± 0.282
PheGlu: 2.555 ± 1.778
PhePhe: 3.407 ± 0.282
PheGly: 2.555 ± 0.186
PheHis: 0.852 ± 0.468
PheIle: 0.852 ± 0.468
PheLys: 2.555 ± 0.186
PheLeu: 4.259 ± 0.841
PheMet: 0.852 ± 0.37
PheAsn: 0.852 ± 0.468
PhePro: 2.555 ± 0.186
PheGln: 0.0 ± 0.0
PheArg: 0.852 ± 0.468
PheSer: 2.555 ± 1.405
PheThr: 5.111 ± 1.218
PheVal: 4.259 ± 0.841
PheTrp: 0.852 ± 0.468
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.666 ± 2.15
GlyCys: 0.852 ± 0.468
GlyAsp: 7.666 ± 2.623
GlyGlu: 0.852 ± 0.468
GlyPhe: 2.555 ± 0.186
GlyGly: 3.407 ± 2.901
GlyHis: 2.555 ± 1.778
GlyIle: 1.704 ± 2.246
GlyLys: 4.259 ± 0.841
GlyLeu: 3.407 ± 1.309
GlyMet: 0.852 ± 0.468
GlyAsn: 4.259 ± 0.841
GlyPro: 0.852 ± 1.123
GlyGln: 3.407 ± 0.282
GlyArg: 3.407 ± 0.282
GlySer: 3.407 ± 0.282
GlyThr: 5.963 ± 1.496
GlyVal: 6.814 ± 1.027
GlyTrp: 1.704 ± 0.937
GlyTyr: 2.555 ± 0.186
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.852 ± 1.123
HisCys: 0.0 ± 0.0
HisAsp: 0.852 ± 0.468
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 2.555 ± 0.186
HisHis: 0.0 ± 0.0
HisIle: 0.0 ± 0.0
HisLys: 0.0 ± 0.0
HisLeu: 3.407 ± 0.282
HisMet: 1.704 ± 0.937
HisAsn: 0.852 ± 0.468
HisPro: 0.852 ± 1.123
HisGln: 0.852 ± 0.468
HisArg: 4.259 ± 2.341
HisSer: 0.852 ± 0.468
HisThr: 0.852 ± 1.123
HisVal: 3.407 ± 0.282
HisTrp: 0.852 ± 0.468
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 11.073 ± 0.277
IleCys: 0.852 ± 0.468
IleAsp: 0.852 ± 1.123
IleGlu: 1.704 ± 0.655
IlePhe: 3.407 ± 0.282
IleGly: 0.852 ± 1.123
IleHis: 0.0 ± 0.0
IleIle: 2.555 ± 0.186
IleLys: 4.259 ± 0.841
IleLeu: 1.704 ± 0.937
IleMet: 1.704 ± 0.655
IleAsn: 2.555 ± 1.778
IlePro: 0.852 ± 1.123
IleGln: 0.0 ± 0.0
IleArg: 3.407 ± 0.282
IleSer: 1.704 ± 0.937
IleThr: 3.407 ± 1.873
IleVal: 0.852 ± 0.468
IleTrp: 0.0 ± 0.0
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 1.704 ± 0.655
LysCys: 0.852 ± 0.468
LysAsp: 2.555 ± 1.405
LysGlu: 0.852 ± 0.468
LysPhe: 4.259 ± 0.75
LysGly: 4.259 ± 0.75
LysHis: 0.0 ± 0.0
LysIle: 0.852 ± 0.468
LysLys: 3.407 ± 2.901
LysLeu: 5.111 ± 1.218
LysMet: 1.704 ± 0.937
LysAsn: 0.852 ± 0.468
LysPro: 1.704 ± 0.937
LysGln: 1.704 ± 2.246
LysArg: 2.555 ± 1.405
LysSer: 4.259 ± 0.841
LysThr: 3.407 ± 0.282
LysVal: 2.555 ± 0.186
LysTrp: 0.852 ± 0.468
LysTyr: 1.704 ± 0.655
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.814 ± 4.21
LeuCys: 2.555 ± 0.186
LeuAsp: 4.259 ± 0.841
LeuGlu: 5.963 ± 3.278
LeuPhe: 1.704 ± 0.937
LeuGly: 7.666 ± 2.623
LeuHis: 2.555 ± 0.186
LeuIle: 2.555 ± 1.405
LeuLys: 1.704 ± 2.246
LeuLeu: 3.407 ± 0.282
LeuMet: 4.259 ± 0.75
LeuAsn: 2.555 ± 1.778
LeuPro: 5.111 ± 1.218
LeuGln: 1.704 ± 2.246
LeuArg: 9.37 ± 1.969
LeuSer: 9.37 ± 1.214
LeuThr: 4.259 ± 2.341
LeuVal: 3.407 ± 1.309
LeuTrp: 1.704 ± 0.937
LeuTyr: 3.407 ± 1.309
LeuXaa: 0.0 ± 0.0
Met
MetAla: 2.555 ± 1.405
MetCys: 0.0 ± 0.0
MetAsp: 0.852 ± 1.123
MetGlu: 1.704 ± 0.937
MetPhe: 1.704 ± 0.655
MetGly: 0.852 ± 1.123
MetHis: 0.852 ± 0.468
MetIle: 2.555 ± 0.186
MetLys: 1.704 ± 0.655
MetLeu: 0.852 ± 0.468
MetMet: 0.852 ± 0.468
MetAsn: 0.852 ± 1.123
MetPro: 0.852 ± 1.123
MetGln: 1.704 ± 0.937
MetArg: 1.704 ± 0.655
MetSer: 0.852 ± 0.468
MetThr: 4.259 ± 0.75
MetVal: 3.407 ± 1.309
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.259 ± 4.023
AsnCys: 1.704 ± 0.937
AsnAsp: 2.555 ± 1.405
AsnGlu: 0.0 ± 0.0
AsnPhe: 0.852 ± 0.468
AsnGly: 3.407 ± 2.901
AsnHis: 0.852 ± 0.468
AsnIle: 2.555 ± 0.186
AsnLys: 0.852 ± 0.468
AsnLeu: 2.555 ± 1.778
AsnMet: 0.852 ± 0.468
AsnAsn: 0.852 ± 1.123
AsnPro: 3.407 ± 1.309
AsnGln: 1.704 ± 0.655
AsnArg: 0.852 ± 0.468
AsnSer: 2.555 ± 1.778
AsnThr: 1.704 ± 0.655
AsnVal: 2.555 ± 1.405
AsnTrp: 0.0 ± 0.0
AsnTyr: 0.0 ± 0.0
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 4.259 ± 4.023
ProCys: 0.852 ± 1.123
ProAsp: 5.111 ± 0.373
ProGlu: 2.555 ± 1.405
ProPhe: 1.704 ± 0.655
ProGly: 4.259 ± 0.75
ProHis: 0.0 ± 0.0
ProIle: 0.852 ± 1.123
ProLys: 0.852 ± 0.468
ProLeu: 2.555 ± 1.778
ProMet: 0.852 ± 0.468
ProAsn: 0.852 ± 1.123
ProPro: 1.704 ± 0.937
ProGln: 0.0 ± 0.0
ProArg: 8.518 ± 1.682
ProSer: 5.111 ± 1.218
ProThr: 5.111 ± 1.218
ProVal: 2.555 ± 1.405
ProTrp: 0.0 ± 0.0
ProTyr: 0.0 ± 0.0
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.259 ± 0.841
GlnCys: 0.0 ± 0.0
GlnAsp: 0.0 ± 0.0
GlnGlu: 1.704 ± 0.937
GlnPhe: 2.555 ± 0.186
GlnGly: 1.704 ± 2.246
GlnHis: 1.704 ± 0.937
GlnIle: 1.704 ± 0.655
GlnLys: 0.852 ± 0.468
GlnLeu: 1.704 ± 0.937
GlnMet: 0.852 ± 0.468
GlnAsn: 0.852 ± 1.123
GlnPro: 1.704 ± 0.937
GlnGln: 0.0 ± 0.0
GlnArg: 2.555 ± 0.186
GlnSer: 0.852 ± 1.123
GlnThr: 0.852 ± 0.468
GlnVal: 3.407 ± 0.282
GlnTrp: 0.852 ± 0.468
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 10.221 ± 0.745
ArgCys: 0.0 ± 0.0
ArgAsp: 3.407 ± 0.282
ArgGlu: 3.407 ± 1.873
ArgPhe: 5.111 ± 1.218
ArgGly: 5.963 ± 1.496
ArgHis: 0.852 ± 0.468
ArgIle: 2.555 ± 0.186
ArgLys: 4.259 ± 2.341
ArgLeu: 9.37 ± 1.969
ArgMet: 2.555 ± 1.778
ArgAsn: 2.555 ± 0.186
ArgPro: 3.407 ± 0.282
ArgGln: 2.555 ± 1.405
ArgArg: 5.963 ± 1.496
ArgSer: 5.111 ± 2.81
ArgThr: 4.259 ± 4.023
ArgVal: 5.111 ± 1.218
ArgTrp: 3.407 ± 1.873
ArgTyr: 1.704 ± 0.937
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.259 ± 0.841
SerCys: 0.852 ± 0.468
SerAsp: 2.555 ± 1.778
SerGlu: 5.111 ± 2.81
SerPhe: 0.852 ± 0.468
SerGly: 4.259 ± 0.841
SerHis: 1.704 ± 0.937
SerIle: 2.555 ± 1.405
SerLys: 7.666 ± 0.559
SerLeu: 5.963 ± 1.496
SerMet: 0.0 ± 0.0
SerAsn: 3.407 ± 0.282
SerPro: 0.0 ± 0.0
SerGln: 1.704 ± 0.655
SerArg: 4.259 ± 2.341
SerSer: 2.555 ± 1.778
SerThr: 5.111 ± 0.373
SerVal: 7.666 ± 1.032
SerTrp: 0.852 ± 0.468
SerTyr: 1.704 ± 0.655
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.555 ± 0.186
ThrCys: 0.852 ± 0.468
ThrAsp: 6.814 ± 2.155
ThrGlu: 6.814 ± 0.564
ThrPhe: 3.407 ± 0.282
ThrGly: 4.259 ± 0.75
ThrHis: 1.704 ± 0.655
ThrIle: 5.111 ± 0.373
ThrLys: 1.704 ± 0.937
ThrLeu: 6.814 ± 3.746
ThrMet: 1.704 ± 2.246
ThrAsn: 0.0 ± 0.0
ThrPro: 4.259 ± 0.75
ThrGln: 0.0 ± 0.0
ThrArg: 6.814 ± 1.027
ThrSer: 1.704 ± 0.937
ThrThr: 3.407 ± 1.309
ThrVal: 4.259 ± 2.432
ThrTrp: 3.407 ± 4.492
ThrTyr: 1.704 ± 0.937
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.814 ± 0.564
ValCys: 1.704 ± 0.937
ValAsp: 5.963 ± 1.687
ValGlu: 2.555 ± 1.405
ValPhe: 4.259 ± 0.75
ValGly: 4.259 ± 0.841
ValHis: 1.704 ± 0.937
ValIle: 0.852 ± 0.468
ValLys: 3.407 ± 1.873
ValLeu: 9.37 ± 0.377
ValMet: 3.407 ± 1.309
ValAsn: 0.852 ± 1.123
ValPro: 6.814 ± 1.027
ValGln: 1.704 ± 2.246
ValArg: 7.666 ± 1.032
ValSer: 3.407 ± 0.282
ValThr: 3.407 ± 1.309
ValVal: 6.814 ± 0.564
ValTrp: 0.852 ± 0.468
ValTyr: 1.704 ± 0.655
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.852 ± 0.468
TrpCys: 1.704 ± 0.937
TrpAsp: 3.407 ± 0.282
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.852 ± 0.468
TrpGly: 0.0 ± 0.0
TrpHis: 0.0 ± 0.0
TrpIle: 1.704 ± 0.937
TrpLys: 1.704 ± 0.937
TrpLeu: 1.704 ± 0.655
TrpMet: 0.0 ± 0.0
TrpAsn: 0.852 ± 0.468
TrpPro: 0.0 ± 0.0
TrpGln: 1.704 ± 0.937
TrpArg: 1.704 ± 2.246
TrpSer: 0.852 ± 1.123
TrpThr: 2.555 ± 1.405
TrpVal: 0.852 ± 0.468
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.0 ± 0.0
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.555 ± 0.186
TyrCys: 0.852 ± 0.468
TyrAsp: 0.852 ± 0.468
TyrGlu: 0.852 ± 0.468
TyrPhe: 0.0 ± 0.0
TyrGly: 2.555 ± 1.778
TyrHis: 0.0 ± 0.0
TyrIle: 0.852 ± 1.123
TyrLys: 1.704 ± 0.937
TyrLeu: 0.0 ± 0.0
TyrMet: 0.0 ± 0.0
TyrAsn: 4.259 ± 0.841
TyrPro: 0.0 ± 0.0
TyrGln: 0.852 ± 0.468
TyrArg: 0.0 ± 0.0
TyrSer: 2.555 ± 1.778
TyrThr: 1.704 ± 0.937
TyrVal: 0.0 ± 0.0
TyrTrp: 0.0 ± 0.0
TyrTyr: 0.0 ± 0.0
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (1175 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
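A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the database's actual procedure: the helper names `dipeptide_per_mille` and `bootstrap_error`, the fixed random seed, the toy sequences, and the choice of the standard deviation across replicates as the reported error are all assumptions made here.

```python
import random
import statistics
from collections import Counter


def dipeptide_per_mille(proteins, dipeptide):
    """Frequency of one dipeptide (one-letter codes) in per mille."""
    counts = Counter()
    for seq in proteins:
        # Count overlapping pairs of adjacent residues within each protein.
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[dipeptide] / total if total else 0.0


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Draw as many whole proteins as the proteome has, with replacement.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(dipeptide_per_mille(sample, dipeptide))
    return statistics.pstdev(estimates)


# Toy sequences, invented for illustration only.
toy_proteome = ["MAALAVAA", "MKRLSKRL"]
value = dipeptide_per_mille(toy_proteome, "AA")
error = bootstrap_error(toy_proteome, "AA")
print(f"AA: {value:.3f} ± {error:.3f}")
```

Because whole proteins are resampled rather than individual residues, a proteome with only two proteins yields just a few distinct replicates, so the resulting error estimates are coarse.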

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski