Amino acid dipeptide frequency for Wenzhou toti-like virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
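A minimal sketch of how such frequencies could be computed, assuming overlapping residue pairs are counted within each protein and never across protein boundaries (the page does not state the counting rule). The function name dipeptide_frequencies and the toy sequences are hypothetical, and one-letter codes are used instead of the three-letter codes shown in the table.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only.
    """
    counts = Counter()
    for seq in proteins:
        # zip pairs each residue with its successor: "MKV" -> "MK", "KV"
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input, not the real proteome
freqs = dipeptide_frequencies(["MKVLA", "MKA"])
```

Under this counting rule, a proteome with 2 proteins and 2578 amino acids contains 2576 dipeptides, so a dipeptide seen once would have a frequency of about 0.388‰.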

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
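The expected value and the colour rule can be sketched as follows. This assumes the threshold is the uniform expectation (1/400, or 1/441 with Xaa); the exact rule used by the page is not stated, and the function names are hypothetical.

```python
def expected_per_mille(alphabet_size=20):
    """Uniform expectation for one dipeptide, in per mille."""
    return 1000.0 / alphabet_size ** 2

def classify(value_per_mille, alphabet_size=20):
    """Label a table value relative to the uniform expectation."""
    expected = expected_per_mille(alphabet_size)
    if value_per_mille > expected:
        return "over"   # assumed to correspond to red on the page
    if value_per_mille < expected:
        return "under"  # assumed to correspond to blue on the page
    return "expected"

print(expected_per_mille())  # 2.5
print(classify(8.925))       # AlaAla value from the table
print(classify(0.388))       # CysAla value from the table
```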

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
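As a small worked example of the unit conversion, the sketch below turns a per mille value into a fraction, a percentage, and an approximate dipeptide count. The count assumes a denominator of 2576 dipeptides (2578 amino acids minus one per protein for 2 proteins), which is an inference from the statistics line and not something the page states. The helper names are hypothetical.

```python
def per_mille_to_fraction(value):
    """Convert per mille to a plain fraction."""
    return value * 1e-3

def per_mille_to_percent(value):
    """Convert per mille to percent."""
    return value * 0.1

TOTAL_DIPEPTIDES = 2578 - 2  # assumption: one pair fewer than residues per protein

leu_pro = 10.865  # LeuPro value from the table, in per mille
fraction = per_mille_to_fraction(leu_pro)
approx_count = round(fraction * TOTAL_DIPEPTIDES)
```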

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
8.925AlaAla: 8.925 ± 0.505
2.716AlaCys: 2.716 ± 0.41
6.209AlaAsp: 6.209 ± 0.095
3.492AlaGlu: 3.492 ± 2.08
3.88AlaPhe: 3.88 ± 0.501
5.821AlaGly: 5.821 ± 1.046
1.164AlaHis: 1.164 ± 0.091
4.269AlaIle: 4.269 ± 1.905
2.328AlaLys: 2.328 ± 0.994
8.925AlaLeu: 8.925 ± 2.86
1.94AlaMet: 1.94 ± 1.221
3.104AlaAsn: 3.104 ± 0.636
5.821AlaPro: 5.821 ± 1.635
4.269AlaGln: 4.269 ± 1.317
8.537AlaArg: 8.537 ± 0.31
6.597AlaSer: 6.597 ± 0.322
3.88AlaThr: 3.88 ± 0.501
5.433AlaVal: 5.433 ± 0.947
1.94AlaTrp: 1.94 ± 0.044
2.328AlaTyr: 2.328 ± 0.994
0.0AlaXaa: 0.0 ± 0.0
Cys
0.388CysAla: 0.388 ± 0.227
0.388CysCys: 0.388 ± 0.227
1.164CysAsp: 1.164 ± 0.497
0.776CysGlu: 0.776 ± 0.135
0.0CysPhe: 0.0 ± 0.0
1.552CysGly: 1.552 ± 0.318
0.0CysHis: 0.0 ± 0.0
0.776CysIle: 0.776 ± 0.135
0.0CysLys: 0.0 ± 0.0
1.552CysLeu: 1.552 ± 0.907
0.0CysMet: 0.0 ± 0.0
0.776CysAsn: 0.776 ± 0.135
0.388CysPro: 0.388 ± 0.227
0.388CysGln: 0.388 ± 0.227
0.776CysArg: 0.776 ± 0.135
0.776CysSer: 0.776 ± 0.453
1.164CysThr: 1.164 ± 0.68
1.164CysVal: 1.164 ± 0.091
0.0CysTrp: 0.0 ± 0.0
1.164CysTyr: 1.164 ± 0.497
0.0CysXaa: 0.0 ± 0.0
Asp
5.821AspAla: 5.821 ± 0.457
1.164AspCys: 1.164 ± 0.497
5.433AspAsp: 5.433 ± 0.231
3.88AspGlu: 3.88 ± 1.854
2.716AspPhe: 2.716 ± 0.768
3.88AspGly: 3.88 ± 0.501
1.94AspHis: 1.94 ± 0.632
1.94AspIle: 1.94 ± 0.044
1.164AspLys: 1.164 ± 0.68
5.045AspLeu: 5.045 ± 1.173
1.552AspMet: 1.552 ± 0.27
3.104AspAsn: 3.104 ± 1.814
5.821AspPro: 5.821 ± 0.131
3.492AspGln: 3.492 ± 0.863
1.552AspArg: 1.552 ± 0.27
2.716AspSer: 2.716 ± 0.998
3.88AspThr: 3.88 ± 0.088
3.104AspVal: 3.104 ± 0.048
0.388AspTrp: 0.388 ± 0.362
2.328AspTyr: 2.328 ± 0.994
0.0AspXaa: 0.0 ± 0.0
Glu
5.433GluAla: 5.433 ± 2.124
0.776GluCys: 0.776 ± 0.453
2.328GluAsp: 2.328 ± 0.406
0.776GluGlu: 0.776 ± 0.724
1.164GluPhe: 1.164 ± 0.497
3.492GluGly: 3.492 ± 0.274
0.388GluHis: 0.388 ± 0.227
2.328GluIle: 2.328 ± 0.183
2.328GluLys: 2.328 ± 1.583
5.433GluLeu: 5.433 ± 1.535
2.716GluMet: 2.716 ± 0.179
0.388GluAsn: 0.388 ± 0.362
2.328GluPro: 2.328 ± 0.406
0.776GluGln: 0.776 ± 0.135
1.94GluArg: 1.94 ± 0.632
1.94GluSer: 1.94 ± 0.044
1.94GluThr: 1.94 ± 0.545
1.94GluVal: 1.94 ± 1.221
0.776GluTrp: 0.776 ± 0.135
1.94GluTyr: 1.94 ± 1.221
0.0GluXaa: 0.0 ± 0.0
Phe
3.492PheAla: 3.492 ± 0.274
1.552PheCys: 1.552 ± 0.27
1.552PheAsp: 1.552 ± 0.859
1.552PheGlu: 1.552 ± 1.448
1.164PhePhe: 1.164 ± 0.091
3.492PheGly: 3.492 ± 0.863
0.388PheHis: 0.388 ± 0.227
1.552PheIle: 1.552 ± 0.907
2.328PheLys: 2.328 ± 1.583
2.328PheLeu: 2.328 ± 0.994
0.388PheMet: 0.388 ± 0.362
0.388PheAsn: 0.388 ± 0.227
3.104PhePro: 3.104 ± 0.048
0.776PheGln: 0.776 ± 0.135
1.94PheArg: 1.94 ± 0.044
2.328PheSer: 2.328 ± 0.772
1.552PheThr: 1.552 ± 0.27
3.492PheVal: 3.492 ± 0.314
0.776PheTrp: 0.776 ± 0.135
0.776PheTyr: 0.776 ± 0.135
0.0PheXaa: 0.0 ± 0.0
Gly
6.597GlyAla: 6.597 ± 2.677
0.388GlyCys: 0.388 ± 0.362
4.657GlyAsp: 4.657 ± 0.366
3.492GlyGlu: 3.492 ± 0.903
2.328GlyPhe: 2.328 ± 0.406
5.433GlyGly: 5.433 ± 0.358
2.716GlyHis: 2.716 ± 0.179
3.88GlyIle: 3.88 ± 0.088
1.94GlyLys: 1.94 ± 0.632
4.269GlyLeu: 4.269 ± 2.216
0.776GlyMet: 0.776 ± 0.453
2.328GlyAsn: 2.328 ± 0.183
3.88GlyPro: 3.88 ± 1.679
2.328GlyGln: 2.328 ± 0.183
3.88GlyArg: 3.88 ± 0.676
7.761GlySer: 7.761 ± 2.18
5.433GlyThr: 5.433 ± 0.231
3.104GlyVal: 3.104 ± 0.048
1.552GlyTrp: 1.552 ± 0.27
3.104GlyTyr: 3.104 ± 0.541
0.0GlyXaa: 0.0 ± 0.0
His
1.164HisAla: 1.164 ± 0.68
0.0HisCys: 0.0 ± 0.0
2.328HisAsp: 2.328 ± 0.183
0.776HisGlu: 0.776 ± 0.135
0.776HisPhe: 0.776 ± 0.135
1.94HisGly: 1.94 ± 0.044
0.0HisHis: 0.0 ± 0.0
0.388HisIle: 0.388 ± 0.227
0.388HisLys: 0.388 ± 0.362
2.328HisLeu: 2.328 ± 0.406
0.388HisMet: 0.388 ± 0.227
0.776HisAsn: 0.776 ± 0.135
1.164HisPro: 1.164 ± 0.497
1.164HisGln: 1.164 ± 0.091
1.164HisArg: 1.164 ± 0.091
1.552HisSer: 1.552 ± 0.907
0.388HisThr: 0.388 ± 0.227
0.776HisVal: 0.776 ± 0.724
0.0HisTrp: 0.0 ± 0.0
0.776HisTyr: 0.776 ± 0.453
0.0HisXaa: 0.0 ± 0.0
Ile
2.716IleAla: 2.716 ± 0.179
0.388IleCys: 0.388 ± 0.227
3.492IleAsp: 3.492 ± 0.274
2.328IleGlu: 2.328 ± 0.183
3.104IlePhe: 3.104 ± 1.225
3.104IleGly: 3.104 ± 0.636
0.776IleHis: 0.776 ± 0.135
2.328IleIle: 2.328 ± 0.772
1.164IleLys: 1.164 ± 0.68
3.492IleLeu: 3.492 ± 0.274
0.776IleMet: 0.776 ± 0.453
1.164IleAsn: 1.164 ± 0.091
3.88IlePro: 3.88 ± 0.088
1.164IleGln: 1.164 ± 0.68
1.552IleArg: 1.552 ± 0.859
2.328IleSer: 2.328 ± 0.772
2.328IleThr: 2.328 ± 0.994
1.552IleVal: 1.552 ± 0.859
0.388IleTrp: 0.388 ± 0.362
3.492IleTyr: 3.492 ± 0.274
0.0IleXaa: 0.0 ± 0.0
Lys
2.716LysAla: 2.716 ± 0.179
0.0LysCys: 0.0 ± 0.0
0.776LysAsp: 0.776 ± 0.135
0.388LysGlu: 0.388 ± 0.227
1.552LysPhe: 1.552 ± 0.859
2.716LysGly: 2.716 ± 2.534
0.388LysHis: 0.388 ± 0.362
1.552LysIle: 1.552 ± 0.859
1.552LysLys: 1.552 ± 0.27
4.269LysLeu: 4.269 ± 1.627
0.0LysMet: 0.0 ± 0.0
1.164LysAsn: 1.164 ± 0.497
1.164LysPro: 1.164 ± 0.497
1.164LysGln: 1.164 ± 0.091
2.716LysArg: 2.716 ± 0.768
1.552LysSer: 1.552 ± 0.318
3.88LysThr: 3.88 ± 1.854
3.492LysVal: 3.492 ± 1.492
0.776LysTrp: 0.776 ± 0.135
0.388LysTyr: 0.388 ± 0.227
0.0LysXaa: 0.0 ± 0.0
Leu
8.537LeuAla: 8.537 ± 1.488
1.164LeuCys: 1.164 ± 0.091
5.433LeuAsp: 5.433 ± 0.947
4.269LeuGlu: 4.269 ± 0.139
4.269LeuPhe: 4.269 ± 1.038
6.209LeuGly: 6.209 ± 0.493
0.776LeuHis: 0.776 ± 0.453
3.88LeuIle: 3.88 ± 0.088
3.492LeuLys: 3.492 ± 2.08
7.373LeuLeu: 7.373 ± 1.579
2.328LeuMet: 2.328 ± 0.183
4.269LeuAsn: 4.269 ± 1.627
10.865LeuPro: 10.865 ± 1.639
3.104LeuGln: 3.104 ± 1.225
5.433LeuArg: 5.433 ± 0.358
7.761LeuSer: 7.761 ± 1.002
6.209LeuThr: 6.209 ± 0.095
5.433LeuVal: 5.433 ± 0.231
2.716LeuTrp: 2.716 ± 1.356
3.104LeuTyr: 3.104 ± 0.048
0.0LeuXaa: 0.0 ± 0.0
Met
3.104MetAla: 3.104 ± 0.541
0.0MetCys: 0.0 ± 0.0
0.388MetAsp: 0.388 ± 0.227
0.388MetGlu: 0.388 ± 0.362
0.776MetPhe: 0.776 ± 0.724
0.388MetGly: 0.388 ± 0.362
0.388MetHis: 0.388 ± 0.227
0.388MetIle: 0.388 ± 0.362
1.164MetLys: 1.164 ± 0.091
1.164MetLeu: 1.164 ± 0.091
0.0MetMet: 0.0 ± 0.0
1.164MetAsn: 1.164 ± 0.68
1.94MetPro: 1.94 ± 1.134
0.776MetGln: 0.776 ± 0.453
1.164MetArg: 1.164 ± 0.091
1.94MetSer: 1.94 ± 0.545
3.88MetThr: 3.88 ± 1.854
0.388MetVal: 0.388 ± 0.362
0.388MetTrp: 0.388 ± 0.362
0.776MetTyr: 0.776 ± 0.453
0.0MetXaa: 0.0 ± 0.0
Asn
3.104AsnAla: 3.104 ± 0.636
0.388AsnCys: 0.388 ± 0.227
2.328AsnAsp: 2.328 ± 0.406
1.94AsnGlu: 1.94 ± 0.632
0.776AsnPhe: 0.776 ± 0.453
2.328AsnGly: 2.328 ± 1.36
0.776AsnHis: 0.776 ± 0.453
0.776AsnIle: 0.776 ± 0.453
1.552AsnLys: 1.552 ± 0.859
3.492AsnLeu: 3.492 ± 0.314
1.164AsnMet: 1.164 ± 0.091
1.164AsnAsn: 1.164 ± 0.091
2.328AsnPro: 2.328 ± 0.772
1.552AsnGln: 1.552 ± 0.318
2.716AsnArg: 2.716 ± 0.768
2.716AsnSer: 2.716 ± 0.998
3.104AsnThr: 3.104 ± 0.636
1.164AsnVal: 1.164 ± 0.68
1.94AsnTrp: 1.94 ± 0.044
0.388AsnTyr: 0.388 ± 0.362
0.0AsnXaa: 0.0 ± 0.0
Pro
6.985ProAla: 6.985 ± 1.138
0.776ProCys: 0.776 ± 0.453
5.433ProAsp: 5.433 ± 0.231
1.164ProGlu: 1.164 ± 0.091
1.552ProPhe: 1.552 ± 0.318
5.045ProGly: 5.045 ± 1.181
0.388ProHis: 0.388 ± 0.227
3.104ProIle: 3.104 ± 0.541
1.164ProLys: 1.164 ± 0.497
6.597ProLeu: 6.597 ± 0.322
2.328ProMet: 2.328 ± 0.672
1.94ProAsn: 1.94 ± 0.545
12.418ProPro: 12.418 ± 5.49
5.433ProGln: 5.433 ± 1.997
5.821ProArg: 5.821 ± 0.131
11.253ProSer: 11.253 ± 0.688
5.821ProThr: 5.821 ± 1.635
5.045ProVal: 5.045 ± 1.181
2.328ProTrp: 2.328 ± 1.583
2.328ProTyr: 2.328 ± 1.36
0.0ProXaa: 0.0 ± 0.0
Gln
5.045GlnAla: 5.045 ± 2.359
0.0GlnCys: 0.0 ± 0.0
2.328GlnAsp: 2.328 ± 1.36
1.552GlnGlu: 1.552 ± 0.907
1.164GlnPhe: 1.164 ± 0.497
2.716GlnGly: 2.716 ± 0.41
1.552GlnHis: 1.552 ± 0.859
0.776GlnIle: 0.776 ± 0.453
1.552GlnLys: 1.552 ± 0.27
2.328GlnLeu: 2.328 ± 0.183
0.776GlnMet: 0.776 ± 0.135
0.776GlnAsn: 0.776 ± 0.453
2.716GlnPro: 2.716 ± 0.998
3.104GlnGln: 3.104 ± 1.225
3.104GlnArg: 3.104 ± 1.225
3.88GlnSer: 3.88 ± 0.501
1.94GlnThr: 1.94 ± 0.044
2.716GlnVal: 2.716 ± 0.41
1.164GlnTrp: 1.164 ± 0.091
1.164GlnTyr: 1.164 ± 0.091
0.0GlnXaa: 0.0 ± 0.0
Arg
4.657ArgAla: 4.657 ± 0.223
0.776ArgCys: 0.776 ± 0.135
3.104ArgAsp: 3.104 ± 0.541
3.88ArgGlu: 3.88 ± 0.676
3.492ArgPhe: 3.492 ± 2.08
3.492ArgGly: 3.492 ± 0.314
1.164ArgHis: 1.164 ± 0.68
2.328ArgIle: 2.328 ± 0.406
3.104ArgLys: 3.104 ± 0.541
5.821ArgLeu: 5.821 ± 1.897
1.552ArgMet: 1.552 ± 0.27
3.104ArgAsn: 3.104 ± 0.636
6.985ArgPro: 6.985 ± 0.04
1.94ArgGln: 1.94 ± 0.044
5.433ArgArg: 5.433 ± 2.713
3.88ArgSer: 3.88 ± 0.088
2.716ArgThr: 2.716 ± 0.179
5.045ArgVal: 5.045 ± 0.004
0.388ArgTrp: 0.388 ± 0.227
1.552ArgTyr: 1.552 ± 0.318
0.0ArgXaa: 0.0 ± 0.0
Ser
5.433SerAla: 5.433 ± 0.947
0.388SerCys: 0.388 ± 0.227
2.716SerAsp: 2.716 ± 0.998
5.045SerGlu: 5.045 ± 0.585
2.328SerPhe: 2.328 ± 0.772
6.597SerGly: 6.597 ± 0.911
1.94SerHis: 1.94 ± 0.545
2.716SerIle: 2.716 ± 0.179
2.716SerLys: 2.716 ± 0.768
11.253SerLeu: 11.253 ± 2.454
0.388SerMet: 0.388 ± 0.227
3.88SerAsn: 3.88 ± 1.09
5.821SerPro: 5.821 ± 1.635
2.716SerGln: 2.716 ± 0.998
4.269SerArg: 4.269 ± 0.728
6.209SerSer: 6.209 ± 1.862
7.761SerThr: 7.761 ± 1.591
3.104SerVal: 3.104 ± 0.048
1.94SerTrp: 1.94 ± 1.221
1.552SerTyr: 1.552 ± 0.318
0.0SerXaa: 0.0 ± 0.0
Thr
5.433ThrAla: 5.433 ± 1.408
0.388ThrCys: 0.388 ± 0.227
4.269ThrAsp: 4.269 ± 0.139
0.776ThrGlu: 0.776 ± 0.135
2.328ThrPhe: 2.328 ± 0.772
5.045ThrGly: 5.045 ± 1.173
1.164ThrHis: 1.164 ± 0.091
4.657ThrIle: 4.657 ± 0.366
1.94ThrLys: 1.94 ± 0.632
10.089ThrLeu: 10.089 ± 0.581
1.164ThrMet: 1.164 ± 0.091
3.104ThrAsn: 3.104 ± 0.541
6.597ThrPro: 6.597 ± 0.322
1.552ThrGln: 1.552 ± 0.27
5.433ThrArg: 5.433 ± 0.947
3.492ThrSer: 3.492 ± 1.452
3.88ThrThr: 3.88 ± 2.267
3.88ThrVal: 3.88 ± 0.501
1.552ThrTrp: 1.552 ± 1.448
0.776ThrTyr: 0.776 ± 0.135
0.0ThrXaa: 0.0 ± 0.0
Val
6.985ValAla: 6.985 ± 1.138
0.776ValCys: 0.776 ± 0.135
2.328ValAsp: 2.328 ± 0.406
2.328ValGlu: 2.328 ± 0.994
1.552ValPhe: 1.552 ± 0.27
2.716ValGly: 2.716 ± 0.179
1.164ValHis: 1.164 ± 0.091
1.552ValIle: 1.552 ± 0.907
1.552ValLys: 1.552 ± 1.448
5.045ValLeu: 5.045 ± 0.004
0.388ValMet: 0.388 ± 0.362
1.552ValAsn: 1.552 ± 0.318
6.209ValPro: 6.209 ± 0.095
3.88ValGln: 3.88 ± 0.501
4.269ValArg: 4.269 ± 1.038
6.597ValSer: 6.597 ± 0.267
4.269ValThr: 4.269 ± 0.45
2.716ValVal: 2.716 ± 0.41
0.776ValTrp: 0.776 ± 0.135
0.388ValTyr: 0.388 ± 0.362
0.0ValXaa: 0.0 ± 0.0
Trp
1.552TrpAla: 1.552 ± 0.27
0.388TrpCys: 0.388 ± 0.227
2.716TrpAsp: 2.716 ± 1.356
1.552TrpGlu: 1.552 ± 0.318
0.0TrpPhe: 0.0 ± 0.0
1.552TrpGly: 1.552 ± 0.859
0.388TrpHis: 0.388 ± 0.362
0.776TrpIle: 0.776 ± 0.724
0.0TrpLys: 0.0 ± 0.0
1.94TrpLeu: 1.94 ± 0.044
0.388TrpMet: 0.388 ± 0.559
0.776TrpAsn: 0.776 ± 0.135
1.552TrpPro: 1.552 ± 0.27
0.388TrpGln: 0.388 ± 0.362
1.94TrpArg: 1.94 ± 0.632
1.552TrpSer: 1.552 ± 1.448
0.776TrpThr: 0.776 ± 0.724
1.164TrpVal: 1.164 ± 0.091
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.492TyrAla: 3.492 ± 0.314
0.776TyrCys: 0.776 ± 0.453
1.94TyrAsp: 1.94 ± 0.044
0.776TyrGlu: 0.776 ± 0.724
0.0TyrPhe: 0.0 ± 0.0
2.328TyrGly: 2.328 ± 0.183
0.776TyrHis: 0.776 ± 0.135
1.552TyrIle: 1.552 ± 0.318
0.776TyrLys: 0.776 ± 0.135
3.88TyrLeu: 3.88 ± 0.676
1.164TyrMet: 1.164 ± 0.091
0.776TyrAsn: 0.776 ± 0.453
1.94TyrPro: 1.94 ± 1.134
0.388TyrGln: 0.388 ± 0.362
0.776TyrArg: 0.776 ± 0.135
1.94TyrSer: 1.94 ± 1.221
2.716TyrThr: 2.716 ± 0.179
2.328TyrVal: 2.328 ± 0.406
0.0TyrTrp: 0.0 ± 0.0
0.776TyrTyr: 0.776 ± 0.135
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2578 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
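A protein-level bootstrap could look roughly like the sketch below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of those values is reported. Using the standard deviation as the error, the fixed seed, and all function names are assumptions; the page only says that 100 bootstrap rounds were run at the protein level.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Per-mille frequency of one dipeptide (one-letter codes) in the set."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the set size
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_frequency(sample, pair))
    return statistics.pstdev(values)

# Toy input, not the real proteome
toy = ["MKVLAAGK", "MAAKLV", "MKKA"]
err = bootstrap_error(toy, "AA")
```

With only 2 proteins, as in this proteome, a resample can take just three distinct forms (either protein twice, or one of each), so the error estimates are coarse.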

You can download the dipeptide statistics above (along with other statistics for this proteome) from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski