Amino acid dipepetide frequency for Fig badnavirus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.257AlaAla: 4.257 ± 4.467
0.946AlaCys: 0.946 ± 0.452
2.365AlaAsp: 2.365 ± 1.237
5.676AlaGlu: 5.676 ± 2.668
1.892AlaPhe: 1.892 ± 0.904
1.419AlaGly: 1.419 ± 1.489
0.0AlaHis: 0.0 ± 0.0
3.311AlaIle: 3.311 ± 1.861
1.419AlaLys: 1.419 ± 0.678
3.784AlaLeu: 3.784 ± 1.053
4.73AlaMet: 4.73 ± 2.259
1.419AlaAsn: 1.419 ± 1.522
1.892AlaPro: 1.892 ± 0.904
2.365AlaGln: 2.365 ± 1.13
3.784AlaArg: 3.784 ± 1.125
3.311AlaSer: 3.311 ± 1.118
2.365AlaThr: 2.365 ± 1.237
4.73AlaVal: 4.73 ± 2.472
0.0AlaTrp: 0.0 ± 0.0
2.365AlaTyr: 2.365 ± 1.13
0.0AlaXaa: 0.0 ± 0.0
Cys
0.473CysAla: 0.473 ± 0.226
0.473CysCys: 0.473 ± 0.226
0.473CysAsp: 0.473 ± 0.226
0.946CysGlu: 0.946 ± 0.452
1.892CysPhe: 1.892 ± 0.904
0.946CysGly: 0.946 ± 0.452
0.0CysHis: 0.0 ± 0.0
0.473CysIle: 0.473 ± 0.226
1.419CysLys: 1.419 ± 0.678
0.473CysLeu: 0.473 ± 0.226
0.473CysMet: 0.473 ± 0.226
1.419CysAsn: 1.419 ± 0.678
0.946CysPro: 0.946 ± 0.452
1.419CysGln: 1.419 ± 0.678
1.419CysArg: 1.419 ± 0.678
0.946CysSer: 0.946 ± 0.452
0.473CysThr: 0.473 ± 0.226
0.946CysVal: 0.946 ± 0.452
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
3.311AspAla: 3.311 ± 1.582
1.419AspCys: 1.419 ± 0.678
1.419AspAsp: 1.419 ± 0.678
4.73AspGlu: 4.73 ± 2.259
2.838AspPhe: 2.838 ± 1.157
3.784AspGly: 3.784 ± 1.808
0.946AspHis: 0.946 ± 0.452
3.784AspIle: 3.784 ± 1.125
3.784AspLys: 3.784 ± 1.125
2.838AspLeu: 2.838 ± 1.134
0.946AspMet: 0.946 ± 0.614
2.365AspAsn: 2.365 ± 1.13
2.365AspPro: 2.365 ± 1.237
2.365AspGln: 2.365 ± 1.13
1.419AspArg: 1.419 ± 1.522
1.419AspSer: 1.419 ± 0.678
1.892AspThr: 1.892 ± 0.904
0.946AspVal: 0.946 ± 0.452
0.946AspTrp: 0.946 ± 1.692
3.784AspTyr: 3.784 ± 1.808
0.0AspXaa: 0.0 ± 0.0
Glu
6.149GluAla: 6.149 ± 2.487
0.946GluCys: 0.946 ± 0.452
6.149GluAsp: 6.149 ± 2.937
15.137GluGlu: 15.137 ± 5.357
2.838GluPhe: 2.838 ± 1.356
4.73GluGly: 4.73 ± 1.264
0.946GluHis: 0.946 ± 0.452
5.676GluIle: 5.676 ± 1.408
10.88GluLys: 10.88 ± 8.572
7.096GluLeu: 7.096 ± 2.08
1.892GluMet: 1.892 ± 0.904
3.311GluAsn: 3.311 ± 1.071
3.311GluPro: 3.311 ± 1.582
5.203GluGln: 5.203 ± 0.958
4.73GluArg: 4.73 ± 7.211
5.203GluSer: 5.203 ± 4.399
1.892GluThr: 1.892 ± 0.904
8.042GluVal: 8.042 ± 0.405
1.419GluTrp: 1.419 ± 0.678
2.838GluTyr: 2.838 ± 1.134
0.0GluXaa: 0.0 ± 0.0
Phe
1.892PheAla: 1.892 ± 0.904
0.946PheCys: 0.946 ± 0.452
2.838PheAsp: 2.838 ± 1.356
2.365PheGlu: 2.365 ± 1.13
0.946PhePhe: 0.946 ± 0.452
0.473PheGly: 0.473 ± 0.226
0.473PheHis: 0.473 ± 0.226
3.784PheIle: 3.784 ± 1.808
2.838PheLys: 2.838 ± 1.157
3.311PheLeu: 3.311 ± 1.582
0.0PheMet: 0.0 ± 0.0
2.365PheAsn: 2.365 ± 1.13
0.473PhePro: 0.473 ± 0.226
1.419PheGln: 1.419 ± 1.489
2.365PheArg: 2.365 ± 1.237
2.838PheSer: 2.838 ± 1.356
1.892PheThr: 1.892 ± 0.904
2.838PheVal: 2.838 ± 2.087
0.473PheTrp: 0.473 ± 0.226
0.946PheTyr: 0.946 ± 0.452
0.0PheXaa: 0.0 ± 0.0
Gly
2.838GlyAla: 2.838 ± 1.356
0.946GlyCys: 0.946 ± 0.452
3.784GlyAsp: 3.784 ± 1.808
5.676GlyGlu: 5.676 ± 2.711
2.365GlyPhe: 2.365 ± 1.237
5.203GlyGly: 5.203 ± 1.385
0.473GlyHis: 0.473 ± 0.226
4.73GlyIle: 4.73 ± 1.184
2.838GlyLys: 2.838 ± 1.356
2.838GlyLeu: 2.838 ± 1.356
2.365GlyMet: 2.365 ± 1.13
1.419GlyAsn: 1.419 ± 0.678
0.473GlyPro: 0.473 ± 0.226
0.473GlyGln: 0.473 ± 0.226
6.149GlyArg: 6.149 ± 2.937
1.419GlySer: 1.419 ± 0.678
3.784GlyThr: 3.784 ± 2.7
4.73GlyVal: 4.73 ± 1.264
1.892GlyTrp: 1.892 ± 0.904
1.892GlyTyr: 1.892 ± 0.904
0.0GlyXaa: 0.0 ± 0.0
His
1.419HisAla: 1.419 ± 0.678
0.946HisCys: 0.946 ± 0.452
0.0HisAsp: 0.0 ± 0.0
0.473HisGlu: 0.473 ± 0.226
0.473HisPhe: 0.473 ± 0.226
0.473HisGly: 0.473 ± 0.226
0.473HisHis: 0.473 ± 0.226
1.419HisIle: 1.419 ± 0.678
0.473HisLys: 0.473 ± 0.226
2.838HisLeu: 2.838 ± 1.134
0.0HisMet: 0.0 ± 0.0
1.892HisAsn: 1.892 ± 1.368
0.946HisPro: 0.946 ± 0.452
0.946HisGln: 0.946 ± 0.452
1.419HisArg: 1.419 ± 0.678
0.0HisSer: 0.0 ± 0.0
0.0HisThr: 0.0 ± 0.0
1.419HisVal: 1.419 ± 0.678
0.473HisTrp: 0.473 ± 0.226
1.892HisTyr: 1.892 ± 0.904
0.0HisXaa: 0.0 ± 0.0
Ile
1.892IleAla: 1.892 ± 1.35
1.419IleCys: 1.419 ± 0.678
2.365IleAsp: 2.365 ± 1.13
5.203IleGlu: 5.203 ± 1.385
1.419IlePhe: 1.419 ± 0.678
4.73IleGly: 4.73 ± 2.259
2.365IleHis: 2.365 ± 1.13
4.257IleIle: 4.257 ± 2.034
5.676IleLys: 5.676 ± 2.315
5.203IleLeu: 5.203 ± 0.958
0.473IleMet: 0.473 ± 0.226
3.784IleAsn: 3.784 ± 1.808
4.73IlePro: 4.73 ± 2.259
6.149IleGln: 6.149 ± 2.468
3.784IleArg: 3.784 ± 1.125
3.784IleSer: 3.784 ± 1.053
1.892IleThr: 1.892 ± 1.35
4.257IleVal: 4.257 ± 1.409
0.946IleTrp: 0.946 ± 0.452
1.419IleTyr: 1.419 ± 0.678
0.0IleXaa: 0.0 ± 0.0
Lys
1.419LysAla: 1.419 ± 3.465
0.946LysCys: 0.946 ± 0.452
3.311LysAsp: 3.311 ± 4.779
9.934LysGlu: 9.934 ± 3.06
2.365LysPhe: 2.365 ± 1.13
3.784LysGly: 3.784 ± 1.053
1.892LysHis: 1.892 ± 0.904
4.257LysIle: 4.257 ± 2.034
8.515LysLys: 8.515 ± 1.57
5.676LysLeu: 5.676 ± 4.746
2.365LysMet: 2.365 ± 0.915
5.676LysAsn: 5.676 ± 1.408
3.311LysPro: 3.311 ± 2.885
2.365LysGln: 2.365 ± 2.312
6.623LysArg: 6.623 ± 1.868
3.784LysSer: 3.784 ± 1.808
2.838LysThr: 2.838 ± 1.356
2.365LysVal: 2.365 ± 2.312
1.419LysTrp: 1.419 ± 1.522
3.784LysTyr: 3.784 ± 1.808
0.0LysXaa: 0.0 ± 0.0
Leu
4.257LeuAla: 4.257 ± 4.566
2.365LeuCys: 2.365 ± 1.13
2.838LeuAsp: 2.838 ± 1.134
8.988LeuGlu: 8.988 ± 7.507
1.892LeuPhe: 1.892 ± 0.904
6.149LeuGly: 6.149 ± 1.692
0.473LeuHis: 0.473 ± 0.226
4.257LeuIle: 4.257 ± 2.58
10.407LeuLys: 10.407 ± 1.916
5.676LeuLeu: 5.676 ± 1.53
1.419LeuMet: 1.419 ± 0.678
4.73LeuAsn: 4.73 ± 1.158
5.203LeuPro: 5.203 ± 1.385
5.676LeuGln: 5.676 ± 2.668
3.784LeuArg: 3.784 ± 3.552
4.257LeuSer: 4.257 ± 1.175
3.784LeuThr: 3.784 ± 4.732
6.149LeuVal: 6.149 ± 2.487
0.0LeuTrp: 0.0 ± 0.0
2.838LeuTyr: 2.838 ± 1.134
0.0LeuXaa: 0.0 ± 0.0
Met
0.946MetAla: 0.946 ± 0.452
0.0MetCys: 0.0 ± 0.0
0.946MetAsp: 0.946 ± 0.452
1.892MetGlu: 1.892 ± 0.904
1.419MetPhe: 1.419 ± 0.678
1.419MetGly: 1.419 ± 0.678
0.473MetHis: 0.473 ± 0.226
1.892MetIle: 1.892 ± 0.904
1.892MetLys: 1.892 ± 0.904
3.311MetLeu: 3.311 ± 1.582
1.419MetMet: 1.419 ± 0.678
2.838MetAsn: 2.838 ± 1.356
1.419MetPro: 1.419 ± 0.678
0.946MetGln: 0.946 ± 0.452
1.892MetArg: 1.892 ± 0.904
1.892MetSer: 1.892 ± 2.538
2.365MetThr: 2.365 ± 1.13
0.946MetVal: 0.946 ± 0.452
0.473MetTrp: 0.473 ± 0.226
0.473MetTyr: 0.473 ± 0.226
0.0MetXaa: 0.0 ± 0.0
Asn
0.946AsnAla: 0.946 ± 0.452
0.946AsnCys: 0.946 ± 0.452
2.365AsnAsp: 2.365 ± 1.13
2.838AsnGlu: 2.838 ± 1.356
0.946AsnPhe: 0.946 ± 0.452
2.838AsnGly: 2.838 ± 1.356
1.419AsnHis: 1.419 ± 0.678
1.892AsnIle: 1.892 ± 0.904
3.311AsnLys: 3.311 ± 1.071
8.042AsnLeu: 8.042 ± 9.297
0.473AsnMet: 0.473 ± 0.226
2.365AsnAsn: 2.365 ± 2.312
1.892AsnPro: 1.892 ± 0.904
1.892AsnGln: 1.892 ± 0.904
1.419AsnArg: 1.419 ± 0.678
5.676AsnSer: 5.676 ± 2.696
3.784AsnThr: 3.784 ± 1.125
2.838AsnVal: 2.838 ± 1.356
1.419AsnTrp: 1.419 ± 0.678
1.892AsnTyr: 1.892 ± 0.904
0.0AsnXaa: 0.0 ± 0.0
Pro
3.784ProAla: 3.784 ± 2.7
0.0ProCys: 0.0 ± 0.0
4.257ProAsp: 4.257 ± 2.034
5.676ProGlu: 5.676 ± 1.53
1.892ProPhe: 1.892 ± 0.904
2.838ProGly: 2.838 ± 1.356
1.419ProHis: 1.419 ± 0.678
1.892ProIle: 1.892 ± 0.904
3.311ProLys: 3.311 ± 1.861
5.676ProLeu: 5.676 ± 0.733
0.0ProMet: 0.0 ± 0.0
2.365ProAsn: 2.365 ± 1.13
2.365ProPro: 2.365 ± 1.13
1.892ProGln: 1.892 ± 0.904
2.838ProArg: 2.838 ± 1.356
3.311ProSer: 3.311 ± 1.582
1.419ProThr: 1.419 ± 1.522
0.473ProVal: 0.473 ± 0.226
0.473ProTrp: 0.473 ± 0.226
1.419ProTyr: 1.419 ± 0.678
0.0ProXaa: 0.0 ± 0.0
Gln
2.838GlnAla: 2.838 ± 1.356
0.473GlnCys: 0.473 ± 0.226
2.838GlnAsp: 2.838 ± 1.134
5.676GlnGlu: 5.676 ± 2.268
0.473GlnPhe: 0.473 ± 0.226
2.838GlnGly: 2.838 ± 2.978
1.419GlnHis: 1.419 ± 0.678
5.676GlnIle: 5.676 ± 2.315
2.838GlnLys: 2.838 ± 2.087
3.311GlnLeu: 3.311 ± 2.833
1.892GlnMet: 1.892 ± 0.904
1.419GlnAsn: 1.419 ± 1.489
4.257GlnPro: 4.257 ± 1.084
2.365GlnGln: 2.365 ± 1.236
3.311GlnArg: 3.311 ± 1.118
2.365GlnSer: 2.365 ± 2.312
1.419GlnThr: 1.419 ± 0.678
1.892GlnVal: 1.892 ± 1.368
0.473GlnTrp: 0.473 ± 0.226
0.946GlnTyr: 0.946 ± 0.452
0.0GlnXaa: 0.0 ± 0.0
Arg
3.784ArgAla: 3.784 ± 1.808
0.473ArgCys: 0.473 ± 0.226
3.784ArgAsp: 3.784 ± 1.808
2.365ArgGlu: 2.365 ± 1.237
3.311ArgPhe: 3.311 ± 1.582
2.365ArgGly: 2.365 ± 1.13
0.473ArgHis: 0.473 ± 0.226
4.73ArgIle: 4.73 ± 1.264
5.203ArgLys: 5.203 ± 1.269
6.623ArgLeu: 6.623 ± 4.343
4.73ArgMet: 4.73 ± 2.259
4.257ArgAsn: 4.257 ± 1.084
2.838ArgPro: 2.838 ± 1.157
1.892ArgGln: 1.892 ± 2.538
7.569ArgArg: 7.569 ± 2.118
5.203ArgSer: 5.203 ± 1.269
4.257ArgThr: 4.257 ± 1.409
4.73ArgVal: 4.73 ± 8.067
1.419ArgTrp: 1.419 ± 0.678
1.419ArgTyr: 1.419 ± 0.678
0.0ArgXaa: 0.0 ± 0.0
Ser
2.838SerAla: 2.838 ± 1.157
0.473SerCys: 0.473 ± 0.226
1.419SerAsp: 1.419 ± 0.678
5.203SerGlu: 5.203 ± 1.269
3.784SerPhe: 3.784 ± 1.808
3.784SerGly: 3.784 ± 1.808
1.419SerHis: 1.419 ± 1.522
3.784SerIle: 3.784 ± 1.635
2.838SerLys: 2.838 ± 1.356
6.623SerLeu: 6.623 ± 4.274
0.946SerMet: 0.946 ± 1.133
0.473SerAsn: 0.473 ± 0.226
4.257SerPro: 4.257 ± 1.175
2.365SerGln: 2.365 ± 2.312
5.203SerArg: 5.203 ± 2.361
2.838SerSer: 2.838 ± 2.087
3.311SerThr: 3.311 ± 1.582
2.365SerVal: 2.365 ± 1.13
0.946SerTrp: 0.946 ± 1.692
1.419SerTyr: 1.419 ± 3.465
0.0SerXaa: 0.0 ± 0.0
Thr
2.365ThrAla: 2.365 ± 1.236
0.473ThrCys: 0.473 ± 0.226
2.838ThrAsp: 2.838 ± 1.356
3.784ThrGlu: 3.784 ± 5.077
1.419ThrPhe: 1.419 ± 1.489
4.73ThrGly: 4.73 ± 2.259
0.946ThrHis: 0.946 ± 0.452
2.838ThrIle: 2.838 ± 1.356
0.946ThrLys: 0.946 ± 0.452
3.311ThrLeu: 3.311 ± 1.118
1.892ThrMet: 1.892 ± 0.904
1.419ThrAsn: 1.419 ± 2.764
1.892ThrPro: 1.892 ± 0.904
3.311ThrGln: 3.311 ± 1.071
3.784ThrArg: 3.784 ± 1.808
3.311ThrSer: 3.311 ± 1.861
2.365ThrThr: 2.365 ± 3.211
1.419ThrVal: 1.419 ± 0.678
0.0ThrTrp: 0.0 ± 0.0
1.419ThrTyr: 1.419 ± 0.678
0.0ThrXaa: 0.0 ± 0.0
Val
4.73ValAla: 4.73 ± 1.264
0.946ValCys: 0.946 ± 0.452
1.419ValAsp: 1.419 ± 0.678
7.096ValGlu: 7.096 ± 8.275
3.311ValPhe: 3.311 ± 1.861
1.892ValGly: 1.892 ± 0.904
1.892ValHis: 1.892 ± 0.904
3.311ValIle: 3.311 ± 1.582
1.892ValLys: 1.892 ± 1.368
3.784ValLeu: 3.784 ± 1.125
0.946ValMet: 0.946 ± 0.452
2.365ValAsn: 2.365 ± 1.236
2.838ValPro: 2.838 ± 1.157
4.257ValGln: 4.257 ± 1.175
5.676ValArg: 5.676 ± 4.173
2.838ValSer: 2.838 ± 1.157
2.365ValThr: 2.365 ± 1.13
1.892ValVal: 1.892 ± 1.368
0.0ValTrp: 0.0 ± 0.0
1.892ValTyr: 1.892 ± 3.385
0.0ValXaa: 0.0 ± 0.0
Trp
0.0TrpAla: 0.0 ± 0.0
0.473TrpCys: 0.473 ± 0.226
0.946TrpAsp: 0.946 ± 0.452
1.419TrpGlu: 1.419 ± 1.522
0.0TrpPhe: 0.0 ± 0.0
0.473TrpGly: 0.473 ± 0.226
0.0TrpHis: 0.0 ± 0.0
1.419TrpIle: 1.419 ± 0.678
1.419TrpLys: 1.419 ± 0.678
0.946TrpLeu: 0.946 ± 0.452
0.473TrpMet: 0.473 ± 0.226
0.473TrpAsn: 0.473 ± 0.226
0.0TrpPro: 0.0 ± 0.0
0.946TrpGln: 0.946 ± 0.452
0.473TrpArg: 0.473 ± 0.226
0.473TrpSer: 0.473 ± 0.226
1.419TrpThr: 1.419 ± 1.522
0.946TrpVal: 0.946 ± 0.452
0.0TrpTrp: 0.0 ± 0.0
0.473TrpTyr: 0.473 ± 1.875
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.892TyrAla: 1.892 ± 0.904
0.0TyrCys: 0.0 ± 0.0
1.419TyrAsp: 1.419 ± 1.522
3.311TyrGlu: 3.311 ± 1.071
0.0TyrPhe: 0.0 ± 0.0
1.419TyrGly: 1.419 ± 0.678
0.473TyrHis: 0.473 ± 1.875
2.365TyrIle: 2.365 ± 1.237
4.73TyrLys: 4.73 ± 1.264
3.784TyrLeu: 3.784 ± 1.053
0.946TyrMet: 0.946 ± 0.452
2.838TyrAsn: 2.838 ± 1.134
1.892TyrPro: 1.892 ± 0.904
0.473TyrGln: 0.473 ± 0.226
3.784TyrArg: 3.784 ± 1.808
1.419TyrSer: 1.419 ± 0.678
0.946TyrThr: 0.946 ± 0.452
1.419TyrVal: 1.419 ± 0.678
0.0TyrTrp: 0.0 ± 0.0
1.892TyrTyr: 1.892 ± 0.904
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2115 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski