Amino acid dipepetide frequency for Nudaurelia capensis beta virus (isolate Pine emperor moth/South Africa) (NbetaV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
7.098AlaAla: 7.098 ± 0.269
0.789AlaCys: 0.789 ± 0.381
4.732AlaAsp: 4.732 ± 1.495
5.915AlaGlu: 5.915 ± 1.277
2.366AlaPhe: 2.366 ± 1.227
5.521AlaGly: 5.521 ± 0.297
2.366AlaHis: 2.366 ± 1.143
5.915AlaIle: 5.915 ± 1.092
3.943AlaLys: 3.943 ± 1.115
6.703AlaLeu: 6.703 ± 0.868
1.972AlaMet: 1.972 ± 1.417
5.126AlaAsn: 5.126 ± 3.053
1.972AlaPro: 1.972 ± 0.627
3.549AlaGln: 3.549 ± 0.655
6.309AlaArg: 6.309 ± 2.481
5.521AlaSer: 5.521 ± 0.493
5.521AlaThr: 5.521 ± 2.072
4.338AlaVal: 4.338 ± 0.515
1.577AlaTrp: 1.577 ± 0.028
1.183AlaTyr: 1.183 ± 1.008
0.0AlaXaa: 0.0 ± 0.0
Cys
1.577CysAla: 1.577 ± 0.762
0.0CysCys: 0.0 ± 0.0
1.183CysAsp: 1.183 ± 0.571
1.577CysGlu: 1.577 ± 0.762
0.394CysPhe: 0.394 ± 0.19
1.577CysGly: 1.577 ± 0.028
0.394CysHis: 0.394 ± 0.19
1.183CysIle: 1.183 ± 0.218
0.789CysLys: 0.789 ± 0.381
0.789CysLeu: 0.789 ± 0.381
0.0CysMet: 0.0 ± 0.0
0.394CysAsn: 0.394 ± 0.19
0.789CysPro: 0.789 ± 0.381
0.394CysGln: 0.394 ± 0.19
0.789CysArg: 0.789 ± 0.381
0.394CysSer: 0.394 ± 0.19
1.577CysThr: 1.577 ± 0.762
1.183CysVal: 1.183 ± 0.571
0.394CysTrp: 0.394 ± 0.19
0.394CysTyr: 0.394 ± 0.19
0.0CysXaa: 0.0 ± 0.0
Asp
8.675AspAla: 8.675 ± 1.03
0.789AspCys: 0.789 ± 0.409
4.338AspAsp: 4.338 ± 0.515
2.76AspGlu: 2.76 ± 1.333
1.972AspPhe: 1.972 ± 0.952
6.703AspGly: 6.703 ± 0.078
1.972AspHis: 1.972 ± 0.952
3.943AspIle: 3.943 ± 0.325
2.76AspLys: 2.76 ± 0.543
3.943AspLeu: 3.943 ± 0.325
0.0AspMet: 0.0 ± 0.0
1.972AspAsn: 1.972 ± 0.952
1.972AspPro: 1.972 ± 0.162
1.183AspGln: 1.183 ± 0.218
6.309AspArg: 6.309 ± 3.047
3.549AspSer: 3.549 ± 0.924
5.126AspThr: 5.126 ± 0.896
5.126AspVal: 5.126 ± 0.106
0.789AspTrp: 0.789 ± 1.199
1.183AspTyr: 1.183 ± 1.008
0.0AspXaa: 0.0 ± 0.0
Glu
4.338GluAla: 4.338 ± 1.064
1.972GluCys: 1.972 ± 0.952
2.366GluAsp: 2.366 ± 1.143
0.789GluGlu: 0.789 ± 0.381
2.76GluPhe: 2.76 ± 0.543
4.732GluGly: 4.732 ± 1.495
1.577GluHis: 1.577 ± 0.028
1.577GluIle: 1.577 ± 0.762
5.126GluLys: 5.126 ± 2.476
2.76GluLeu: 2.76 ± 0.543
0.394GluMet: 0.394 ± 0.599
1.183GluAsn: 1.183 ± 0.218
5.521GluPro: 5.521 ± 1.876
1.183GluGln: 1.183 ± 0.571
6.309GluArg: 6.309 ± 1.467
5.521GluSer: 5.521 ± 1.283
2.366GluThr: 2.366 ± 0.353
1.577GluVal: 1.577 ± 0.818
0.394GluTrp: 0.394 ± 0.19
1.972GluTyr: 1.972 ± 0.162
0.0GluXaa: 0.0 ± 0.0
Phe
3.549PheAla: 3.549 ± 2.235
0.394PheCys: 0.394 ± 0.19
1.577PheAsp: 1.577 ± 0.028
2.366PheGlu: 2.366 ± 0.437
0.394PhePhe: 0.394 ± 0.599
2.76PheGly: 2.76 ± 1.036
0.394PheHis: 0.394 ± 0.19
2.366PheIle: 2.366 ± 0.353
1.183PheLys: 1.183 ± 0.218
0.394PheLeu: 0.394 ± 0.19
0.394PheMet: 0.394 ± 0.19
0.789PheAsn: 0.789 ± 0.409
0.394PhePro: 0.394 ± 0.19
3.155PheGln: 3.155 ± 1.636
3.155PheArg: 3.155 ± 0.846
3.155PheSer: 3.155 ± 1.523
0.789PheThr: 0.789 ± 0.409
0.789PheVal: 0.789 ± 0.381
0.394PheTrp: 0.394 ± 0.19
1.183PheTyr: 1.183 ± 1.008
0.0PheXaa: 0.0 ± 0.0
Gly
6.309GlyAla: 6.309 ± 2.481
2.366GlyCys: 2.366 ± 1.143
2.76GlyAsp: 2.76 ± 0.543
3.155GlyGlu: 3.155 ± 0.734
2.366GlyPhe: 2.366 ± 0.437
4.338GlyGly: 4.338 ± 2.644
2.366GlyHis: 2.366 ± 1.143
1.183GlyIle: 1.183 ± 0.218
2.366GlyLys: 2.366 ± 0.353
4.732GlyLeu: 4.732 ± 1.664
2.366GlyMet: 2.366 ± 0.437
3.943GlyAsn: 3.943 ± 1.255
5.915GlyPro: 5.915 ± 0.303
4.732GlyGln: 4.732 ± 0.084
7.098GlyArg: 7.098 ± 1.059
5.915GlySer: 5.915 ± 0.487
6.309GlyThr: 6.309 ± 0.902
6.703GlyVal: 6.703 ± 0.711
0.789GlyTrp: 0.789 ± 0.381
1.972GlyTyr: 1.972 ± 0.952
0.0GlyXaa: 0.0 ± 0.0
His
2.366HisAla: 2.366 ± 0.353
0.394HisCys: 0.394 ± 0.19
1.183HisAsp: 1.183 ± 0.218
1.972HisGlu: 1.972 ± 0.952
0.0HisPhe: 0.0 ± 0.0
2.76HisGly: 2.76 ± 1.333
3.155HisHis: 3.155 ± 1.523
1.577HisIle: 1.577 ± 0.762
0.0HisLys: 0.0 ± 0.0
2.76HisLeu: 2.76 ± 0.543
0.0HisMet: 0.0 ± 0.0
1.577HisAsn: 1.577 ± 0.762
1.972HisPro: 1.972 ± 0.627
1.577HisGln: 1.577 ± 0.028
2.76HisArg: 2.76 ± 0.543
3.943HisSer: 3.943 ± 1.904
1.577HisThr: 1.577 ± 0.762
1.183HisVal: 1.183 ± 0.571
0.394HisTrp: 0.394 ± 0.19
0.789HisTyr: 0.789 ± 0.381
0.0HisXaa: 0.0 ± 0.0
Ile
5.521IleAla: 5.521 ± 0.297
0.0IleCys: 0.0 ± 0.0
3.155IleAsp: 3.155 ± 0.734
3.549IleGlu: 3.549 ± 0.924
1.577IlePhe: 1.577 ± 0.762
4.338IleGly: 4.338 ± 1.064
1.183IleHis: 1.183 ± 0.218
1.577IleIle: 1.577 ± 0.762
1.577IleLys: 1.577 ± 0.028
1.577IleLeu: 1.577 ± 0.762
1.183IleMet: 1.183 ± 0.571
2.366IleAsn: 2.366 ± 0.353
4.338IlePro: 4.338 ± 2.644
1.972IleGln: 1.972 ± 0.162
4.338IleArg: 4.338 ± 1.305
1.972IleSer: 1.972 ± 0.952
3.943IleThr: 3.943 ± 1.255
3.155IleVal: 3.155 ± 0.734
0.394IleTrp: 0.394 ± 0.19
0.789IleTyr: 0.789 ± 0.381
0.0IleXaa: 0.0 ± 0.0
Lys
3.155LysAla: 3.155 ± 1.523
1.183LysCys: 1.183 ± 0.571
1.577LysAsp: 1.577 ± 0.028
3.155LysGlu: 3.155 ± 0.056
0.789LysPhe: 0.789 ± 0.381
2.366LysGly: 2.366 ± 0.353
1.183LysHis: 1.183 ± 0.571
3.943LysIle: 3.943 ± 0.325
1.183LysLys: 1.183 ± 0.571
3.943LysLeu: 3.943 ± 1.115
0.394LysMet: 0.394 ± 0.19
1.577LysAsn: 1.577 ± 0.762
1.183LysPro: 1.183 ± 0.571
2.366LysGln: 2.366 ± 1.143
2.366LysArg: 2.366 ± 0.353
2.366LysSer: 2.366 ± 1.143
3.943LysThr: 3.943 ± 1.255
1.972LysVal: 1.972 ± 0.162
0.0LysTrp: 0.0 ± 0.0
1.577LysTyr: 1.577 ± 0.762
0.0LysXaa: 0.0 ± 0.0
Leu
5.126LeuAla: 5.126 ± 2.476
0.394LeuCys: 0.394 ± 0.19
7.098LeuAsp: 7.098 ± 0.521
2.76LeuGlu: 2.76 ± 0.543
1.972LeuPhe: 1.972 ± 0.627
4.338LeuGly: 4.338 ± 0.515
3.155LeuHis: 3.155 ± 0.734
1.183LeuIle: 1.183 ± 0.571
3.549LeuLys: 3.549 ± 0.924
3.943LeuLeu: 3.943 ± 0.465
2.366LeuMet: 2.366 ± 0.666
2.366LeuAsn: 2.366 ± 1.227
5.126LeuPro: 5.126 ± 1.473
4.338LeuGln: 4.338 ± 0.275
5.521LeuArg: 5.521 ± 1.087
1.972LeuSer: 1.972 ± 0.162
5.126LeuThr: 5.126 ± 1.473
2.76LeuVal: 2.76 ± 1.826
0.394LeuTrp: 0.394 ± 0.19
3.155LeuTyr: 3.155 ± 0.734
0.0LeuXaa: 0.0 ± 0.0
Met
1.577MetAla: 1.577 ± 1.608
0.0MetCys: 0.0 ± 0.0
1.972MetAsp: 1.972 ± 0.162
1.183MetGlu: 1.183 ± 0.218
0.789MetPhe: 0.789 ± 0.381
0.789MetGly: 0.789 ± 0.409
1.577MetHis: 1.577 ± 0.028
0.0MetIle: 0.0 ± 0.0
0.394MetLys: 0.394 ± 0.19
0.789MetLeu: 0.789 ± 0.381
1.183MetMet: 1.183 ± 1.008
0.394MetAsn: 0.394 ± 0.599
0.789MetPro: 0.789 ± 1.199
0.789MetGln: 0.789 ± 0.409
1.577MetArg: 1.577 ± 0.762
1.577MetSer: 1.577 ± 0.028
1.972MetThr: 1.972 ± 1.417
1.183MetVal: 1.183 ± 1.008
0.394MetTrp: 0.394 ± 0.599
0.789MetTyr: 0.789 ± 0.381
0.0MetXaa: 0.0 ± 0.0
Asn
2.76AsnAla: 2.76 ± 1.036
1.577AsnCys: 1.577 ± 0.762
1.577AsnAsp: 1.577 ± 0.028
1.972AsnGlu: 1.972 ± 0.162
0.394AsnPhe: 0.394 ± 0.599
2.76AsnGly: 2.76 ± 0.246
1.183AsnHis: 1.183 ± 0.218
1.972AsnIle: 1.972 ± 0.952
0.789AsnLys: 0.789 ± 0.409
4.732AsnLeu: 4.732 ± 1.664
0.789AsnMet: 0.789 ± 0.409
3.155AsnAsn: 3.155 ± 2.425
1.577AsnPro: 1.577 ± 0.818
1.972AsnGln: 1.972 ± 1.417
1.972AsnArg: 1.972 ± 0.627
2.76AsnSer: 2.76 ± 1.036
1.972AsnThr: 1.972 ± 1.417
2.76AsnVal: 2.76 ± 0.246
1.183AsnTrp: 1.183 ± 0.571
0.789AsnTyr: 0.789 ± 0.409
0.0AsnXaa: 0.0 ± 0.0
Pro
1.972ProAla: 1.972 ± 1.417
0.789ProCys: 0.789 ± 0.381
5.915ProAsp: 5.915 ± 0.303
3.549ProGlu: 3.549 ± 0.134
2.366ProPhe: 2.366 ± 1.227
6.309ProGly: 6.309 ± 0.678
1.183ProHis: 1.183 ± 0.571
1.577ProIle: 1.577 ± 0.028
1.972ProLys: 1.972 ± 0.627
3.549ProLeu: 3.549 ± 2.235
1.577ProMet: 1.577 ± 0.734
1.577ProAsn: 1.577 ± 0.028
3.943ProPro: 3.943 ± 2.834
2.366ProGln: 2.366 ± 0.353
3.155ProArg: 3.155 ± 0.056
3.943ProSer: 3.943 ± 0.325
5.915ProThr: 5.915 ± 1.092
3.549ProVal: 3.549 ± 0.655
0.789ProTrp: 0.789 ± 0.409
0.789ProTyr: 0.789 ± 0.409
0.0ProXaa: 0.0 ± 0.0
Gln
3.549GlnAla: 3.549 ± 0.134
0.394GlnCys: 0.394 ± 0.19
5.521GlnAsp: 5.521 ± 1.087
1.972GlnGlu: 1.972 ± 0.952
1.972GlnPhe: 1.972 ± 2.207
2.366GlnGly: 2.366 ± 1.227
0.394GlnHis: 0.394 ± 0.19
1.972GlnIle: 1.972 ± 0.627
1.972GlnLys: 1.972 ± 0.162
3.549GlnLeu: 3.549 ± 1.445
1.577GlnMet: 1.577 ± 0.818
0.0GlnAsn: 0.0 ± 0.0
2.76GlnPro: 2.76 ± 0.246
4.732GlnGln: 4.732 ± 2.285
3.943GlnArg: 3.943 ± 1.115
2.366GlnSer: 2.366 ± 0.353
1.577GlnThr: 1.577 ± 0.818
1.577GlnVal: 1.577 ± 0.818
0.0GlnTrp: 0.0 ± 0.0
0.394GlnTyr: 0.394 ± 0.19
0.0GlnXaa: 0.0 ± 0.0
Arg
4.732ArgAla: 4.732 ± 0.084
1.577ArgCys: 1.577 ± 0.762
5.915ArgAsp: 5.915 ± 2.856
6.703ArgGlu: 6.703 ± 3.237
2.366ArgPhe: 2.366 ± 0.353
7.492ArgGly: 7.492 ± 0.331
2.76ArgHis: 2.76 ± 0.543
4.338ArgIle: 4.338 ± 1.305
2.76ArgLys: 2.76 ± 1.333
5.521ArgLeu: 5.521 ± 1.087
0.394ArgMet: 0.394 ± 0.19
3.155ArgAsn: 3.155 ± 0.846
1.972ArgPro: 1.972 ± 0.162
1.972ArgGln: 1.972 ± 0.627
10.252ArgArg: 10.252 ± 2.157
7.886ArgSer: 7.886 ± 3.019
7.492ArgThr: 7.492 ± 0.459
7.098ArgVal: 7.098 ± 1.059
0.394ArgTrp: 0.394 ± 0.599
3.943ArgTyr: 3.943 ± 0.465
0.0ArgXaa: 0.0 ± 0.0
Ser
5.521SerAla: 5.521 ± 0.297
0.789SerCys: 0.789 ± 0.381
4.732SerAsp: 4.732 ± 2.285
4.338SerGlu: 4.338 ± 0.275
1.183SerPhe: 1.183 ± 0.571
7.886SerGly: 7.886 ± 0.65
0.789SerHis: 0.789 ± 0.381
5.126SerIle: 5.126 ± 1.686
2.366SerLys: 2.366 ± 1.143
2.76SerLeu: 2.76 ± 0.246
0.394SerMet: 0.394 ± 0.599
3.549SerAsn: 3.549 ± 1.445
3.943SerPro: 3.943 ± 0.465
1.183SerGln: 1.183 ± 0.218
6.703SerArg: 6.703 ± 3.237
3.943SerSer: 3.943 ± 1.115
4.732SerThr: 4.732 ± 1.664
4.732SerVal: 4.732 ± 0.706
1.183SerTrp: 1.183 ± 0.218
1.972SerTyr: 1.972 ± 0.627
0.0SerXaa: 0.0 ± 0.0
Thr
4.732ThrAla: 4.732 ± 0.084
0.789ThrCys: 0.789 ± 0.409
4.338ThrAsp: 4.338 ± 1.854
2.76ThrGlu: 2.76 ± 1.826
1.183ThrPhe: 1.183 ± 0.218
3.943ThrGly: 3.943 ± 0.465
1.577ThrHis: 1.577 ± 0.762
3.943ThrIle: 3.943 ± 2.834
4.338ThrLys: 4.338 ± 1.305
7.492ThrLeu: 7.492 ± 0.331
1.577ThrMet: 1.577 ± 0.818
2.366ThrAsn: 2.366 ± 0.437
4.338ThrPro: 4.338 ± 0.275
1.972ThrGln: 1.972 ± 0.627
8.281ThrArg: 8.281 ± 2.42
6.309ThrSer: 6.309 ± 3.271
4.732ThrThr: 4.732 ± 3.243
5.126ThrVal: 5.126 ± 3.053
1.183ThrTrp: 1.183 ± 1.008
1.972ThrTyr: 1.972 ± 0.162
0.0ThrXaa: 0.0 ± 0.0
Val
6.309ValAla: 6.309 ± 4.061
1.183ValCys: 1.183 ± 0.571
4.338ValAsp: 4.338 ± 0.515
2.76ValGlu: 2.76 ± 0.246
3.155ValPhe: 3.155 ± 1.636
4.732ValGly: 4.732 ± 1.664
2.76ValHis: 2.76 ± 1.333
1.972ValIle: 1.972 ± 0.162
1.577ValLys: 1.577 ± 0.762
3.943ValLeu: 3.943 ± 1.904
1.183ValMet: 1.183 ± 0.218
1.577ValAsn: 1.577 ± 0.818
5.521ValPro: 5.521 ± 0.297
1.577ValGln: 1.577 ± 0.028
5.521ValArg: 5.521 ± 0.297
3.155ValSer: 3.155 ± 0.056
4.732ValThr: 4.732 ± 1.664
4.338ValVal: 4.338 ± 0.275
0.789ValTrp: 0.789 ± 0.381
1.183ValTyr: 1.183 ± 0.571
0.0ValXaa: 0.0 ± 0.0
Trp
0.789TrpAla: 0.789 ± 0.409
0.0TrpCys: 0.0 ± 0.0
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.394TrpPhe: 0.394 ± 0.19
0.394TrpGly: 0.394 ± 0.599
0.394TrpHis: 0.394 ± 0.19
1.183TrpIle: 1.183 ± 0.218
0.394TrpLys: 0.394 ± 0.599
0.394TrpLeu: 0.394 ± 0.19
0.789TrpMet: 0.789 ± 0.381
0.394TrpAsn: 0.394 ± 0.599
0.394TrpPro: 0.394 ± 0.599
0.394TrpGln: 0.394 ± 0.599
1.183TrpArg: 1.183 ± 0.571
0.789TrpSer: 0.789 ± 0.381
2.76TrpThr: 2.76 ± 0.246
1.183TrpVal: 1.183 ± 0.571
0.394TrpTrp: 0.394 ± 0.19
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.76TyrAla: 2.76 ± 0.543
0.394TyrCys: 0.394 ± 0.19
0.789TyrAsp: 0.789 ± 0.381
1.183TyrGlu: 1.183 ± 0.571
1.183TyrPhe: 1.183 ± 1.008
1.577TyrGly: 1.577 ± 0.028
1.577TyrHis: 1.577 ± 0.028
1.972TyrIle: 1.972 ± 0.952
1.183TyrLys: 1.183 ± 0.218
2.366TyrLeu: 2.366 ± 0.353
0.789TyrMet: 0.789 ± 1.199
1.183TyrAsn: 1.183 ± 0.218
2.76TyrPro: 2.76 ± 1.036
1.183TyrGln: 1.183 ± 0.571
1.183TyrArg: 1.183 ± 0.218
0.789TyrSer: 0.789 ± 0.381
0.789TyrThr: 0.789 ± 0.409
1.972TyrVal: 1.972 ± 0.162
0.394TyrTrp: 0.394 ± 0.19
0.0TyrTyr: 0.0 ± 0.0
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (2537 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski