Amino acid dipeptide frequency for Rotifer birnavirus strain Palavas

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred at random in proteins, with all amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
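The calculation described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function name `dipeptide_permille`, the toy sequences, and the choice to count overlapping pairs pooled over all proteins are hypothetical and are not taken from the Proteome-pI pipeline, whose exact counting convention is not given here.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code; anything else becomes 'X' (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_permille(sequences):
    """Return dipeptide frequencies in per mille, pooled over all sequences.

    Assumption: overlapping pairs are counted, and pairs never span
    two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to 'X'.
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Uniform expectation for 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_EXPECTED = 1000.0 / 400

# Toy sequences for illustration only, not the real proteome.
freqs = dipeptide_permille(["MKTAYIAKQR", "MAAL"])
over = sorted(p for p, f in freqs.items() if f > UNIFORM_EXPECTED)
```

With real data, the entries collected in `over` would correspond to the cells marked in red, and the remaining ones to the cells marked in blue.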

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰ ± error), grouped by first residue. Residue order: Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 5.228 ± 1.498
AlaCys: 0.475 ± 0.306
AlaAsp: 2.852 ± 0.596
AlaGlu: 5.703 ± 0.583
AlaPhe: 2.852 ± 0.596
AlaGly: 3.327 ± 0.289
AlaHis: 0.475 ± 0.306
AlaIle: 6.179 ± 0.885
AlaLys: 4.753 ± 1.239
AlaLeu: 8.08 ± 0.267
AlaMet: 2.852 ± 0.013
AlaAsn: 3.327 ± 2.115
AlaPro: 6.654 ± 1.187
AlaGln: 3.327 ± 1.506
AlaArg: 3.327 ± 0.319
AlaSer: 3.802 ± 0.591
AlaThr: 5.228 ± 0.328
AlaVal: 3.327 ± 1.506
AlaTrp: 0.475 ± 0.306
AlaTyr: 4.278 ± 0.932
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.951 ± 0.604
CysCys: 0.0 ± 0.0
CysAsp: 0.475 ± 0.306
CysGlu: 0.0 ± 0.0
CysPhe: 0.0 ± 0.0
CysGly: 0.951 ± 0.004
CysHis: 0.475 ± 0.302
CysIle: 0.0 ± 0.0
CysLys: 0.0 ± 0.0
CysLeu: 0.475 ± 0.306
CysMet: 0.0 ± 0.0
CysAsn: 0.0 ± 0.0
CysPro: 0.475 ± 0.306
CysGln: 0.0 ± 0.0
CysArg: 0.475 ± 0.302
CysSer: 0.951 ± 0.604
CysThr: 0.0 ± 0.0
CysVal: 0.951 ± 0.604
CysTrp: 0.0 ± 0.0
CysTyr: 0.951 ± 0.613
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.901 ± 0.617
AspCys: 0.0 ± 0.0
AspAsp: 2.852 ± 0.596
AspGlu: 4.278 ± 0.324
AspPhe: 1.901 ± 0.009
AspGly: 3.327 ± 0.898
AspHis: 0.475 ± 0.302
AspIle: 3.802 ± 0.017
AspLys: 2.376 ± 0.315
AspLeu: 5.703 ± 0.635
AspMet: 0.951 ± 0.004
AspAsn: 3.327 ± 0.289
AspPro: 3.802 ± 0.017
AspGln: 2.376 ± 0.293
AspArg: 1.426 ± 0.311
AspSer: 1.426 ± 0.919
AspThr: 3.327 ± 0.289
AspVal: 3.802 ± 0.626
AspTrp: 0.951 ± 0.004
AspTyr: 1.426 ± 0.919
AspXaa: 0.0 ± 0.0

Glu
GluAla: 5.703 ± 0.583
GluCys: 0.0 ± 0.0
GluAsp: 1.901 ± 1.226
GluGlu: 5.228 ± 0.889
GluPhe: 2.852 ± 0.622
GluGly: 4.278 ± 0.324
GluHis: 0.475 ± 0.306
GluIle: 2.376 ± 0.293
GluLys: 2.852 ± 1.23
GluLeu: 8.555 ± 0.039
GluMet: 1.901 ± 0.617
GluAsn: 1.901 ± 0.617
GluPro: 1.426 ± 0.311
GluGln: 1.426 ± 0.298
GluArg: 3.802 ± 1.234
GluSer: 3.327 ± 0.289
GluThr: 3.802 ± 0.591
GluVal: 2.852 ± 1.23
GluTrp: 0.475 ± 0.302
GluTyr: 1.901 ± 0.009
GluXaa: 0.0 ± 0.0

Phe
PheAla: 4.278 ± 0.285
PheCys: 0.475 ± 0.302
PheAsp: 2.852 ± 0.596
PheGlu: 1.901 ± 1.226
PhePhe: 0.0 ± 0.0
PheGly: 2.852 ± 0.013
PheHis: 1.426 ± 0.311
PheIle: 2.376 ± 0.293
PheLys: 2.376 ± 1.532
PheLeu: 1.901 ± 1.226
PheMet: 0.475 ± 0.302
PheAsn: 0.951 ± 0.004
PhePro: 3.327 ± 0.319
PheGln: 0.475 ± 0.302
PheArg: 1.901 ± 0.009
PheSer: 3.327 ± 0.319
PheThr: 0.951 ± 0.604
PheVal: 2.376 ± 1.532
PheTrp: 0.0 ± 0.0
PheTyr: 0.951 ± 0.604
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.753 ± 1.195
GlyCys: 1.901 ± 0.009
GlyAsp: 2.376 ± 0.902
GlyGlu: 2.852 ± 0.622
GlyPhe: 1.426 ± 0.906
GlyGly: 3.327 ± 0.289
GlyHis: 0.951 ± 0.004
GlyIle: 3.802 ± 0.017
GlyLys: 3.327 ± 0.289
GlyLeu: 3.802 ± 0.591
GlyMet: 1.426 ± 0.311
GlyAsn: 4.278 ± 0.893
GlyPro: 3.802 ± 1.808
GlyGln: 2.376 ± 0.315
GlyArg: 2.376 ± 0.293
GlySer: 6.179 ± 1.55
GlyThr: 4.753 ± 0.587
GlyVal: 4.278 ± 1.502
GlyTrp: 0.951 ± 0.004
GlyTyr: 1.901 ± 0.009
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 1.426 ± 0.298
HisGlu: 0.0 ± 0.0
HisPhe: 0.0 ± 0.0
HisGly: 0.951 ± 0.004
HisHis: 0.0 ± 0.0
HisIle: 2.376 ± 0.293
HisLys: 0.0 ± 0.0
HisLeu: 1.426 ± 0.919
HisMet: 0.475 ± 0.302
HisAsn: 0.0 ± 0.0
HisPro: 0.0 ± 0.0
HisGln: 0.475 ± 0.306
HisArg: 0.0 ± 0.0
HisSer: 0.475 ± 0.302
HisThr: 1.426 ± 0.311
HisVal: 0.475 ± 0.302
HisTrp: 0.0 ± 0.0
HisTyr: 0.475 ± 0.302
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.703 ± 1.191
IleCys: 0.0 ± 0.0
IleAsp: 4.278 ± 1.541
IleGlu: 4.278 ± 0.932
IlePhe: 2.852 ± 0.596
IleGly: 2.376 ± 0.293
IleHis: 0.475 ± 0.302
IleIle: 3.327 ± 0.319
IleLys: 2.852 ± 0.013
IleLeu: 5.228 ± 0.889
IleMet: 1.901 ± 0.009
IleAsn: 3.327 ± 0.289
IlePro: 6.654 ± 1.248
IleGln: 2.852 ± 0.622
IleArg: 2.852 ± 0.013
IleSer: 3.802 ± 0.626
IleThr: 6.179 ± 2.102
IleVal: 2.852 ± 0.596
IleTrp: 0.475 ± 0.302
IleTyr: 1.901 ± 0.009
IleXaa: 0.0 ± 0.0

Lys
LysAla: 6.179 ± 0.941
LysCys: 0.0 ± 0.0
LysAsp: 3.327 ± 0.928
LysGlu: 3.802 ± 1.843
LysPhe: 0.951 ± 0.004
LysGly: 1.901 ± 0.6
LysHis: 0.951 ± 0.604
LysIle: 0.951 ± 0.004
LysLys: 4.753 ± 1.239
LysLeu: 6.654 ± 0.03
LysMet: 0.475 ± 0.302
LysAsn: 4.278 ± 0.932
LysPro: 2.852 ± 1.23
LysGln: 4.753 ± 0.022
LysArg: 1.901 ± 0.617
LysSer: 6.654 ± 3.073
LysThr: 5.703 ± 0.635
LysVal: 2.852 ± 1.23
LysTrp: 0.951 ± 0.613
LysTyr: 1.901 ± 0.009
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 5.703 ± 0.635
LeuCys: 0.951 ± 0.604
LeuAsp: 4.753 ± 1.239
LeuGlu: 4.278 ± 0.932
LeuPhe: 3.802 ± 1.234
LeuGly: 4.753 ± 0.022
LeuHis: 0.0 ± 0.0
LeuIle: 5.703 ± 0.635
LeuLys: 5.703 ± 0.635
LeuLeu: 6.179 ± 0.885
LeuMet: 2.376 ± 0.902
LeuAsn: 7.129 ± 0.337
LeuPro: 7.605 ± 0.643
LeuGln: 1.901 ± 0.6
LeuArg: 4.753 ± 1.195
LeuSer: 7.605 ± 1.791
LeuThr: 5.703 ± 1.191
LeuVal: 5.228 ± 0.328
LeuTrp: 0.0 ± 0.0
LeuTyr: 4.278 ± 0.893
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.426 ± 0.311
MetCys: 0.0 ± 0.0
MetAsp: 1.901 ± 0.009
MetGlu: 1.426 ± 0.311
MetPhe: 0.951 ± 0.613
MetGly: 1.426 ± 0.311
MetHis: 0.0 ± 0.0
MetIle: 0.951 ± 0.004
MetLys: 1.426 ± 0.311
MetLeu: 2.376 ± 0.315
MetMet: 0.0 ± 0.0
MetAsn: 1.901 ± 0.6
MetPro: 1.901 ± 0.6
MetGln: 1.901 ± 0.6
MetArg: 1.426 ± 0.906
MetSer: 2.852 ± 1.23
MetThr: 1.426 ± 0.298
MetVal: 1.426 ± 0.298
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 4.278 ± 0.932
AsnCys: 0.0 ± 0.0
AsnAsp: 1.426 ± 0.311
AsnGlu: 2.376 ± 0.315
AsnPhe: 1.426 ± 0.298
AsnGly: 2.376 ± 0.902
AsnHis: 0.475 ± 0.306
AsnIle: 4.278 ± 0.324
AsnLys: 3.327 ± 0.928
AsnLeu: 5.228 ± 2.106
AsnMet: 1.426 ± 0.311
AsnAsn: 1.901 ± 0.6
AsnPro: 3.802 ± 0.591
AsnGln: 1.901 ± 0.009
AsnArg: 3.802 ± 0.591
AsnSer: 5.228 ± 0.28
AsnThr: 4.753 ± 0.63
AsnVal: 4.278 ± 1.502
AsnTrp: 1.901 ± 0.6
AsnTyr: 4.278 ± 1.502
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 6.654 ± 1.187
ProCys: 0.475 ± 0.302
ProAsp: 3.327 ± 0.898
ProGlu: 2.376 ± 0.902
ProPhe: 3.802 ± 0.626
ProGly: 3.802 ± 0.017
ProHis: 0.0 ± 0.0
ProIle: 3.327 ± 0.898
ProLys: 2.852 ± 1.23
ProLeu: 6.654 ± 0.639
ProMet: 1.426 ± 0.311
ProAsn: 5.228 ± 0.28
ProPro: 0.951 ± 0.604
ProGln: 3.802 ± 0.626
ProArg: 3.327 ± 0.319
ProSer: 5.703 ± 2.46
ProThr: 4.278 ± 2.11
ProVal: 3.802 ± 1.234
ProTrp: 0.0 ± 0.0
ProTyr: 2.376 ± 0.924
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.901 ± 0.6
GlnCys: 0.0 ± 0.0
GlnAsp: 3.327 ± 0.898
GlnGlu: 2.376 ± 0.924
GlnPhe: 1.426 ± 0.311
GlnGly: 2.852 ± 1.204
GlnHis: 0.951 ± 0.004
GlnIle: 3.327 ± 0.319
GlnLys: 2.852 ± 0.013
GlnLeu: 3.327 ± 0.319
GlnMet: 1.901 ± 0.6
GlnAsn: 1.426 ± 0.906
GlnPro: 3.327 ± 0.319
GlnGln: 3.327 ± 0.928
GlnArg: 1.901 ± 1.208
GlnSer: 0.0 ± 0.0
GlnThr: 2.376 ± 0.315
GlnVal: 1.426 ± 0.311
GlnTrp: 0.951 ± 0.004
GlnTyr: 2.376 ± 0.315
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 4.278 ± 0.285
ArgCys: 0.951 ± 0.004
ArgAsp: 0.951 ± 0.004
ArgGlu: 2.852 ± 0.596
ArgPhe: 1.901 ± 0.617
ArgGly: 3.327 ± 0.289
ArgHis: 0.475 ± 0.302
ArgIle: 1.901 ± 0.6
ArgLys: 4.753 ± 0.63
ArgLeu: 4.753 ± 1.195
ArgMet: 0.951 ± 0.24
ArgAsn: 4.278 ± 0.285
ArgPro: 2.376 ± 0.315
ArgGln: 0.475 ± 0.302
ArgArg: 0.951 ± 0.613
ArgSer: 5.228 ± 0.28
ArgThr: 3.327 ± 0.319
ArgVal: 1.426 ± 0.311
ArgTrp: 0.475 ± 0.306
ArgTyr: 1.901 ± 0.009
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 4.753 ± 1.804
SerCys: 1.426 ± 0.311
SerAsp: 2.376 ± 0.924
SerGlu: 4.278 ± 0.893
SerPhe: 1.901 ± 0.009
SerGly: 6.179 ± 1.493
SerHis: 0.951 ± 0.004
SerIle: 7.605 ± 0.643
SerLys: 5.228 ± 1.545
SerLeu: 6.179 ± 1.55
SerMet: 1.901 ± 0.789
SerAsn: 4.278 ± 1.541
SerPro: 4.278 ± 0.932
SerGln: 3.327 ± 0.319
SerArg: 3.802 ± 0.017
SerSer: 8.08 ± 0.95
SerThr: 5.228 ± 0.889
SerVal: 4.278 ± 0.324
SerTrp: 1.901 ± 0.617
SerTyr: 0.951 ± 0.004
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.278 ± 1.502
ThrCys: 0.0 ± 0.0
ThrAsp: 0.951 ± 0.004
ThrGlu: 2.852 ± 1.23
ThrPhe: 4.278 ± 0.324
ThrGly: 4.278 ± 0.893
ThrHis: 0.475 ± 0.306
ThrIle: 6.654 ± 1.795
ThrLys: 4.753 ± 0.022
ThrLeu: 3.802 ± 1.2
ThrMet: 1.901 ± 0.6
ThrAsn: 3.327 ± 0.289
ThrPro: 6.179 ± 0.276
ThrGln: 2.376 ± 0.293
ThrArg: 4.753 ± 0.63
ThrSer: 4.753 ± 1.804
ThrThr: 2.852 ± 0.013
ThrVal: 4.278 ± 0.285
ThrTrp: 0.951 ± 0.613
ThrTyr: 3.802 ± 1.2
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.753 ± 0.587
ValCys: 0.0 ± 0.0
ValAsp: 3.327 ± 0.289
ValGlu: 4.278 ± 0.324
ValPhe: 0.951 ± 0.004
ValGly: 4.278 ± 0.324
ValHis: 0.0 ± 0.0
ValIle: 3.802 ± 0.626
ValLys: 5.228 ± 1.545
ValLeu: 3.802 ± 0.591
ValMet: 0.951 ± 0.004
ValAsn: 3.327 ± 0.289
ValPro: 3.327 ± 0.319
ValGln: 1.426 ± 0.311
ValArg: 3.327 ± 0.289
ValSer: 5.703 ± 0.583
ValThr: 2.852 ± 1.204
ValVal: 3.802 ± 1.234
ValTrp: 0.0 ± 0.0
ValTyr: 0.951 ± 0.004
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.475 ± 0.306
TrpCys: 0.475 ± 0.302
TrpAsp: 1.901 ± 0.009
TrpGlu: 0.475 ± 0.306
TrpPhe: 0.475 ± 0.306
TrpGly: 1.426 ± 0.298
TrpHis: 0.0 ± 0.0
TrpIle: 0.0 ± 0.0
TrpLys: 0.0 ± 0.0
TrpLeu: 0.475 ± 0.306
TrpMet: 0.475 ± 0.306
TrpAsn: 0.0 ± 0.0
TrpPro: 0.0 ± 0.0
TrpGln: 0.475 ± 0.302
TrpArg: 0.475 ± 0.306
TrpSer: 0.951 ± 0.004
TrpThr: 1.426 ± 0.311
TrpVal: 0.475 ± 0.302
TrpTrp: 0.0 ± 0.0
TrpTyr: 1.426 ± 0.919
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 3.327 ± 0.289
TyrCys: 0.0 ± 0.0
TyrAsp: 2.852 ± 0.622
TyrGlu: 1.426 ± 0.906
TyrPhe: 1.426 ± 0.919
TyrGly: 2.852 ± 0.013
TyrHis: 0.951 ± 0.004
TyrIle: 1.426 ± 0.919
TyrLys: 2.376 ± 0.293
TyrLeu: 3.802 ± 0.017
TyrMet: 0.475 ± 0.306
TyrAsn: 4.278 ± 1.502
TyrPro: 1.426 ± 0.311
TyrGln: 2.376 ± 0.293
TyrArg: 1.426 ± 0.298
TyrSer: 2.852 ± 0.622
TyrThr: 1.901 ± 0.009
TyrVal: 1.901 ± 0.6
TyrTrp: 0.951 ± 0.613
TyrTyr: 1.426 ± 0.919
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2 proteins (2105 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
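A protein-level bootstrap of this kind can be sketched as follows. This is a minimal illustration, not the database's actual procedure: the helper names `pair_permille` and `bootstrap_error`, the fixed seed, and the use of the standard deviation of the resampled estimates as the reported error are all assumptions.

```python
import random
import statistics
from collections import Counter


def pair_permille(proteins, pair):
    """Frequency of one dipeptide in per mille, pooled over the given proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return the spread
    (population standard deviation) of the resulting frequency estimates."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_rounds):
        # Each round draws as many proteins as the proteome contains.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)
```

Because whole proteins are resampled rather than individual residues, a proteome with only two proteins yields very few distinct resamples, which is worth keeping in mind when reading the error values above.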

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski