Amino acid dipeptide frequency for Acheta domesticus volvovirus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each of the 400 standard dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
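The calculation described above can be sketched in a few lines of Python. This is only an illustration, not the Proteome-pI implementation: the function name `dipeptide_permille`, the use of one-letter codes (with `X` standing for Xaa), and the choice to count overlapping pairs within each protein separately are all assumptions, since the page does not say how protein boundaries are handled.

```python
from collections import Counter
from itertools import product

STANDARD = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard amino acids, one-letter codes
ALPHABET = STANDARD + "X"           # X plays the role of Xaa (non-standard or unknown)

# Expected frequency if all 400 standard dipeptides were equally likely.
EXPECTED_PERMILLE = 1000.0 / 400    # 2.5 per mille, i.e. 0.25%


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein only,
    and any residue outside the 20 standard ones is mapped to X.
    """
    counts = Counter()
    for seq in proteins:
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(ALPHABET, repeat=2)
    }


if __name__ == "__main__":
    freqs = dipeptide_permille(["MKRRA", "RRYS"])
    for pair, value in sorted(freqs.items(), key=lambda kv: -kv[1])[:3]:
        label = "over" if value > EXPECTED_PERMILLE else "under"
        print(f"{pair}: {value:.3f} per mille ({label}represented)")
```

Comparing each value with `EXPECTED_PERMILLE` mirrors the red/blue marking: values above 2.5‰ are overrepresented relative to the uniform expectation, values below it are underrepresented.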

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 3.102 ± 2.17
AlaCys: 0.0 ± 0.0
AlaAsp: 1.034 ± 1.156
AlaGlu: 1.034 ± 1.023
AlaPhe: 7.239 ± 1.711
AlaGly: 2.068 ± 1.281
AlaHis: 1.034 ± 0.86
AlaIle: 7.239 ± 3.992
AlaLys: 0.0 ± 0.0
AlaLeu: 2.068 ± 0.809
AlaMet: 0.0 ± 0.0
AlaAsn: 3.102 ± 1.323
AlaPro: 5.171 ± 1.513
AlaGln: 1.034 ± 0.86
AlaArg: 4.137 ± 1.199
AlaSer: 4.137 ± 2.894
AlaThr: 3.102 ± 2.278
AlaVal: 4.137 ± 1.904
AlaTrp: 0.0 ± 0.0
AlaTyr: 2.068 ± 0.809
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.0 ± 0.0
CysCys: 0.0 ± 0.0
CysAsp: 0.0 ± 0.0
CysGlu: 1.034 ± 1.023
CysPhe: 2.068 ± 1.719
CysGly: 0.0 ± 0.0
CysHis: 0.0 ± 0.0
CysIle: 1.034 ± 0.86
CysLys: 0.0 ± 0.0
CysLeu: 0.0 ± 0.0
CysMet: 0.0 ± 0.0
CysAsn: 1.034 ± 0.86
CysPro: 3.102 ± 0.641
CysGln: 2.068 ± 0.809
CysArg: 1.034 ± 0.86
CysSer: 1.034 ± 1.023
CysThr: 1.034 ± 0.86
CysVal: 0.0 ± 0.0
CysTrp: 0.0 ± 0.0
CysTyr: 1.034 ± 0.86
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.137 ± 1.194
AspCys: 0.0 ± 0.0
AspAsp: 4.137 ± 3.438
AspGlu: 1.034 ± 0.86
AspPhe: 2.068 ± 1.091
AspGly: 6.205 ± 1.052
AspHis: 1.034 ± 0.86
AspIle: 2.068 ± 0.935
AspLys: 2.068 ± 0.809
AspLeu: 2.068 ± 0.809
AspMet: 0.0 ± 0.0
AspAsn: 3.102 ± 1.504
AspPro: 3.102 ± 2.188
AspGln: 4.137 ± 1.194
AspArg: 3.102 ± 1.532
AspSer: 4.137 ± 2.802
AspThr: 3.102 ± 1.226
AspVal: 1.034 ± 1.156
AspTrp: 1.034 ± 0.723
AspTyr: 1.034 ± 0.86
AspXaa: 0.0 ± 0.0
Glu
GluAla: 0.0 ± 0.0
GluCys: 0.0 ± 0.0
GluAsp: 3.102 ± 1.388
GluGlu: 0.0 ± 0.0
GluPhe: 2.068 ± 0.809
GluGly: 1.034 ± 0.723
GluHis: 1.034 ± 1.023
GluIle: 3.102 ± 1.323
GluLys: 4.137 ± 2.431
GluLeu: 3.102 ± 1.932
GluMet: 2.068 ± 0.805
GluAsn: 4.137 ± 1.503
GluPro: 3.102 ± 1.272
GluGln: 3.102 ± 2.17
GluArg: 3.102 ± 1.388
GluSer: 4.137 ± 2.585
GluThr: 3.102 ± 1.932
GluVal: 2.068 ± 1.281
GluTrp: 1.034 ± 0.723
GluTyr: 1.034 ± 0.86
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.068 ± 0.809
PheCys: 2.068 ± 1.719
PheAsp: 2.068 ± 1.281
PheGlu: 1.034 ± 1.156
PhePhe: 3.102 ± 1.008
PheGly: 0.0 ± 0.0
PheHis: 0.0 ± 0.0
PheIle: 1.034 ± 0.723
PheLys: 2.068 ± 1.447
PheLeu: 2.068 ± 1.148
PheMet: 0.0 ± 0.0
PheAsn: 4.137 ± 1.48
PhePro: 4.137 ± 2.192
PheGln: 0.0 ± 0.0
PheArg: 4.137 ± 2.313
PheSer: 6.205 ± 1.339
PheThr: 5.171 ± 1.979
PheVal: 3.102 ± 1.272
PheTrp: 2.068 ± 1.719
PheTyr: 0.0 ± 0.0
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.137 ± 2.103
GlyCys: 1.034 ± 1.023
GlyAsp: 3.102 ± 1.677
GlyGlu: 2.068 ± 1.281
GlyPhe: 3.102 ± 2.17
GlyGly: 5.171 ± 2.61
GlyHis: 0.0 ± 0.0
GlyIle: 0.0 ± 0.0
GlyLys: 4.137 ± 0.778
GlyLeu: 1.034 ± 0.86
GlyMet: 0.0 ± 0.0
GlyAsn: 3.102 ± 1.272
GlyPro: 4.137 ± 2.012
GlyGln: 2.068 ± 0.935
GlyArg: 5.171 ± 2.578
GlySer: 7.239 ± 1.184
GlyThr: 5.171 ± 1.979
GlyVal: 4.137 ± 2.012
GlyTrp: 2.068 ± 2.312
GlyTyr: 4.137 ± 0.778
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.0 ± 0.0
HisCys: 0.0 ± 0.0
HisAsp: 0.0 ± 0.0
HisGlu: 1.034 ± 0.86
HisPhe: 0.0 ± 0.0
HisGly: 1.034 ± 0.86
HisHis: 0.0 ± 0.0
HisIle: 1.034 ± 1.023
HisLys: 1.034 ± 1.023
HisLeu: 2.068 ± 1.719
HisMet: 0.0 ± 0.0
HisAsn: 1.034 ± 0.723
HisPro: 0.0 ± 0.0
HisGln: 0.0 ± 0.0
HisArg: 2.068 ± 1.148
HisSer: 4.137 ± 1.48
HisThr: 1.034 ± 1.023
HisVal: 1.034 ± 0.86
HisTrp: 2.068 ± 1.719
HisTyr: 0.0 ± 0.0
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.102 ± 1.272
IleCys: 1.034 ± 0.86
IleAsp: 3.102 ± 2.17
IleGlu: 1.034 ± 0.86
IlePhe: 1.034 ± 0.86
IleGly: 4.137 ± 2.585
IleHis: 1.034 ± 0.86
IleIle: 2.068 ± 1.447
IleLys: 5.171 ± 3.149
IleLeu: 2.068 ± 1.148
IleMet: 0.0 ± 0.0
IleAsn: 1.034 ± 1.023
IlePro: 3.102 ± 1.272
IleGln: 1.034 ± 0.723
IleArg: 3.102 ± 2.17
IleSer: 4.137 ± 1.194
IleThr: 7.239 ± 2.932
IleVal: 1.034 ± 1.156
IleTrp: 3.102 ± 1.226
IleTyr: 0.0 ± 0.0
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.171 ± 1.089
LysCys: 4.137 ± 2.182
LysAsp: 0.0 ± 0.0
LysGlu: 1.034 ± 0.723
LysPhe: 1.034 ± 0.86
LysGly: 3.102 ± 1.504
LysHis: 1.034 ± 0.86
LysIle: 1.034 ± 0.86
LysLys: 2.068 ± 1.489
LysLeu: 2.068 ± 2.045
LysMet: 0.0 ± 0.0
LysAsn: 4.137 ± 1.871
LysPro: 2.068 ± 1.719
LysGln: 2.068 ± 1.091
LysArg: 8.273 ± 4.29
LysSer: 5.171 ± 1.042
LysThr: 3.102 ± 1.226
LysVal: 1.034 ± 0.723
LysTrp: 0.0 ± 0.0
LysTyr: 4.137 ± 1.194
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 2.068 ± 1.447
LeuCys: 0.0 ± 0.0
LeuAsp: 3.102 ± 1.272
LeuGlu: 4.137 ± 2.313
LeuPhe: 3.102 ± 1.504
LeuGly: 5.171 ± 1.982
LeuHis: 3.102 ± 1.504
LeuIle: 2.068 ± 1.148
LeuLys: 2.068 ± 1.719
LeuLeu: 7.239 ± 2.787
LeuMet: 3.102 ± 1.38
LeuAsn: 3.102 ± 1.822
LeuPro: 3.102 ± 1.822
LeuGln: 2.068 ± 1.719
LeuArg: 2.068 ± 1.719
LeuSer: 5.171 ± 2.578
LeuThr: 3.102 ± 1.323
LeuVal: 3.102 ± 2.17
LeuTrp: 2.068 ± 1.091
LeuTyr: 2.068 ± 1.281
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.034 ± 0.723
MetCys: 0.0 ± 0.0
MetAsp: 1.034 ± 0.86
MetGlu: 0.0 ± 0.0
MetPhe: 0.0 ± 0.0
MetGly: 0.0 ± 0.0
MetHis: 1.034 ± 0.723
MetIle: 1.034 ± 1.023
MetLys: 0.0 ± 0.0
MetLeu: 1.034 ± 0.86
MetMet: 0.0 ± 0.0
MetAsn: 1.034 ± 0.86
MetPro: 2.068 ± 0.935
MetGln: 0.0 ± 0.0
MetArg: 2.068 ± 1.091
MetSer: 2.068 ± 1.148
MetThr: 1.034 ± 1.023
MetVal: 1.034 ± 0.723
MetTrp: 0.0 ± 0.0
MetTyr: 0.0 ± 0.0
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 1.034 ± 0.723
AsnCys: 0.0 ± 0.0
AsnAsp: 5.171 ± 2.175
AsnGlu: 3.102 ± 1.323
AsnPhe: 1.034 ± 0.86
AsnGly: 1.034 ± 0.86
AsnHis: 1.034 ± 1.023
AsnIle: 1.034 ± 0.723
AsnLys: 3.102 ± 0.641
AsnLeu: 9.307 ± 2.819
AsnMet: 1.034 ± 0.86
AsnAsn: 2.068 ± 1.447
AsnPro: 4.137 ± 2.585
AsnGln: 4.137 ± 1.871
AsnArg: 6.205 ± 2.775
AsnSer: 3.102 ± 1.822
AsnThr: 4.137 ± 2.893
AsnVal: 1.034 ± 0.723
AsnTrp: 1.034 ± 0.86
AsnTyr: 3.102 ± 1.822
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.171 ± 1.354
ProCys: 1.034 ± 0.86
ProAsp: 6.205 ± 1.692
ProGlu: 4.137 ± 1.199
ProPhe: 3.102 ± 3.468
ProGly: 2.068 ± 0.809
ProHis: 0.0 ± 0.0
ProIle: 5.171 ± 3.123
ProLys: 4.137 ± 3.202
ProLeu: 3.102 ± 0.641
ProMet: 2.068 ± 0.935
ProAsn: 2.068 ± 0.809
ProPro: 4.137 ± 1.199
ProGln: 2.068 ± 0.809
ProArg: 7.239 ± 2.073
ProSer: 3.102 ± 2.461
ProThr: 4.137 ± 1.871
ProVal: 5.171 ± 1.704
ProTrp: 1.034 ± 0.723
ProTyr: 1.034 ± 0.723
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.068 ± 1.447
GlnCys: 0.0 ± 0.0
GlnAsp: 2.068 ± 1.091
GlnGlu: 5.171 ± 2.49
GlnPhe: 1.034 ± 0.86
GlnGly: 3.102 ± 1.532
GlnHis: 2.068 ± 0.935
GlnIle: 3.102 ± 1.504
GlnLys: 1.034 ± 0.86
GlnLeu: 2.068 ± 1.719
GlnMet: 0.0 ± 0.0
GlnAsn: 1.034 ± 0.723
GlnPro: 1.034 ± 0.723
GlnGln: 2.068 ± 0.935
GlnArg: 5.171 ± 0.577
GlnSer: 4.137 ± 2.893
GlnThr: 5.171 ± 1.354
GlnVal: 2.068 ± 1.281
GlnTrp: 3.102 ± 1.532
GlnTyr: 0.0 ± 0.0
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.0 ± 0.0
ArgCys: 2.068 ± 0.809
ArgAsp: 2.068 ± 1.489
ArgGlu: 5.171 ± 1.979
ArgPhe: 4.137 ± 2.313
ArgGly: 5.171 ± 2.145
ArgHis: 3.102 ± 1.85
ArgIle: 1.034 ± 0.86
ArgLys: 5.171 ± 1.513
ArgLeu: 7.239 ± 3.992
ArgMet: 1.034 ± 1.023
ArgAsn: 3.102 ± 1.008
ArgPro: 3.102 ± 1.532
ArgGln: 5.171 ± 2.63
ArgArg: 24.819 ± 9.623
ArgSer: 6.205 ± 1.052
ArgThr: 2.068 ± 1.281
ArgVal: 1.034 ± 0.723
ArgTrp: 3.102 ± 1.008
ArgTyr: 10.341 ± 1.154
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 8.273 ± 1.53
SerCys: 0.0 ± 0.0
SerAsp: 3.102 ± 1.323
SerGlu: 6.205 ± 2.139
SerPhe: 7.239 ± 2.781
SerGly: 6.205 ± 2.806
SerHis: 0.0 ± 0.0
SerIle: 6.205 ± 1.902
SerLys: 3.102 ± 1.932
SerLeu: 6.205 ± 1.149
SerMet: 1.034 ± 1.023
SerAsn: 8.273 ± 4.225
SerPro: 4.137 ± 1.356
SerGln: 4.137 ± 2.802
SerArg: 5.171 ± 1.704
SerSer: 7.239 ± 1.482
SerThr: 12.41 ± 1.61
SerVal: 4.137 ± 1.917
SerTrp: 2.068 ± 0.809
SerTyr: 2.068 ± 1.091
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.137 ± 3.546
ThrCys: 0.0 ± 0.0
ThrAsp: 6.205 ± 1.508
ThrGlu: 4.137 ± 2.074
ThrPhe: 1.034 ± 1.156
ThrGly: 6.205 ± 2.462
ThrHis: 0.0 ± 0.0
ThrIle: 3.102 ± 0.641
ThrLys: 4.137 ± 0.824
ThrLeu: 2.068 ± 0.935
ThrMet: 1.034 ± 0.86
ThrAsn: 6.205 ± 2.961
ThrPro: 7.239 ± 3.228
ThrGln: 3.102 ± 1.388
ThrArg: 3.102 ± 1.272
ThrSer: 11.375 ± 5.985
ThrThr: 4.137 ± 4.091
ThrVal: 3.102 ± 1.532
ThrTrp: 0.0 ± 0.0
ThrTyr: 4.137 ± 1.871
ThrXaa: 0.0 ± 0.0
Val
ValAla: 5.171 ± 2.747
ValCys: 1.034 ± 0.86
ValAsp: 1.034 ± 0.86
ValGlu: 3.102 ± 2.17
ValPhe: 1.034 ± 1.156
ValGly: 3.102 ± 1.532
ValHis: 0.0 ± 0.0
ValIle: 0.0 ± 0.0
ValLys: 7.239 ± 2.767
ValLeu: 3.102 ± 1.272
ValMet: 1.034 ± 0.648
ValAsn: 1.034 ± 0.723
ValPro: 5.171 ± 1.982
ValGln: 0.0 ± 0.0
ValArg: 2.068 ± 0.809
ValSer: 4.137 ± 1.904
ValThr: 2.068 ± 2.045
ValVal: 2.068 ± 1.447
ValTrp: 0.0 ± 0.0
ValTyr: 1.034 ± 0.723
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.034 ± 0.723
TrpCys: 1.034 ± 0.723
TrpAsp: 0.0 ± 0.0
TrpGlu: 0.0 ± 0.0
TrpPhe: 0.0 ± 0.0
TrpGly: 4.137 ± 1.199
TrpHis: 1.034 ± 0.86
TrpIle: 3.102 ± 2.579
TrpLys: 0.0 ± 0.0
TrpLeu: 1.034 ± 0.86
TrpMet: 1.034 ± 1.168
TrpAsn: 0.0 ± 0.0
TrpPro: 1.034 ± 0.723
TrpGln: 1.034 ± 0.86
TrpArg: 2.068 ± 1.148
TrpSer: 1.034 ± 1.023
TrpThr: 3.102 ± 2.461
TrpVal: 1.034 ± 0.723
TrpTrp: 1.034 ± 0.723
TrpTyr: 2.068 ± 1.447
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 0.0 ± 0.0
TyrCys: 1.034 ± 0.86
TyrAsp: 2.068 ± 0.809
TyrGlu: 0.0 ± 0.0
TyrPhe: 1.034 ± 1.023
TyrGly: 2.068 ± 0.935
TyrHis: 1.034 ± 1.156
TyrIle: 3.102 ± 2.579
TyrLys: 0.0 ± 0.0
TyrLeu: 2.068 ± 1.447
TyrMet: 0.0 ± 0.0
TyrAsn: 3.102 ± 1.008
TyrPro: 3.102 ± 0.641
TyrGln: 6.205 ± 1.692
TyrArg: 1.034 ± 1.156
TyrSer: 9.307 ± 1.995
TyrThr: 1.034 ± 1.023
TyrVal: 2.068 ± 2.045
TyrTrp: 1.034 ± 0.723
TyrTyr: 2.068 ± 1.719
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (968 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
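One way to read the note above is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The helper names `pair_permille` and `bootstrap_error`, the fixed random seed, and the use of the standard deviation as the error measure are assumptions made for illustration; the page does not specify them.

```python
import random
import statistics


def pair_permille(proteins, pair):
    """Frequency (per mille) of one dipeptide among all overlapping pairs."""
    hits = sum(
        1
        for seq in proteins
        for i in range(len(seq) - 1)
        if seq[i:i + 2] == pair
    )
    total = sum(max(len(seq) - 1, 0) for seq in proteins)
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(proteins, pair, n_resamples=100, seed=0):
    """Protein-level bootstrap: resample whole proteins with replacement.

    Assumption: the error is taken as the standard deviation of the
    resampled frequency estimates.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        sample = [rng.choice(proteins) for _ in proteins]
        estimates.append(pair_permille(sample, pair))
    return statistics.pstdev(estimates)


if __name__ == "__main__":
    toy_proteome = ["MKRRA", "RRYSRR", "MSTSA", "GSTPR"]  # made-up sequences
    value = pair_permille(toy_proteome, "RR")
    error = bootstrap_error(toy_proteome, "RR")
    print(f"RR: {value:.3f} ± {error:.3f} per mille")
```

With only 4 proteins in this proteome, each resample can differ a lot from the original set, which is consistent with the large errors relative to the values in the table.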

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski