Amino acid dipeptide frequency for Jamestown Canyon virus

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
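As a minimal sketch of what such a calculation involves, the Python snippet below counts overlapping residue pairs in a set of protein sequences and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter toy sequences, and the choice to count pairs within each protein only are assumptions made for illustration; the exact counting rules used for this page are not stated here.

```python
from collections import Counter

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for one-letter sequences."""
    counts = Counter()
    for seq in proteins:
        # Slide a window of width 2 over each sequence (overlapping pairs).
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome).
freqs = dipeptide_frequencies(["MKKLA", "AKK"])
print(freqs["KK"])
```

The two toy sequences contain six overlapping pairs in total, two of which are KK, so the printed value is one third expressed in per mille.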

There are 400 possible dipeptides (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
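The expectation is simple arithmetic; the following sketch spells it out and compares a few values from the table against it. The helper names and the hard threshold at the uniform expectation are assumptions for illustration, and the page does not state which of the two alphabet sizes the colouring is based on.

```python
def uniform_expectation_permille(alphabet_size):
    """Expected per-mille frequency of each dipeptide if all were equally likely."""
    return 1000.0 / (alphabet_size ** 2)

def classify(freq_permille, expected_permille):
    """Label a dipeptide relative to the uniform expectation (illustrative rule)."""
    if freq_permille > expected_permille:
        return "over"
    if freq_permille < expected_permille:
        return "under"
    return "as expected"

expected_20 = uniform_expectation_permille(20)  # 400 dipeptides -> 2.5 per mille
expected_21 = uniform_expectation_permille(21)  # 441 dipeptides, about 2.27 per mille

# Values taken from the table on this page (per mille).
observed = {"IleLeu": 8.926, "CysCys": 0.248, "AlaAsn": 2.48}
for name, value in observed.items():
    print(name, classify(value, expected_20), classify(value, expected_21))
```

AlaAsn shows why the alphabet size matters: at 2.48‰ it falls just below the 400-dipeptide expectation but above the 441-dipeptide one.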

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide: frequency ± error (‰), grouped by first residue; second residue in the order Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 1.984 ± 1.904
AlaCys: 1.488 ± 0.705
AlaAsp: 2.975 ± 0.447
AlaGlu: 2.975 ± 1.026
AlaPhe: 2.727 ± 0.545
AlaGly: 1.736 ± 0.765
AlaHis: 0.744 ± 0.197
AlaIle: 3.967 ± 1.104
AlaLys: 4.463 ± 1.399
AlaLeu: 3.967 ± 0.991
AlaMet: 2.232 ± 0.575
AlaAsn: 2.48 ± 0.668
AlaPro: 1.488 ± 0.395
AlaGln: 1.488 ± 0.565
AlaArg: 3.719 ± 1.515
AlaSer: 2.727 ± 1.214
AlaThr: 1.984 ± 0.408
AlaVal: 2.48 ± 1.357
AlaTrp: 0.248 ± 0.16
AlaTyr: 2.727 ± 0.908
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.992 ± 0.329
CysCys: 0.248 ± 0.16
CysAsp: 0.992 ± 0.579
CysGlu: 1.488 ± 1.04
CysPhe: 1.984 ± 0.837
CysGly: 1.984 ± 1.503
CysHis: 0.248 ± 0.232
CysIle: 1.984 ± 0.579
CysLys: 2.727 ± 0.908
CysLeu: 3.471 ± 1.251
CysMet: 0.248 ± 0.16
CysAsn: 2.48 ± 0.724
CysPro: 1.24 ± 0.486
CysGln: 1.488 ± 1.151
CysArg: 1.24 ± 1.161
CysSer: 0.992 ± 0.329
CysThr: 0.992 ± 0.929
CysVal: 2.232 ± 1.388
CysTrp: 0.248 ± 0.232
CysTyr: 0.496 ± 0.145
CysXaa: 0.0 ± 0.0

Asp
AspAla: 2.48 ± 1.067
AspCys: 1.24 ± 0.486
AspAsp: 3.719 ± 0.81
AspGlu: 4.463 ± 0.751
AspPhe: 4.463 ± 1.185
AspGly: 1.984 ± 0.579
AspHis: 0.744 ± 0.607
AspIle: 6.943 ± 2.084
AspLys: 4.711 ± 1.18
AspLeu: 5.207 ± 1.508
AspMet: 1.736 ± 0.518
AspAsn: 3.719 ± 0.987
AspPro: 2.727 ± 1.578
AspGln: 1.984 ± 0.579
AspArg: 1.736 ± 0.518
AspSer: 1.488 ± 0.434
AspThr: 0.992 ± 0.29
AspVal: 3.471 ± 0.859
AspTrp: 0.992 ± 0.579
AspTyr: 4.215 ± 1.081
AspXaa: 0.0 ± 0.0

Glu
GluAla: 3.223 ± 0.907
GluCys: 0.744 ± 0.353
GluAsp: 2.48 ± 0.724
GluGlu: 2.48 ± 0.724
GluPhe: 2.48 ± 0.955
GluGly: 2.975 ± 0.743
GluHis: 1.984 ± 0.627
GluIle: 6.695 ± 1.189
GluLys: 3.719 ± 0.69
GluLeu: 4.959 ± 0.678
GluMet: 2.727 ± 0.474
GluAsn: 3.471 ± 0.885
GluPro: 1.736 ± 0.518
GluGln: 2.727 ± 0.776
GluArg: 3.719 ± 0.69
GluSer: 4.463 ± 1.95
GluThr: 3.967 ± 0.889
GluVal: 3.223 ± 1.323
GluTrp: 0.744 ± 0.65
GluTyr: 2.727 ± 1.108
GluXaa: 0.0 ± 0.0

Phe
PheAla: 1.736 ± 0.625
PheCys: 1.24 ± 0.307
PheAsp: 1.736 ± 0.434
PheGlu: 3.471 ± 0.885
PhePhe: 1.984 ± 1.18
PheGly: 2.48 ± 1.074
PheHis: 0.496 ± 0.465
PheIle: 3.223 ± 0.907
PheLys: 4.959 ± 0.877
PheLeu: 5.207 ± 1.727
PheMet: 0.496 ± 0.657
PheAsn: 2.975 ± 1.263
PhePro: 1.488 ± 1.202
PheGln: 2.232 ± 1.04
PheArg: 2.727 ± 1.108
PheSer: 3.967 ± 1.891
PheThr: 2.727 ± 0.474
PheVal: 1.984 ± 0.464
PheTrp: 0.496 ± 0.319
PheTyr: 2.232 ± 1.06
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.232 ± 0.803
GlyCys: 2.48 ± 0.724
GlyAsp: 3.223 ± 0.444
GlyGlu: 2.727 ± 1.087
GlyPhe: 1.488 ± 2.006
GlyGly: 0.992 ± 0.549
GlyHis: 1.24 ± 0.307
GlyIle: 2.727 ± 1.135
GlyLys: 1.984 ± 1.439
GlyLeu: 4.959 ± 0.69
GlyMet: 0.744 ± 0.65
GlyAsn: 3.719 ± 1.43
GlyPro: 1.984 ± 0.947
GlyGln: 1.24 ± 0.486
GlyArg: 1.736 ± 1.377
GlySer: 3.223 ± 1.024
GlyThr: 3.967 ± 2.635
GlyVal: 1.24 ± 0.753
GlyTrp: 0.992 ± 0.579
GlyTyr: 1.488 ± 0.619
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.496 ± 0.465
HisCys: 1.24 ± 0.486
HisAsp: 1.736 ± 0.625
HisGlu: 1.736 ± 0.625
HisPhe: 2.48 ± 1.074
HisGly: 1.488 ± 1.3
HisHis: 0.496 ± 0.319
HisIle: 1.488 ± 0.434
HisLys: 2.727 ± 0.908
HisLeu: 1.488 ± 1.083
HisMet: 0.744 ± 0.197
HisAsn: 1.24 ± 0.798
HisPro: 0.744 ± 1.201
HisGln: 0.744 ± 0.197
HisArg: 1.24 ± 1.06
HisSer: 1.488 ± 0.632
HisThr: 1.24 ± 0.307
HisVal: 0.248 ± 0.16
HisTrp: 0.744 ± 0.607
HisTyr: 0.496 ± 0.145
HisXaa: 0.0 ± 0.0

Ile
IleAla: 5.455 ± 0.838
IleCys: 2.232 ± 1.058
IleAsp: 4.711 ± 1.185
IleGlu: 4.959 ± 1.424
IlePhe: 3.967 ± 1.358
IleGly: 3.967 ± 1.358
IleHis: 2.975 ± 0.962
IleIle: 6.943 ± 1.897
IleLys: 6.695 ± 1.59
IleLeu: 8.926 ± 1.513
IleMet: 3.223 ± 1.546
IleAsn: 4.215 ± 0.827
IlePro: 1.984 ± 0.496
IleGln: 1.488 ± 1.151
IleArg: 4.463 ± 0.607
IleSer: 7.439 ± 2.346
IleThr: 5.703 ± 1.414
IleVal: 3.471 ± 0.823
IleTrp: 0.992 ± 0.329
IleTyr: 3.967 ± 0.816
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.215 ± 2.193
LysCys: 2.232 ± 1.058
LysAsp: 6.199 ± 0.958
LysGlu: 5.703 ± 1.829
LysPhe: 3.719 ± 1.61
LysGly: 2.727 ± 0.845
LysHis: 1.488 ± 0.837
LysIle: 4.463 ± 1.363
LysLys: 5.207 ± 0.832
LysLeu: 6.447 ± 3.349
LysMet: 2.727 ± 1.108
LysAsn: 4.711 ± 1.185
LysPro: 2.727 ± 1.14
LysGln: 1.736 ± 0.518
LysArg: 1.736 ± 0.788
LysSer: 5.951 ± 1.143
LysThr: 5.207 ± 1.073
LysVal: 4.215 ± 1.049
LysTrp: 1.488 ± 1.335
LysTyr: 3.719 ± 1.763
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.447 ± 0.968
LeuCys: 2.232 ± 1.388
LeuAsp: 5.207 ± 2.063
LeuGlu: 6.447 ± 2.1
LeuPhe: 3.967 ± 1.891
LeuGly: 2.48 ± 1.33
LeuHis: 2.48 ± 0.724
LeuIle: 6.943 ± 1.746
LeuLys: 6.447 ± 1.309
LeuLeu: 7.191 ± 2.929
LeuMet: 2.975 ± 1.046
LeuAsn: 4.463 ± 0.894
LeuPro: 3.223 ± 1.055
LeuGln: 2.232 ± 0.575
LeuArg: 3.719 ± 0.474
LeuSer: 7.687 ± 4.103
LeuThr: 7.191 ± 1.243
LeuVal: 5.703 ± 1.889
LeuTrp: 0.992 ± 2.493
LeuTyr: 2.727 ± 1.421
LeuXaa: 0.0 ± 0.0

Met
MetAla: 2.48 ± 1.004
MetCys: 1.24 ± 0.307
MetAsp: 2.975 ± 1.265
MetGlu: 1.488 ± 0.632
MetPhe: 0.248 ± 0.701
MetGly: 0.992 ± 1.531
MetHis: 0.0 ± 0.0
MetIle: 1.736 ± 0.518
MetLys: 2.727 ± 0.716
MetLeu: 1.736 ± 1.118
MetMet: 1.736 ± 1.377
MetAsn: 0.744 ± 0.479
MetPro: 1.736 ± 0.438
MetGln: 1.488 ± 2.46
MetArg: 0.744 ± 0.197
MetSer: 1.736 ± 1.161
MetThr: 2.232 ± 0.804
MetVal: 1.736 ± 0.438
MetTrp: 0.0 ± 0.0
MetTyr: 0.992 ± 0.29
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.223 ± 0.62
AsnCys: 1.488 ± 0.705
AsnAsp: 3.471 ± 0.861
AsnGlu: 2.727 ± 1.421
AsnPhe: 2.48 ± 0.955
AsnGly: 2.48 ± 1.83
AsnHis: 0.992 ± 0.644
AsnIle: 5.703 ± 1.414
AsnLys: 3.223 ± 1.132
AsnLeu: 6.447 ± 2.03
AsnMet: 1.24 ± 0.798
AsnAsn: 3.471 ± 2.478
AsnPro: 2.232 ± 1.07
AsnGln: 1.984 ± 0.496
AsnArg: 2.232 ± 1.572
AsnSer: 3.719 ± 1.173
AsnThr: 3.223 ± 0.801
AsnVal: 1.736 ± 0.931
AsnTrp: 0.992 ± 0.329
AsnTyr: 2.975 ± 0.743
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.48 ± 0.615
ProCys: 0.248 ± 0.232
ProAsp: 2.727 ± 0.474
ProGlu: 2.232 ± 1.201
ProPhe: 1.488 ± 0.434
ProGly: 2.727 ± 1.087
ProHis: 0.496 ± 0.145
ProIle: 4.959 ± 0.805
ProLys: 1.488 ± 1.413
ProLeu: 1.984 ± 1.531
ProMet: 0.992 ± 0.382
ProAsn: 0.744 ± 0.353
ProPro: 0.744 ± 0.479
ProGln: 1.488 ± 1.407
ProArg: 1.24 ± 0.809
ProSer: 2.232 ± 0.99
ProThr: 1.984 ± 0.496
ProVal: 0.744 ± 0.353
ProTrp: 0.496 ± 0.319
ProTyr: 0.496 ± 0.319
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.24 ± 0.307
GlnCys: 0.744 ± 0.197
GlnAsp: 2.48 ± 0.979
GlnGlu: 1.24 ± 0.307
GlnPhe: 1.736 ± 0.518
GlnGly: 1.24 ± 1.137
GlnHis: 1.488 ± 0.705
GlnIle: 2.727 ± 0.689
GlnLys: 4.959 ± 0.784
GlnLeu: 1.984 ± 1.288
GlnMet: 0.744 ± 2.535
GlnAsn: 0.992 ± 0.29
GlnPro: 0.496 ± 1.247
GlnGln: 0.992 ± 1.234
GlnArg: 2.48 ± 1.243
GlnSer: 1.984 ± 0.837
GlnThr: 2.48 ± 0.724
GlnVal: 1.984 ± 1.365
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.48 ± 0.403
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.736 ± 0.438
ArgCys: 1.736 ± 0.931
ArgAsp: 3.719 ± 1.173
ArgGlu: 2.975 ± 0.912
ArgPhe: 2.232 ± 0.592
ArgGly: 0.992 ± 0.549
ArgHis: 2.232 ± 0.804
ArgIle: 4.215 ± 1.31
ArgLys: 4.463 ± 0.798
ArgLeu: 2.975 ± 0.912
ArgMet: 0.992 ± 0.543
ArgAsn: 3.223 ± 1.771
ArgPro: 0.744 ± 0.197
ArgGln: 1.736 ± 0.434
ArgArg: 1.24 ± 0.798
ArgSer: 2.727 ± 1.472
ArgThr: 2.727 ± 0.474
ArgVal: 1.488 ± 0.461
ArgTrp: 0.744 ± 1.572
ArgTyr: 1.736 ± 0.499
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 2.48 ± 0.668
SerCys: 2.727 ± 1.189
SerAsp: 3.471 ± 0.558
SerGlu: 3.471 ± 1.036
SerPhe: 1.984 ± 0.735
SerGly: 3.719 ± 2.3
SerHis: 1.984 ± 1.319
SerIle: 7.191 ± 2.24
SerLys: 5.207 ± 1.296
SerLeu: 7.439 ± 1.977
SerMet: 1.488 ± 0.441
SerAsn: 3.223 ± 1.056
SerPro: 2.48 ± 0.615
SerGln: 1.736 ± 0.499
SerArg: 3.719 ± 1.433
SerSer: 4.215 ± 2.031
SerThr: 4.959 ± 1.007
SerVal: 3.967 ± 1.246
SerTrp: 0.248 ± 0.16
SerTyr: 2.48 ± 0.724
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 2.232 ± 1.147
ThrCys: 1.24 ± 0.809
ThrAsp: 3.223 ± 0.391
ThrGlu: 3.719 ± 0.716
ThrPhe: 3.719 ± 0.474
ThrGly: 4.463 ± 1.606
ThrHis: 1.24 ± 0.307
ThrIle: 5.207 ± 1.965
ThrLys: 3.471 ± 1.576
ThrLeu: 4.215 ± 0.519
ThrMet: 0.744 ± 1.23
ThrAsn: 3.967 ± 1.012
ThrPro: 2.48 ± 0.393
ThrGln: 1.984 ± 1.205
ThrArg: 2.727 ± 1.108
ThrSer: 4.463 ± 1.608
ThrThr: 4.215 ± 2.229
ThrVal: 4.463 ± 1.711
ThrTrp: 1.24 ± 0.809
ThrTyr: 3.967 ± 1.012
ThrXaa: 0.0 ± 0.0

Val
ValAla: 1.736 ± 1.147
ValCys: 1.736 ± 1.216
ValAsp: 1.984 ± 0.496
ValGlu: 3.471 ± 1.238
ValPhe: 2.232 ± 0.542
ValGly: 2.232 ± 0.575
ValHis: 1.488 ± 0.632
ValIle: 4.215 ± 2.547
ValLys: 3.719 ± 2.386
ValLeu: 4.959 ± 0.703
ValMet: 0.992 ± 0.329
ValAsn: 2.232 ± 1.372
ValPro: 0.248 ± 0.16
ValGln: 2.48 ± 1.107
ValArg: 1.24 ± 0.753
ValSer: 4.463 ± 0.573
ValThr: 3.471 ± 0.955
ValVal: 1.984 ± 0.496
ValTrp: 0.496 ± 0.145
ValTyr: 2.727 ± 0.908
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.248 ± 0.232
TrpCys: 0.248 ± 0.16
TrpAsp: 0.496 ± 0.145
TrpGlu: 0.744 ± 0.197
TrpPhe: 0.744 ± 0.197
TrpGly: 0.744 ± 0.769
TrpHis: 0.496 ± 1.262
TrpIle: 0.744 ± 0.353
TrpLys: 0.248 ± 1.283
TrpLeu: 1.736 ± 1.077
TrpMet: 0.496 ± 0.698
TrpAsn: 1.488 ± 0.461
TrpPro: 0.0 ± 0.0
TrpGln: 1.24 ± 0.533
TrpArg: 0.496 ± 0.145
TrpSer: 0.992 ± 0.639
TrpThr: 0.0 ± 0.0
TrpVal: 0.744 ± 0.607
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.248 ± 0.16
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.488 ± 0.705
TyrCys: 0.992 ± 0.929
TyrAsp: 1.488 ± 0.565
TyrGlu: 2.727 ± 0.845
TyrPhe: 1.736 ± 0.788
TyrGly: 1.984 ± 0.627
TyrHis: 0.992 ± 0.579
TyrIle: 5.703 ± 1.575
TyrLys: 3.471 ± 1.251
TyrLeu: 5.455 ± 1.314
TyrMet: 1.24 ± 0.478
TyrAsn: 2.48 ± 0.712
TyrPro: 1.488 ± 0.461
TyrGln: 1.984 ± 0.579
TyrArg: 2.48 ± 0.65
TyrSer: 2.48 ± 0.615
TyrThr: 3.719 ± 1.195
TyrVal: 0.992 ± 0.29
TyrTrp: 0.0 ± 0.0
TyrTyr: 1.488 ± 0.705
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 4 proteins (4034 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
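One way to read "bootstrapping at the protein level" is sketched below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of those values is taken as the error. This is an assumption about the procedure, not a description of the Proteome-pI code; the function names, the toy sequences, and the use of the standard deviation as the error measure are all illustrative.

```python
import random
import statistics

def pair_permille(proteins, pair):
    """Per-mille frequency of one dipeptide across a list of sequences."""
    hits = total = 0
    for seq in proteins:
        for i in range(len(seq) - 1):
            total += 1
            hits += seq[i:i + 2] == pair  # True counts as 1
    return 1000.0 * hits / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the frequency over protein-level resamples."""
    rng = random.Random(seed)
    values = []
    for _ in range(replicates):
        # Draw as many proteins as the proteome has, with replacement.
        sample = [rng.choice(proteins) for _ in proteins]
        values.append(pair_permille(sample, pair))
    return statistics.pstdev(values)

# Hypothetical toy proteome of four short one-letter sequences.
toy = ["MKKLAIL", "ILSTKK", "MAILKD", "GGSIL"]
print(pair_permille(toy, "IL"), bootstrap_error(toy, "IL"))
```

Fixing the seed makes the estimate reproducible; a dipeptide that never occurs in any protein gets both a frequency and an error of zero under this scheme.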

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski