Amino acid dipepetide frequency for Chinook salmon bafinivirus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.679AlaAla: 5.679 ± 1.613
0.55AlaCys: 0.55 ± 0.072
2.656AlaAsp: 2.656 ± 0.664
2.29AlaGlu: 2.29 ± 0.321
3.023AlaPhe: 3.023 ± 0.408
2.473AlaGly: 2.473 ± 0.868
3.389AlaHis: 3.389 ± 0.979
4.58AlaIle: 4.58 ± 0.56
4.03AlaLys: 4.03 ± 0.891
7.053AlaLeu: 7.053 ± 0.728
1.557AlaMet: 1.557 ± 0.779
3.48AlaAsn: 3.48 ± 0.476
3.389AlaPro: 3.389 ± 0.67
2.29AlaGln: 2.29 ± 0.226
2.656AlaArg: 2.656 ± 0.429
4.671AlaSer: 4.671 ± 1.117
7.053AlaThr: 7.053 ± 1.724
3.664AlaVal: 3.664 ± 0.851
0.366AlaTrp: 0.366 ± 0.104
4.03AlaTyr: 4.03 ± 0.751
0.092AlaXaa: 0.092 ± 0.181
Cys
1.557CysAla: 1.557 ± 0.438
0.366CysCys: 0.366 ± 0.104
1.649CysAsp: 1.649 ± 0.247
0.641CysGlu: 0.641 ± 0.089
1.099CysPhe: 1.099 ± 0.358
0.458CysGly: 0.458 ± 0.278
0.55CysHis: 0.55 ± 0.221
1.557CysIle: 1.557 ± 0.368
0.733CysLys: 0.733 ± 0.296
1.374CysLeu: 1.374 ± 0.816
0.092CysMet: 0.092 ± 0.052
1.374CysAsn: 1.374 ± 0.486
0.733CysPro: 0.733 ± 0.067
0.55CysGln: 0.55 ± 0.201
0.366CysArg: 0.366 ± 0.201
0.641CysSer: 0.641 ± 0.103
2.381CysThr: 2.381 ± 0.294
2.29CysVal: 2.29 ± 0.545
0.092CysTrp: 0.092 ± 0.257
1.008CysTyr: 1.008 ± 0.305
0.0CysXaa: 0.0 ± 0.0
Asp
2.473AspAla: 2.473 ± 0.294
1.008AspCys: 1.008 ± 0.15
1.465AspAsp: 1.465 ± 0.499
1.191AspGlu: 1.191 ± 0.392
3.114AspPhe: 3.114 ± 0.332
1.557AspGly: 1.557 ± 0.192
1.557AspHis: 1.557 ± 0.454
3.938AspIle: 3.938 ± 0.383
2.473AspLys: 2.473 ± 0.484
4.305AspLeu: 4.305 ± 0.805
1.099AspMet: 1.099 ± 0.197
2.656AspAsn: 2.656 ± 0.902
2.29AspPro: 2.29 ± 0.321
2.29AspGln: 2.29 ± 0.643
1.282AspArg: 1.282 ± 0.478
2.839AspSer: 2.839 ± 0.334
3.114AspThr: 3.114 ± 0.679
3.206AspVal: 3.206 ± 0.146
0.458AspTrp: 0.458 ± 0.157
2.839AspTyr: 2.839 ± 0.603
0.0AspXaa: 0.0 ± 0.0
Glu
1.923GluAla: 1.923 ± 0.406
0.55GluCys: 0.55 ± 0.239
1.74GluAsp: 1.74 ± 0.453
2.473GluGlu: 2.473 ± 0.821
1.099GluPhe: 1.099 ± 0.358
1.099GluGly: 1.099 ± 0.434
1.74GluHis: 1.74 ± 0.152
2.473GluIle: 2.473 ± 0.386
0.733GluLys: 0.733 ± 0.229
3.297GluLeu: 3.297 ± 0.642
0.916GluMet: 0.916 ± 0.257
1.282GluAsn: 1.282 ± 0.423
2.015GluPro: 2.015 ± 0.406
1.832GluGln: 1.832 ± 0.505
1.374GluArg: 1.374 ± 0.276
2.107GluSer: 2.107 ± 0.478
5.221GluThr: 5.221 ± 0.929
3.023GluVal: 3.023 ± 0.671
0.092GluTrp: 0.092 ± 0.052
1.282GluTyr: 1.282 ± 0.43
0.0GluXaa: 0.0 ± 0.0
Phe
2.198PheAla: 2.198 ± 0.415
1.374PheCys: 1.374 ± 0.468
2.381PheAsp: 2.381 ± 0.202
1.191PheGlu: 1.191 ± 0.523
2.198PhePhe: 2.198 ± 0.341
1.923PheGly: 1.923 ± 0.304
1.465PheHis: 1.465 ± 0.526
2.931PheIle: 2.931 ± 0.639
1.282PheLys: 1.282 ± 0.241
4.03PheLeu: 4.03 ± 0.301
1.191PheMet: 1.191 ± 0.263
3.664PheAsn: 3.664 ± 0.389
1.465PhePro: 1.465 ± 0.441
2.015PheGln: 2.015 ± 0.266
1.282PheArg: 1.282 ± 0.227
3.48PheSer: 3.48 ± 0.542
6.228PheThr: 6.228 ± 0.438
1.923PheVal: 1.923 ± 0.267
0.092PheTrp: 0.092 ± 0.052
2.656PheTyr: 2.656 ± 0.865
0.092PheXaa: 0.092 ± 0.052
Gly
2.656GlyAla: 2.656 ± 0.324
1.557GlyCys: 1.557 ± 0.575
2.198GlyAsp: 2.198 ± 0.725
2.015GlyGlu: 2.015 ± 0.475
2.198GlyPhe: 2.198 ± 0.501
2.015GlyGly: 2.015 ± 0.503
1.649GlyHis: 1.649 ± 0.352
2.565GlyIle: 2.565 ± 0.235
2.748GlyLys: 2.748 ± 0.216
4.305GlyLeu: 4.305 ± 0.216
0.55GlyMet: 0.55 ± 0.201
1.74GlyAsn: 1.74 ± 0.238
2.107GlyPro: 2.107 ± 0.276
1.832GlyGln: 1.832 ± 0.613
0.55GlyArg: 0.55 ± 0.262
2.565GlySer: 2.565 ± 0.184
4.305GlyThr: 4.305 ± 1.292
2.473GlyVal: 2.473 ± 0.597
0.275GlyTrp: 0.275 ± 0.156
2.473GlyTyr: 2.473 ± 0.311
0.0GlyXaa: 0.0 ± 0.0
His
3.572HisAla: 3.572 ± 0.822
0.824HisCys: 0.824 ± 0.216
1.282HisAsp: 1.282 ± 0.227
0.916HisGlu: 0.916 ± 0.313
1.832HisPhe: 1.832 ± 0.215
1.191HisGly: 1.191 ± 0.448
1.832HisHis: 1.832 ± 0.436
1.649HisIle: 1.649 ± 0.519
1.74HisLys: 1.74 ± 0.667
4.763HisLeu: 4.763 ± 0.56
0.641HisMet: 0.641 ± 0.107
1.923HisAsn: 1.923 ± 0.707
1.465HisPro: 1.465 ± 0.312
1.099HisGln: 1.099 ± 0.252
0.916HisArg: 0.916 ± 0.188
2.473HisSer: 2.473 ± 0.277
2.931HisThr: 2.931 ± 0.568
2.931HisVal: 2.931 ± 0.445
0.183HisTrp: 0.183 ± 0.1
2.839HisTyr: 2.839 ± 0.314
0.0HisXaa: 0.0 ± 0.0
Ile
4.122IleAla: 4.122 ± 1.186
1.465IleCys: 1.465 ± 0.389
4.488IleAsp: 4.488 ± 0.863
2.565IleGlu: 2.565 ± 0.485
3.664IlePhe: 3.664 ± 0.581
3.572IleGly: 3.572 ± 0.189
1.832IleHis: 1.832 ± 0.56
4.58IleIle: 4.58 ± 0.287
2.839IleLys: 2.839 ± 0.354
4.396IleLeu: 4.396 ± 0.566
1.74IleMet: 1.74 ± 0.208
4.488IleAsn: 4.488 ± 1.102
4.305IlePro: 4.305 ± 0.593
3.114IleGln: 3.114 ± 0.531
2.198IleArg: 2.198 ± 0.729
5.129IleSer: 5.129 ± 0.446
6.228IleThr: 6.228 ± 0.692
3.664IleVal: 3.664 ± 0.703
0.55IleTrp: 0.55 ± 0.194
2.748IleTyr: 2.748 ± 0.37
0.0IleXaa: 0.0 ± 0.0
Lys
3.206LysAla: 3.206 ± 0.913
0.641LysCys: 0.641 ± 0.089
1.923LysAsp: 1.923 ± 0.472
1.649LysGlu: 1.649 ± 0.402
1.557LysPhe: 1.557 ± 0.363
1.923LysGly: 1.923 ± 0.239
2.565LysHis: 2.565 ± 0.441
3.938LysIle: 3.938 ± 0.74
1.74LysLys: 1.74 ± 0.466
5.038LysLeu: 5.038 ± 0.896
0.641LysMet: 0.641 ± 0.089
2.107LysAsn: 2.107 ± 0.141
4.213LysPro: 4.213 ± 1.147
1.191LysGln: 1.191 ± 0.212
1.374LysArg: 1.374 ± 0.441
2.839LysSer: 2.839 ± 0.354
3.938LysThr: 3.938 ± 0.537
3.48LysVal: 3.48 ± 0.734
0.275LysTrp: 0.275 ± 0.148
2.473LysTyr: 2.473 ± 0.715
0.0LysXaa: 0.0 ± 0.0
Leu
7.602LeuAla: 7.602 ± 0.893
1.832LeuCys: 1.832 ± 0.213
4.03LeuAsp: 4.03 ± 0.263
4.03LeuGlu: 4.03 ± 0.77
3.755LeuPhe: 3.755 ± 0.679
4.946LeuGly: 4.946 ± 0.927
2.565LeuHis: 2.565 ± 0.521
6.137LeuIle: 6.137 ± 0.322
4.213LeuLys: 4.213 ± 0.392
9.251LeuLeu: 9.251 ± 1.727
1.008LeuMet: 1.008 ± 0.234
5.312LeuAsn: 5.312 ± 2.083
6.228LeuPro: 6.228 ± 0.516
4.488LeuGln: 4.488 ± 0.755
2.565LeuArg: 2.565 ± 0.653
7.511LeuSer: 7.511 ± 0.745
8.243LeuThr: 8.243 ± 1.123
5.587LeuVal: 5.587 ± 0.757
0.55LeuTrp: 0.55 ± 0.279
4.03LeuTyr: 4.03 ± 0.505
0.0LeuXaa: 0.0 ± 0.0
Met
1.374MetAla: 1.374 ± 1.009
0.55MetCys: 0.55 ± 0.194
0.275MetAsp: 0.275 ± 0.097
0.458MetGlu: 0.458 ± 0.157
0.183MetPhe: 0.183 ± 0.231
1.008MetGly: 1.008 ± 0.381
1.191MetHis: 1.191 ± 0.403
0.916MetIle: 0.916 ± 0.118
0.366MetLys: 0.366 ± 0.104
2.198MetLeu: 2.198 ± 0.155
0.458MetMet: 0.458 ± 0.213
0.916MetAsn: 0.916 ± 0.373
1.465MetPro: 1.465 ± 0.708
0.733MetGln: 0.733 ± 0.296
0.55MetArg: 0.55 ± 0.072
1.282MetSer: 1.282 ± 0.29
2.107MetThr: 2.107 ± 0.768
2.015MetVal: 2.015 ± 0.471
0.183MetTrp: 0.183 ± 0.1
0.916MetTyr: 0.916 ± 0.352
0.0MetXaa: 0.0 ± 0.0
Asn
3.755AsnAla: 3.755 ± 0.374
0.824AsnCys: 0.824 ± 0.175
1.923AsnAsp: 1.923 ± 0.305
1.191AsnGlu: 1.191 ± 0.287
2.839AsnPhe: 2.839 ± 0.605
2.839AsnGly: 2.839 ± 1.174
1.649AsnHis: 1.649 ± 0.547
5.129AsnIle: 5.129 ± 0.623
2.656AsnLys: 2.656 ± 0.678
4.396AsnLeu: 4.396 ± 1.031
1.832AsnMet: 1.832 ± 0.613
3.572AsnAsn: 3.572 ± 0.217
3.664AsnPro: 3.664 ± 0.604
1.74AsnGln: 1.74 ± 0.447
1.557AsnArg: 1.557 ± 0.438
3.755AsnSer: 3.755 ± 0.958
6.595AsnThr: 6.595 ± 1.137
2.015AsnVal: 2.015 ± 0.626
0.733AsnTrp: 0.733 ± 0.207
4.305AsnTyr: 4.305 ± 0.436
0.0AsnXaa: 0.0 ± 0.0
Pro
3.938ProAla: 3.938 ± 0.439
0.733ProCys: 0.733 ± 0.067
2.748ProAsp: 2.748 ± 0.513
2.29ProGlu: 2.29 ± 0.51
2.473ProPhe: 2.473 ± 0.26
2.839ProGly: 2.839 ± 0.613
1.557ProHis: 1.557 ± 0.467
3.297ProIle: 3.297 ± 0.543
2.931ProLys: 2.931 ± 0.699
5.862ProLeu: 5.862 ± 0.527
1.282ProMet: 1.282 ± 0.923
4.396ProAsn: 4.396 ± 0.621
3.023ProPro: 3.023 ± 0.873
2.381ProGln: 2.381 ± 0.653
1.465ProArg: 1.465 ± 0.236
4.213ProSer: 4.213 ± 0.765
6.595ProThr: 6.595 ± 0.352
3.023ProVal: 3.023 ± 0.232
0.183ProTrp: 0.183 ± 0.104
2.381ProTyr: 2.381 ± 0.355
0.0ProXaa: 0.0 ± 0.0
Gln
3.389GlnAla: 3.389 ± 0.328
0.55GlnCys: 0.55 ± 0.194
1.649GlnAsp: 1.649 ± 0.427
1.465GlnGlu: 1.465 ± 0.47
1.649GlnPhe: 1.649 ± 0.246
1.649GlnGly: 1.649 ± 0.17
1.832GlnHis: 1.832 ± 0.613
3.664GlnIle: 3.664 ± 0.499
1.465GlnLys: 1.465 ± 0.432
5.038GlnLeu: 5.038 ± 0.206
0.366GlnMet: 0.366 ± 0.119
2.381GlnAsn: 2.381 ± 0.485
2.748GlnPro: 2.748 ± 0.454
1.74GlnGln: 1.74 ± 1.436
1.557GlnArg: 1.557 ± 0.308
2.015GlnSer: 2.015 ± 0.511
4.396GlnThr: 4.396 ± 0.534
2.198GlnVal: 2.198 ± 0.298
0.366GlnTrp: 0.366 ± 0.161
1.74GlnTyr: 1.74 ± 0.43
0.0GlnXaa: 0.0 ± 0.0
Arg
2.565ArgAla: 2.565 ± 0.512
0.275ArgCys: 0.275 ± 0.097
1.099ArgAsp: 1.099 ± 0.479
0.641ArgGlu: 0.641 ± 0.24
1.008ArgPhe: 1.008 ± 0.235
1.74ArgGly: 1.74 ± 0.474
0.733ArgHis: 0.733 ± 0.194
1.374ArgIle: 1.374 ± 0.486
1.465ArgLys: 1.465 ± 0.704
2.931ArgLeu: 2.931 ± 0.565
0.458ArgMet: 0.458 ± 0.191
1.557ArgAsn: 1.557 ± 0.269
1.557ArgPro: 1.557 ± 0.363
1.557ArgGln: 1.557 ± 0.34
0.916ArgArg: 0.916 ± 0.464
2.839ArgSer: 2.839 ± 0.22
2.839ArgThr: 2.839 ± 0.191
2.656ArgVal: 2.656 ± 0.676
0.0ArgTrp: 0.0 ± 0.0
1.099ArgTyr: 1.099 ± 0.158
0.0ArgXaa: 0.0 ± 0.0
Ser
3.847SerAla: 3.847 ± 0.498
1.74SerCys: 1.74 ± 0.152
3.297SerAsp: 3.297 ± 0.394
4.213SerGlu: 4.213 ± 0.731
3.572SerPhe: 3.572 ± 0.858
3.389SerGly: 3.389 ± 0.32
2.29SerHis: 2.29 ± 0.471
4.58SerIle: 4.58 ± 0.277
3.206SerLys: 3.206 ± 0.532
6.411SerLeu: 6.411 ± 0.344
1.465SerMet: 1.465 ± 0.225
3.023SerAsn: 3.023 ± 1.11
3.572SerPro: 3.572 ± 0.281
3.389SerGln: 3.389 ± 0.449
2.198SerArg: 2.198 ± 0.426
5.77SerSer: 5.77 ± 1.003
8.518SerThr: 8.518 ± 1.159
4.488SerVal: 4.488 ± 1.281
0.55SerTrp: 0.55 ± 0.194
2.565SerTyr: 2.565 ± 0.497
0.092SerXaa: 0.092 ± 0.181
Thr
7.327ThrAla: 7.327 ± 1.262
2.473ThrCys: 2.473 ± 0.311
3.664ThrAsp: 3.664 ± 0.315
3.389ThrGlu: 3.389 ± 0.966
4.305ThrPhe: 4.305 ± 1.132
4.03ThrGly: 4.03 ± 0.379
4.122ThrHis: 4.122 ± 0.397
6.961ThrIle: 6.961 ± 0.968
5.679ThrLys: 5.679 ± 0.786
9.617ThrLeu: 9.617 ± 1.327
1.282ThrMet: 1.282 ± 0.232
4.58ThrAsn: 4.58 ± 0.429
7.511ThrPro: 7.511 ± 0.631
4.58ThrGln: 4.58 ± 0.685
3.389ThrArg: 3.389 ± 0.455
9.068ThrSer: 9.068 ± 0.842
13.464ThrThr: 13.464 ± 2.131
4.671ThrVal: 4.671 ± 0.844
0.824ThrTrp: 0.824 ± 0.194
4.122ThrTyr: 4.122 ± 0.346
0.0ThrXaa: 0.0 ± 0.0
Val
4.03ValAla: 4.03 ± 0.772
1.008ValCys: 1.008 ± 0.136
3.023ValAsp: 3.023 ± 0.2
1.74ValGlu: 1.74 ± 0.751
1.923ValPhe: 1.923 ± 0.309
2.107ValGly: 2.107 ± 0.414
2.198ValHis: 2.198 ± 0.54
3.847ValIle: 3.847 ± 0.515
3.023ValLys: 3.023 ± 0.564
5.953ValLeu: 5.953 ± 0.96
0.916ValMet: 0.916 ± 0.592
3.847ValAsn: 3.847 ± 0.518
3.755ValPro: 3.755 ± 0.809
2.473ValGln: 2.473 ± 0.606
1.465ValArg: 1.465 ± 0.395
4.854ValSer: 4.854 ± 0.867
5.496ValThr: 5.496 ± 0.723
3.114ValVal: 3.114 ± 0.63
0.458ValTrp: 0.458 ± 0.191
3.114ValTyr: 3.114 ± 0.252
0.0ValXaa: 0.0 ± 0.0
Trp
0.458TrpAla: 0.458 ± 0.157
0.0TrpCys: 0.0 ± 0.0
0.366TrpAsp: 0.366 ± 0.201
0.275TrpGlu: 0.275 ± 0.151
0.366TrpPhe: 0.366 ± 0.217
0.0TrpGly: 0.0 ± 0.0
0.183TrpHis: 0.183 ± 0.104
0.183TrpIle: 0.183 ± 0.104
0.55TrpLys: 0.55 ± 0.194
0.733TrpLeu: 0.733 ± 0.215
0.183TrpMet: 0.183 ± 0.104
0.092TrpAsn: 0.092 ± 0.052
0.55TrpPro: 0.55 ± 0.129
0.458TrpGln: 0.458 ± 0.191
0.275TrpArg: 0.275 ± 0.097
0.55TrpSer: 0.55 ± 0.279
0.641TrpThr: 0.641 ± 0.211
0.458TrpVal: 0.458 ± 0.073
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.297TyrAla: 3.297 ± 0.562
1.099TyrCys: 1.099 ± 0.403
3.297TyrAsp: 3.297 ± 0.495
1.649TyrGlu: 1.649 ± 0.17
3.206TyrPhe: 3.206 ± 0.803
2.015TyrGly: 2.015 ± 0.208
2.107TyrHis: 2.107 ± 0.163
3.389TyrIle: 3.389 ± 0.233
2.839TyrLys: 2.839 ± 0.222
2.931TyrLeu: 2.931 ± 0.462
1.374TyrMet: 1.374 ± 0.515
4.58TyrAsn: 4.58 ± 1.132
1.557TyrPro: 1.557 ± 0.536
2.107TyrGln: 2.107 ± 0.617
1.374TyrArg: 1.374 ± 0.218
3.389TyrSer: 3.389 ± 0.352
4.763TyrThr: 4.763 ± 0.229
1.557TyrVal: 1.557 ± 0.139
0.092TyrTrp: 0.092 ± 0.052
3.297TyrTyr: 3.297 ± 0.474
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.092XaaCys: 0.092 ± 0.181
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.183XaaSer: 0.183 ± 0.158
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (10919 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski