Amino acid dipepetide frequency for ANMV-1 virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.121AlaAla: 5.121 ± 1.088
1.148AlaCys: 1.148 ± 0.27
4.326AlaAsp: 4.326 ± 0.626
4.856AlaGlu: 4.856 ± 0.692
2.56AlaPhe: 2.56 ± 0.427
5.209AlaGly: 5.209 ± 0.899
2.207AlaHis: 2.207 ± 0.544
4.503AlaIle: 4.503 ± 0.858
5.386AlaLys: 5.386 ± 0.727
6.092AlaLeu: 6.092 ± 1.062
1.413AlaMet: 1.413 ± 0.353
2.649AlaAsn: 2.649 ± 0.518
2.384AlaPro: 2.384 ± 0.494
3.002AlaGln: 3.002 ± 0.585
3.973AlaArg: 3.973 ± 0.814
4.415AlaSer: 4.415 ± 0.785
3.267AlaThr: 3.267 ± 0.593
5.298AlaVal: 5.298 ± 0.954
0.441AlaTrp: 0.441 ± 0.224
1.854AlaTyr: 1.854 ± 0.397
0.0AlaXaa: 0.0 ± 0.0
Cys
1.236CysAla: 1.236 ± 0.372
0.353CysCys: 0.353 ± 0.168
2.472CysAsp: 2.472 ± 1.233
2.296CysGlu: 2.296 ± 1.193
0.618CysPhe: 0.618 ± 0.242
0.883CysGly: 0.883 ± 0.228
0.795CysHis: 0.795 ± 0.315
1.854CysIle: 1.854 ± 0.85
1.06CysLys: 1.06 ± 0.378
1.413CysLeu: 1.413 ± 0.571
0.441CysMet: 0.441 ± 0.346
1.501CysAsn: 1.501 ± 0.444
1.589CysPro: 1.589 ± 0.775
0.618CysGln: 0.618 ± 0.3
2.472CysArg: 2.472 ± 1.243
1.589CysSer: 1.589 ± 0.577
1.236CysThr: 1.236 ± 0.441
3.179CysVal: 3.179 ± 1.754
0.618CysTrp: 0.618 ± 0.251
0.795CysTyr: 0.795 ± 0.237
0.0CysXaa: 0.0 ± 0.0
Asp
4.856AspAla: 4.856 ± 0.654
2.472AspCys: 2.472 ± 1.297
5.739AspAsp: 5.739 ± 0.912
5.827AspGlu: 5.827 ± 0.743
2.737AspPhe: 2.737 ± 0.5
3.973AspGly: 3.973 ± 0.676
1.413AspHis: 1.413 ± 0.376
3.885AspIle: 3.885 ± 0.742
3.002AspLys: 3.002 ± 0.416
5.209AspLeu: 5.209 ± 0.645
1.678AspMet: 1.678 ± 0.396
3.355AspAsn: 3.355 ± 0.81
2.384AspPro: 2.384 ± 0.491
1.236AspGln: 1.236 ± 0.315
3.708AspArg: 3.708 ± 0.645
2.472AspSer: 2.472 ± 0.48
3.267AspThr: 3.267 ± 0.545
4.061AspVal: 4.061 ± 0.632
0.883AspTrp: 0.883 ± 0.247
3.355AspTyr: 3.355 ± 0.591
0.0AspXaa: 0.0 ± 0.0
Glu
6.71GluAla: 6.71 ± 0.77
2.825GluCys: 2.825 ± 1.975
5.121GluAsp: 5.121 ± 0.783
9.359GluGlu: 9.359 ± 1.548
2.384GluPhe: 2.384 ± 0.491
6.18GluGly: 6.18 ± 1.076
1.766GluHis: 1.766 ± 0.401
4.238GluIle: 4.238 ± 0.789
6.71GluLys: 6.71 ± 1.41
6.799GluLeu: 6.799 ± 1.077
2.119GluMet: 2.119 ± 0.416
3.179GluAsn: 3.179 ± 0.426
2.472GluPro: 2.472 ± 0.581
2.384GluGln: 2.384 ± 0.664
4.591GluArg: 4.591 ± 0.678
2.119GluSer: 2.119 ± 0.438
4.061GluThr: 4.061 ± 0.647
4.326GluVal: 4.326 ± 0.653
1.413GluTrp: 1.413 ± 0.313
2.649GluTyr: 2.649 ± 0.498
0.0GluXaa: 0.0 ± 0.0
Phe
2.296PheAla: 2.296 ± 0.499
1.501PheCys: 1.501 ± 0.441
3.179PheAsp: 3.179 ± 0.446
2.56PheGlu: 2.56 ± 0.518
1.678PhePhe: 1.678 ± 0.684
1.854PheGly: 1.854 ± 0.312
1.236PheHis: 1.236 ± 0.327
1.854PheIle: 1.854 ± 0.504
2.031PheLys: 2.031 ± 0.401
2.825PheLeu: 2.825 ± 0.617
0.706PheMet: 0.706 ± 0.292
1.854PheAsn: 1.854 ± 0.438
2.296PhePro: 2.296 ± 0.48
0.53PheGln: 0.53 ± 0.218
2.472PheArg: 2.472 ± 0.589
2.384PheSer: 2.384 ± 0.526
2.384PheThr: 2.384 ± 0.486
1.766PheVal: 1.766 ± 0.469
0.353PheTrp: 0.353 ± 0.156
1.148PheTyr: 1.148 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
5.209GlyAla: 5.209 ± 0.891
1.678GlyCys: 1.678 ± 0.65
4.856GlyAsp: 4.856 ± 0.894
5.121GlyGlu: 5.121 ± 0.65
3.179GlyPhe: 3.179 ± 0.494
3.885GlyGly: 3.885 ± 0.664
1.589GlyHis: 1.589 ± 0.334
5.121GlyIle: 5.121 ± 0.94
4.326GlyLys: 4.326 ± 0.513
3.443GlyLeu: 3.443 ± 0.542
1.501GlyMet: 1.501 ± 0.351
2.649GlyAsn: 2.649 ± 0.426
2.207GlyPro: 2.207 ± 0.436
1.589GlyGln: 1.589 ± 0.443
4.944GlyArg: 4.944 ± 0.852
4.591GlySer: 4.591 ± 0.746
4.238GlyThr: 4.238 ± 0.625
4.326GlyVal: 4.326 ± 0.645
1.413GlyTrp: 1.413 ± 0.499
2.472GlyTyr: 2.472 ± 0.585
0.0GlyXaa: 0.0 ± 0.0
His
1.766HisAla: 1.766 ± 0.386
1.678HisCys: 1.678 ± 0.973
1.06HisAsp: 1.06 ± 0.302
2.472HisGlu: 2.472 ± 0.457
1.854HisPhe: 1.854 ± 0.424
2.384HisGly: 2.384 ± 0.653
0.883HisHis: 0.883 ± 0.398
2.031HisIle: 2.031 ± 0.409
1.06HisLys: 1.06 ± 0.339
1.589HisLeu: 1.589 ± 0.455
0.0HisMet: 0.0 ± 0.0
1.148HisAsn: 1.148 ± 0.287
1.413HisPro: 1.413 ± 0.43
0.971HisGln: 0.971 ± 0.268
1.06HisArg: 1.06 ± 0.311
1.148HisSer: 1.148 ± 0.289
1.942HisThr: 1.942 ± 0.62
1.324HisVal: 1.324 ± 0.48
0.0HisTrp: 0.0 ± 0.0
1.148HisTyr: 1.148 ± 0.253
0.0HisXaa: 0.0 ± 0.0
Ile
5.033IleAla: 5.033 ± 0.867
1.324IleCys: 1.324 ± 0.383
4.679IleAsp: 4.679 ± 0.679
5.916IleGlu: 5.916 ± 0.764
2.207IlePhe: 2.207 ± 0.612
3.973IleGly: 3.973 ± 0.51
2.384IleHis: 2.384 ± 0.462
2.649IleIle: 2.649 ± 0.518
3.179IleLys: 3.179 ± 0.619
4.326IleLeu: 4.326 ± 0.784
0.971IleMet: 0.971 ± 0.286
1.678IleAsn: 1.678 ± 0.357
2.56IlePro: 2.56 ± 0.733
1.324IleGln: 1.324 ± 0.324
3.532IleArg: 3.532 ± 0.543
3.532IleSer: 3.532 ± 0.494
4.503IleThr: 4.503 ± 0.679
5.474IleVal: 5.474 ± 1.057
0.706IleTrp: 0.706 ± 0.251
2.296IleTyr: 2.296 ± 0.599
0.0IleXaa: 0.0 ± 0.0
Lys
4.061LysAla: 4.061 ± 0.665
1.236LysCys: 1.236 ± 0.604
3.355LysAsp: 3.355 ± 0.627
6.269LysGlu: 6.269 ± 1.268
1.854LysPhe: 1.854 ± 0.542
5.474LysGly: 5.474 ± 0.747
1.678LysHis: 1.678 ± 0.46
3.973LysIle: 3.973 ± 0.673
7.946LysLys: 7.946 ± 1.032
7.505LysLeu: 7.505 ± 1.037
2.914LysMet: 2.914 ± 0.53
2.296LysAsn: 2.296 ± 0.463
1.854LysPro: 1.854 ± 0.407
2.296LysGln: 2.296 ± 0.392
4.15LysArg: 4.15 ± 0.623
3.708LysSer: 3.708 ± 0.648
4.061LysThr: 4.061 ± 0.561
4.503LysVal: 4.503 ± 0.893
1.501LysTrp: 1.501 ± 0.292
2.296LysTyr: 2.296 ± 0.528
0.0LysXaa: 0.0 ± 0.0
Leu
6.269LeuAla: 6.269 ± 0.9
2.207LeuCys: 2.207 ± 0.612
4.503LeuAsp: 4.503 ± 0.618
7.152LeuGlu: 7.152 ± 0.836
2.914LeuPhe: 2.914 ± 0.693
5.474LeuGly: 5.474 ± 0.69
1.678LeuHis: 1.678 ± 0.434
4.415LeuIle: 4.415 ± 0.883
6.622LeuLys: 6.622 ± 1.133
5.386LeuLeu: 5.386 ± 0.846
1.589LeuMet: 1.589 ± 0.337
3.532LeuAsn: 3.532 ± 0.555
2.207LeuPro: 2.207 ± 0.406
1.678LeuGln: 1.678 ± 0.449
4.238LeuArg: 4.238 ± 0.72
4.238LeuSer: 4.238 ± 0.736
4.503LeuThr: 4.503 ± 0.842
4.856LeuVal: 4.856 ± 0.709
0.53LeuTrp: 0.53 ± 0.211
2.384LeuTyr: 2.384 ± 0.555
0.0LeuXaa: 0.0 ± 0.0
Met
1.324MetAla: 1.324 ± 0.315
0.353MetCys: 0.353 ± 0.223
0.53MetAsp: 0.53 ± 0.173
1.501MetGlu: 1.501 ± 0.406
0.53MetPhe: 0.53 ± 0.216
1.854MetGly: 1.854 ± 0.39
0.265MetHis: 0.265 ± 0.194
1.413MetIle: 1.413 ± 0.411
1.942MetLys: 1.942 ± 0.372
1.766MetLeu: 1.766 ± 0.429
0.265MetMet: 0.265 ± 0.177
1.148MetAsn: 1.148 ± 0.402
1.148MetPro: 1.148 ± 0.355
0.441MetGln: 0.441 ± 0.173
1.148MetArg: 1.148 ± 0.29
1.854MetSer: 1.854 ± 0.377
1.589MetThr: 1.589 ± 0.393
0.883MetVal: 0.883 ± 0.313
0.265MetTrp: 0.265 ± 0.146
0.706MetTyr: 0.706 ± 0.294
0.0MetXaa: 0.0 ± 0.0
Asn
2.914AsnAla: 2.914 ± 0.539
1.236AsnCys: 1.236 ± 0.355
1.678AsnAsp: 1.678 ± 0.356
2.296AsnGlu: 2.296 ± 0.47
1.766AsnPhe: 1.766 ± 0.43
2.56AsnGly: 2.56 ± 0.449
0.706AsnHis: 0.706 ± 0.231
3.179AsnIle: 3.179 ± 0.872
2.649AsnLys: 2.649 ± 0.434
2.825AsnLeu: 2.825 ± 0.422
0.618AsnMet: 0.618 ± 0.254
1.589AsnAsn: 1.589 ± 0.447
2.031AsnPro: 2.031 ± 0.551
1.06AsnGln: 1.06 ± 0.318
2.207AsnArg: 2.207 ± 0.425
1.854AsnSer: 1.854 ± 0.472
2.737AsnThr: 2.737 ± 0.639
2.384AsnVal: 2.384 ± 0.349
1.148AsnTrp: 1.148 ± 0.278
3.002AsnTyr: 3.002 ± 0.573
0.0AsnXaa: 0.0 ± 0.0
Pro
2.825ProAla: 2.825 ± 0.482
0.265ProCys: 0.265 ± 0.147
3.355ProAsp: 3.355 ± 0.581
4.503ProGlu: 4.503 ± 0.776
1.766ProPhe: 1.766 ± 0.5
3.002ProGly: 3.002 ± 0.506
0.971ProHis: 0.971 ± 0.291
1.678ProIle: 1.678 ± 0.38
3.267ProLys: 3.267 ± 0.479
2.737ProLeu: 2.737 ± 0.5
0.53ProMet: 0.53 ± 0.186
1.148ProAsn: 1.148 ± 0.319
0.883ProPro: 0.883 ± 0.268
0.883ProGln: 0.883 ± 0.276
1.766ProArg: 1.766 ± 0.401
2.031ProSer: 2.031 ± 0.413
3.002ProThr: 3.002 ± 0.594
2.384ProVal: 2.384 ± 0.348
0.441ProTrp: 0.441 ± 0.174
1.501ProTyr: 1.501 ± 0.408
0.0ProXaa: 0.0 ± 0.0
Gln
2.649GlnAla: 2.649 ± 0.574
0.53GlnCys: 0.53 ± 0.207
1.942GlnAsp: 1.942 ± 0.334
2.56GlnGlu: 2.56 ± 0.529
1.148GlnPhe: 1.148 ± 0.298
1.766GlnGly: 1.766 ± 0.394
1.148GlnHis: 1.148 ± 0.338
2.119GlnIle: 2.119 ± 0.516
2.207GlnLys: 2.207 ± 0.515
1.942GlnLeu: 1.942 ± 0.42
0.177GlnMet: 0.177 ± 0.121
0.971GlnAsn: 0.971 ± 0.273
0.883GlnPro: 0.883 ± 0.218
0.883GlnGln: 0.883 ± 0.245
1.942GlnArg: 1.942 ± 0.506
1.678GlnSer: 1.678 ± 0.329
1.148GlnThr: 1.148 ± 0.306
1.678GlnVal: 1.678 ± 0.373
0.618GlnTrp: 0.618 ± 0.215
0.618GlnTyr: 0.618 ± 0.194
0.0GlnXaa: 0.0 ± 0.0
Arg
4.061ArgAla: 4.061 ± 0.699
2.119ArgCys: 2.119 ± 1.045
4.061ArgAsp: 4.061 ± 0.693
5.298ArgGlu: 5.298 ± 0.753
1.854ArgPhe: 1.854 ± 0.517
3.708ArgGly: 3.708 ± 0.503
1.589ArgHis: 1.589 ± 0.386
3.002ArgIle: 3.002 ± 0.537
4.679ArgLys: 4.679 ± 0.804
4.503ArgLeu: 4.503 ± 0.614
1.589ArgMet: 1.589 ± 0.344
2.649ArgAsn: 2.649 ± 0.447
1.766ArgPro: 1.766 ± 0.413
2.472ArgGln: 2.472 ± 0.386
3.09ArgArg: 3.09 ± 0.612
2.825ArgSer: 2.825 ± 0.681
2.737ArgThr: 2.737 ± 0.528
2.384ArgVal: 2.384 ± 0.476
0.883ArgTrp: 0.883 ± 0.283
2.207ArgTyr: 2.207 ± 0.451
0.0ArgXaa: 0.0 ± 0.0
Ser
3.443SerAla: 3.443 ± 0.496
1.589SerCys: 1.589 ± 0.474
3.797SerAsp: 3.797 ± 0.557
2.649SerGlu: 2.649 ± 0.45
2.384SerPhe: 2.384 ± 0.553
5.033SerGly: 5.033 ± 0.662
1.413SerHis: 1.413 ± 0.386
3.797SerIle: 3.797 ± 0.6
4.15SerLys: 4.15 ± 0.737
3.09SerLeu: 3.09 ± 0.558
0.706SerMet: 0.706 ± 0.271
1.501SerAsn: 1.501 ± 0.365
2.472SerPro: 2.472 ± 0.522
1.501SerGln: 1.501 ± 0.322
2.472SerArg: 2.472 ± 0.566
2.649SerSer: 2.649 ± 0.567
3.443SerThr: 3.443 ± 0.613
3.355SerVal: 3.355 ± 0.632
1.766SerTrp: 1.766 ± 0.367
1.854SerTyr: 1.854 ± 0.306
0.0SerXaa: 0.0 ± 0.0
Thr
3.532ThrAla: 3.532 ± 0.754
1.413ThrCys: 1.413 ± 0.585
3.179ThrAsp: 3.179 ± 0.543
3.443ThrGlu: 3.443 ± 0.564
1.589ThrPhe: 1.589 ± 0.358
4.856ThrGly: 4.856 ± 0.957
1.589ThrHis: 1.589 ± 0.573
5.298ThrIle: 5.298 ± 0.861
3.885ThrLys: 3.885 ± 0.544
5.739ThrLeu: 5.739 ± 0.889
1.501ThrMet: 1.501 ± 0.416
2.472ThrAsn: 2.472 ± 0.603
3.09ThrPro: 3.09 ± 0.429
1.501ThrGln: 1.501 ± 0.363
3.002ThrArg: 3.002 ± 0.559
3.179ThrSer: 3.179 ± 0.537
3.09ThrThr: 3.09 ± 0.671
3.09ThrVal: 3.09 ± 0.594
1.06ThrTrp: 1.06 ± 0.44
2.119ThrTyr: 2.119 ± 0.359
0.0ThrXaa: 0.0 ± 0.0
Val
5.209ValAla: 5.209 ± 0.694
1.148ValCys: 1.148 ± 0.474
3.708ValAsp: 3.708 ± 0.643
5.209ValGlu: 5.209 ± 0.778
2.119ValPhe: 2.119 ± 0.343
3.62ValGly: 3.62 ± 0.495
2.119ValHis: 2.119 ± 0.491
3.708ValIle: 3.708 ± 0.663
4.591ValLys: 4.591 ± 0.744
4.061ValLeu: 4.061 ± 0.893
0.706ValMet: 0.706 ± 0.233
2.649ValAsn: 2.649 ± 0.368
3.09ValPro: 3.09 ± 0.495
2.207ValGln: 2.207 ± 0.42
3.267ValArg: 3.267 ± 0.498
3.885ValSer: 3.885 ± 0.485
3.885ValThr: 3.885 ± 0.808
3.443ValVal: 3.443 ± 0.763
0.795ValTrp: 0.795 ± 0.239
2.031ValTyr: 2.031 ± 0.43
0.0ValXaa: 0.0 ± 0.0
Trp
0.265TrpAla: 0.265 ± 0.169
0.53TrpCys: 0.53 ± 0.209
0.883TrpAsp: 0.883 ± 0.314
0.618TrpGlu: 0.618 ± 0.303
0.618TrpPhe: 0.618 ± 0.228
0.618TrpGly: 0.618 ± 0.265
0.441TrpHis: 0.441 ± 0.207
1.236TrpIle: 1.236 ± 0.353
0.971TrpLys: 0.971 ± 0.399
2.031TrpLeu: 2.031 ± 0.434
0.353TrpMet: 0.353 ± 0.174
0.971TrpAsn: 0.971 ± 0.263
0.088TrpPro: 0.088 ± 0.093
0.618TrpGln: 0.618 ± 0.227
1.236TrpArg: 1.236 ± 0.423
1.06TrpSer: 1.06 ± 0.319
1.06TrpThr: 1.06 ± 0.34
1.06TrpVal: 1.06 ± 0.351
0.353TrpTrp: 0.353 ± 0.16
0.706TrpTyr: 0.706 ± 0.269
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.324TyrAla: 1.324 ± 0.436
1.413TyrCys: 1.413 ± 0.964
3.09TyrAsp: 3.09 ± 0.642
1.501TyrGlu: 1.501 ± 0.268
1.148TyrPhe: 1.148 ± 0.364
2.031TyrGly: 2.031 ± 0.464
1.06TyrHis: 1.06 ± 0.289
2.296TyrIle: 2.296 ± 0.492
3.267TyrLys: 3.267 ± 0.503
3.09TyrLeu: 3.09 ± 0.532
1.06TyrMet: 1.06 ± 0.296
1.501TyrAsn: 1.501 ± 0.309
2.119TyrPro: 2.119 ± 0.376
1.413TyrGln: 1.413 ± 0.409
2.207TyrArg: 2.207 ± 0.444
1.942TyrSer: 1.942 ± 0.43
2.472TyrThr: 2.472 ± 0.55
1.678TyrVal: 1.678 ± 0.339
0.53TyrTrp: 0.53 ± 0.184
1.854TyrTyr: 1.854 ± 0.451
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 69 proteins (11327 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski