Amino acid dipepetide frequency for Hubei lepidoptera virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.441AlaAla: 2.441 ± 0.433
0.257AlaCys: 0.257 ± 0.159
2.57AlaAsp: 2.57 ± 0.685
3.598AlaGlu: 3.598 ± 0.643
1.799AlaPhe: 1.799 ± 0.289
2.313AlaGly: 2.313 ± 0.732
0.771AlaHis: 0.771 ± 0.279
4.24AlaIle: 4.24 ± 0.737
3.212AlaLys: 3.212 ± 0.64
3.855AlaLeu: 3.855 ± 0.678
1.799AlaMet: 1.799 ± 0.57
3.341AlaAsn: 3.341 ± 0.694
1.542AlaPro: 1.542 ± 0.39
1.542AlaGln: 1.542 ± 0.568
4.754AlaArg: 4.754 ± 0.582
3.084AlaSer: 3.084 ± 0.497
3.341AlaThr: 3.341 ± 0.657
2.57AlaVal: 2.57 ± 0.933
0.385AlaTrp: 0.385 ± 0.266
2.184AlaTyr: 2.184 ± 0.574
0.0AlaXaa: 0.0 ± 0.0
Cys
0.257CysAla: 0.257 ± 0.179
0.0CysCys: 0.0 ± 0.0
0.385CysAsp: 0.385 ± 0.247
0.385CysGlu: 0.385 ± 0.2
0.385CysPhe: 0.385 ± 0.255
0.385CysGly: 0.385 ± 0.382
0.257CysHis: 0.257 ± 0.257
0.642CysIle: 0.642 ± 0.266
0.0CysLys: 0.0 ± 0.0
0.514CysLeu: 0.514 ± 0.383
0.642CysMet: 0.642 ± 0.239
0.514CysAsn: 0.514 ± 0.29
0.0CysPro: 0.0 ± 0.0
0.257CysGln: 0.257 ± 0.153
0.514CysArg: 0.514 ± 0.271
0.257CysSer: 0.257 ± 0.227
0.385CysThr: 0.385 ± 0.269
1.156CysVal: 1.156 ± 0.557
0.128CysTrp: 0.128 ± 0.128
0.257CysTyr: 0.257 ± 0.15
0.0CysXaa: 0.0 ± 0.0
Asp
2.57AspAla: 2.57 ± 0.533
0.257AspCys: 0.257 ± 0.203
6.039AspAsp: 6.039 ± 0.918
7.581AspGlu: 7.581 ± 0.953
1.413AspPhe: 1.413 ± 0.477
5.011AspGly: 5.011 ± 0.78
0.128AspHis: 0.128 ± 0.118
4.882AspIle: 4.882 ± 0.501
4.497AspLys: 4.497 ± 1.461
2.441AspLeu: 2.441 ± 0.567
2.698AspMet: 2.698 ± 0.707
2.313AspAsn: 2.313 ± 0.633
1.413AspPro: 1.413 ± 0.441
1.542AspGln: 1.542 ± 0.536
5.011AspArg: 5.011 ± 0.446
2.827AspSer: 2.827 ± 0.73
2.441AspThr: 2.441 ± 0.536
5.396AspVal: 5.396 ± 0.989
0.514AspTrp: 0.514 ± 0.375
2.184AspTyr: 2.184 ± 0.43
0.0AspXaa: 0.0 ± 0.0
Glu
3.341GluAla: 3.341 ± 0.3
0.385GluCys: 0.385 ± 0.278
4.754GluAsp: 4.754 ± 1.204
9.122GluGlu: 9.122 ± 1.745
2.184GluPhe: 2.184 ± 0.719
4.24GluGly: 4.24 ± 0.679
1.927GluHis: 1.927 ± 0.372
9.893GluIle: 9.893 ± 1.133
7.838GluLys: 7.838 ± 1.401
9.508GluLeu: 9.508 ± 0.898
3.212GluMet: 3.212 ± 0.678
4.112GluAsn: 4.112 ± 0.521
1.413GluPro: 1.413 ± 0.31
2.441GluGln: 2.441 ± 0.399
6.167GluArg: 6.167 ± 0.729
5.396GluSer: 5.396 ± 0.607
4.882GluThr: 4.882 ± 0.716
4.625GluVal: 4.625 ± 0.881
0.385GluTrp: 0.385 ± 0.239
3.598GluTyr: 3.598 ± 1.021
0.0GluXaa: 0.0 ± 0.0
Phe
1.542PheAla: 1.542 ± 0.516
0.128PheCys: 0.128 ± 0.166
2.441PheAsp: 2.441 ± 0.505
2.57PheGlu: 2.57 ± 0.399
1.156PhePhe: 1.156 ± 0.458
1.542PheGly: 1.542 ± 0.332
0.385PheHis: 0.385 ± 0.166
3.341PheIle: 3.341 ± 0.706
2.827PheLys: 2.827 ± 0.782
2.698PheLeu: 2.698 ± 0.519
1.028PheMet: 1.028 ± 0.271
2.57PheAsn: 2.57 ± 0.635
1.67PhePro: 1.67 ± 0.624
1.028PheGln: 1.028 ± 0.342
0.899PheArg: 0.899 ± 0.381
2.056PheSer: 2.056 ± 0.56
1.156PheThr: 1.156 ± 0.478
2.441PheVal: 2.441 ± 0.369
0.257PheTrp: 0.257 ± 0.19
0.771PheTyr: 0.771 ± 0.428
0.0PheXaa: 0.0 ± 0.0
Gly
3.084GlyAla: 3.084 ± 0.574
0.514GlyCys: 0.514 ± 0.367
3.084GlyAsp: 3.084 ± 0.632
5.396GlyGlu: 5.396 ± 0.484
1.028GlyPhe: 1.028 ± 0.394
3.469GlyGly: 3.469 ± 0.891
0.642GlyHis: 0.642 ± 0.305
4.754GlyIle: 4.754 ± 0.89
3.855GlyLys: 3.855 ± 0.517
4.882GlyLeu: 4.882 ± 0.77
2.57GlyMet: 2.57 ± 0.376
4.368GlyAsn: 4.368 ± 0.717
0.514GlyPro: 0.514 ± 0.237
1.156GlyGln: 1.156 ± 0.317
3.341GlyArg: 3.341 ± 0.597
2.955GlySer: 2.955 ± 0.824
3.341GlyThr: 3.341 ± 0.337
4.497GlyVal: 4.497 ± 0.685
0.514GlyTrp: 0.514 ± 0.238
1.927GlyTyr: 1.927 ± 0.455
0.0GlyXaa: 0.0 ± 0.0
His
0.514HisAla: 0.514 ± 0.168
0.128HisCys: 0.128 ± 0.127
0.514HisAsp: 0.514 ± 0.165
0.899HisGlu: 0.899 ± 0.36
0.514HisPhe: 0.514 ± 0.159
1.67HisGly: 1.67 ± 0.565
0.257HisHis: 0.257 ± 0.169
0.771HisIle: 0.771 ± 0.256
0.899HisLys: 0.899 ± 0.438
1.285HisLeu: 1.285 ± 0.397
0.642HisMet: 0.642 ± 0.334
0.514HisAsn: 0.514 ± 0.356
0.257HisPro: 0.257 ± 0.152
0.385HisGln: 0.385 ± 0.32
0.385HisArg: 0.385 ± 0.187
1.156HisSer: 1.156 ± 0.33
1.156HisThr: 1.156 ± 0.394
1.156HisVal: 1.156 ± 0.435
0.0HisTrp: 0.0 ± 0.0
0.642HisTyr: 0.642 ± 0.34
0.0HisXaa: 0.0 ± 0.0
Ile
4.368IleAla: 4.368 ± 0.602
0.771IleCys: 0.771 ± 0.382
5.396IleAsp: 5.396 ± 0.591
7.195IleGlu: 7.195 ± 0.805
2.184IlePhe: 2.184 ± 0.646
4.24IleGly: 4.24 ± 0.773
1.285IleHis: 1.285 ± 0.36
5.396IleIle: 5.396 ± 0.984
5.91IleLys: 5.91 ± 0.815
8.095IleLeu: 8.095 ± 1.305
2.955IleMet: 2.955 ± 0.528
5.91IleAsn: 5.91 ± 0.758
2.698IlePro: 2.698 ± 0.425
1.67IleGln: 1.67 ± 0.595
6.039IleArg: 6.039 ± 0.777
5.268IleSer: 5.268 ± 0.915
3.855IleThr: 3.855 ± 0.61
5.011IleVal: 5.011 ± 0.659
1.028IleTrp: 1.028 ± 0.333
4.368IleTyr: 4.368 ± 0.692
0.0IleXaa: 0.0 ± 0.0
Lys
2.441LysAla: 2.441 ± 0.484
0.514LysCys: 0.514 ± 0.195
3.469LysAsp: 3.469 ± 0.974
6.167LysGlu: 6.167 ± 1.215
2.827LysPhe: 2.827 ± 0.457
4.754LysGly: 4.754 ± 0.795
0.385LysHis: 0.385 ± 0.254
6.296LysIle: 6.296 ± 1.105
4.497LysLys: 4.497 ± 1.444
6.938LysLeu: 6.938 ± 1.287
4.112LysMet: 4.112 ± 1.17
5.268LysAsn: 5.268 ± 0.726
2.184LysPro: 2.184 ± 0.478
2.056LysGln: 2.056 ± 0.665
3.598LysArg: 3.598 ± 0.776
2.57LysSer: 2.57 ± 0.422
2.698LysThr: 2.698 ± 0.312
5.011LysVal: 5.011 ± 0.989
0.899LysTrp: 0.899 ± 0.213
3.598LysTyr: 3.598 ± 0.628
0.0LysXaa: 0.0 ± 0.0
Leu
2.827LeuAla: 2.827 ± 0.708
0.514LeuCys: 0.514 ± 0.271
4.754LeuAsp: 4.754 ± 0.567
4.754LeuGlu: 4.754 ± 0.552
3.084LeuPhe: 3.084 ± 0.933
3.469LeuGly: 3.469 ± 0.588
1.413LeuHis: 1.413 ± 0.44
7.067LeuIle: 7.067 ± 1.223
6.553LeuLys: 6.553 ± 0.768
6.681LeuLeu: 6.681 ± 1.125
3.598LeuMet: 3.598 ± 1.067
4.368LeuAsn: 4.368 ± 0.625
2.57LeuPro: 2.57 ± 0.312
2.955LeuGln: 2.955 ± 0.53
6.039LeuArg: 6.039 ± 0.922
6.167LeuSer: 6.167 ± 0.674
5.011LeuThr: 5.011 ± 0.535
2.441LeuVal: 2.441 ± 0.308
0.514LeuTrp: 0.514 ± 0.356
3.212LeuTyr: 3.212 ± 0.467
0.0LeuXaa: 0.0 ± 0.0
Met
1.799MetAla: 1.799 ± 0.208
0.514MetCys: 0.514 ± 0.514
2.955MetAsp: 2.955 ± 0.619
2.441MetGlu: 2.441 ± 0.613
1.285MetPhe: 1.285 ± 0.29
2.441MetGly: 2.441 ± 0.534
0.642MetHis: 0.642 ± 0.306
4.754MetIle: 4.754 ± 0.906
3.084MetLys: 3.084 ± 0.532
2.827MetLeu: 2.827 ± 0.481
1.799MetMet: 1.799 ± 0.48
2.57MetAsn: 2.57 ± 0.446
1.927MetPro: 1.927 ± 0.705
1.156MetGln: 1.156 ± 0.508
2.955MetArg: 2.955 ± 0.622
3.084MetSer: 3.084 ± 0.586
2.056MetThr: 2.056 ± 0.615
1.927MetVal: 1.927 ± 0.4
0.514MetTrp: 0.514 ± 0.318
1.156MetTyr: 1.156 ± 0.388
0.0MetXaa: 0.0 ± 0.0
Asn
4.882AsnAla: 4.882 ± 0.558
0.128AsnCys: 0.128 ± 0.117
5.011AsnAsp: 5.011 ± 0.921
8.095AsnGlu: 8.095 ± 1.159
1.799AsnPhe: 1.799 ± 0.514
3.341AsnGly: 3.341 ± 0.574
0.899AsnHis: 0.899 ± 0.37
4.24AsnIle: 4.24 ± 0.622
4.882AsnLys: 4.882 ± 0.96
2.441AsnLeu: 2.441 ± 0.669
1.285AsnMet: 1.285 ± 0.366
3.084AsnAsn: 3.084 ± 0.745
1.413AsnPro: 1.413 ± 0.46
1.799AsnGln: 1.799 ± 0.414
3.855AsnArg: 3.855 ± 0.63
3.341AsnSer: 3.341 ± 0.938
2.441AsnThr: 2.441 ± 0.636
7.195AsnVal: 7.195 ± 1.111
0.771AsnTrp: 0.771 ± 0.348
2.955AsnTyr: 2.955 ± 0.637
0.0AsnXaa: 0.0 ± 0.0
Pro
1.028ProAla: 1.028 ± 0.28
0.514ProCys: 0.514 ± 0.283
1.927ProAsp: 1.927 ± 0.464
1.542ProGlu: 1.542 ± 0.569
0.899ProPhe: 0.899 ± 0.277
1.156ProGly: 1.156 ± 0.353
0.514ProHis: 0.514 ± 0.265
2.056ProIle: 2.056 ± 0.477
1.542ProLys: 1.542 ± 0.428
2.313ProLeu: 2.313 ± 0.45
0.771ProMet: 0.771 ± 0.314
1.285ProAsn: 1.285 ± 0.282
0.899ProPro: 0.899 ± 0.45
0.771ProGln: 0.771 ± 0.405
2.184ProArg: 2.184 ± 0.607
2.184ProSer: 2.184 ± 0.548
2.441ProThr: 2.441 ± 0.663
1.156ProVal: 1.156 ± 0.456
0.128ProTrp: 0.128 ± 0.117
0.642ProTyr: 0.642 ± 0.327
0.0ProXaa: 0.0 ± 0.0
Gln
1.67GlnAla: 1.67 ± 0.634
0.514GlnCys: 0.514 ± 0.264
0.899GlnAsp: 0.899 ± 0.502
2.056GlnGlu: 2.056 ± 0.475
1.156GlnPhe: 1.156 ± 0.453
1.285GlnGly: 1.285 ± 0.552
0.642GlnHis: 0.642 ± 0.196
2.57GlnIle: 2.57 ± 0.515
1.67GlnLys: 1.67 ± 0.375
2.441GlnLeu: 2.441 ± 0.618
1.542GlnMet: 1.542 ± 0.601
1.542GlnAsn: 1.542 ± 0.243
0.642GlnPro: 0.642 ± 0.44
0.771GlnGln: 0.771 ± 0.394
1.413GlnArg: 1.413 ± 0.353
1.799GlnSer: 1.799 ± 0.496
1.799GlnThr: 1.799 ± 0.388
1.413GlnVal: 1.413 ± 0.294
0.128GlnTrp: 0.128 ± 0.127
1.67GlnTyr: 1.67 ± 0.373
0.0GlnXaa: 0.0 ± 0.0
Arg
4.882ArgAla: 4.882 ± 0.783
0.385ArgCys: 0.385 ± 0.192
2.698ArgAsp: 2.698 ± 0.323
6.167ArgGlu: 6.167 ± 0.825
2.056ArgPhe: 2.056 ± 0.794
3.983ArgGly: 3.983 ± 0.865
1.156ArgHis: 1.156 ± 0.546
6.938ArgIle: 6.938 ± 0.747
4.754ArgLys: 4.754 ± 0.973
3.855ArgLeu: 3.855 ± 0.554
2.698ArgMet: 2.698 ± 0.556
4.754ArgAsn: 4.754 ± 0.423
1.285ArgPro: 1.285 ± 0.439
2.056ArgGln: 2.056 ± 0.741
2.698ArgArg: 2.698 ± 0.758
3.983ArgSer: 3.983 ± 0.566
2.827ArgThr: 2.827 ± 0.639
5.396ArgVal: 5.396 ± 0.643
0.385ArgTrp: 0.385 ± 0.256
3.469ArgTyr: 3.469 ± 0.474
0.0ArgXaa: 0.0 ± 0.0
Ser
3.855SerAla: 3.855 ± 0.938
0.514SerCys: 0.514 ± 0.253
3.726SerAsp: 3.726 ± 0.572
5.91SerGlu: 5.91 ± 0.604
2.441SerPhe: 2.441 ± 0.585
3.855SerGly: 3.855 ± 0.753
0.514SerHis: 0.514 ± 0.276
4.112SerIle: 4.112 ± 0.837
3.726SerLys: 3.726 ± 0.61
6.039SerLeu: 6.039 ± 0.989
2.57SerMet: 2.57 ± 0.384
4.112SerAsn: 4.112 ± 0.414
0.899SerPro: 0.899 ± 0.235
0.771SerGln: 0.771 ± 0.322
3.855SerArg: 3.855 ± 0.693
3.855SerSer: 3.855 ± 0.789
4.112SerThr: 4.112 ± 1.037
4.24SerVal: 4.24 ± 0.704
0.257SerTrp: 0.257 ± 0.135
2.313SerTyr: 2.313 ± 0.544
0.0SerXaa: 0.0 ± 0.0
Thr
3.084ThrAla: 3.084 ± 0.848
0.642ThrCys: 0.642 ± 0.245
2.955ThrAsp: 2.955 ± 0.804
5.011ThrGlu: 5.011 ± 0.507
2.698ThrPhe: 2.698 ± 0.748
2.955ThrGly: 2.955 ± 1.048
0.642ThrHis: 0.642 ± 0.388
2.827ThrIle: 2.827 ± 0.369
3.469ThrLys: 3.469 ± 0.568
3.212ThrLeu: 3.212 ± 0.579
3.983ThrMet: 3.983 ± 0.587
3.341ThrAsn: 3.341 ± 0.654
2.056ThrPro: 2.056 ± 0.486
1.156ThrGln: 1.156 ± 0.374
3.341ThrArg: 3.341 ± 0.991
3.726ThrSer: 3.726 ± 0.597
3.212ThrThr: 3.212 ± 1.008
3.212ThrVal: 3.212 ± 0.855
0.642ThrTrp: 0.642 ± 0.235
1.67ThrTyr: 1.67 ± 0.427
0.0ThrXaa: 0.0 ± 0.0
Val
3.212ValAla: 3.212 ± 0.689
0.257ValCys: 0.257 ± 0.179
4.497ValAsp: 4.497 ± 0.647
6.167ValGlu: 6.167 ± 0.953
2.57ValPhe: 2.57 ± 0.761
3.469ValGly: 3.469 ± 0.625
0.771ValHis: 0.771 ± 0.323
5.139ValIle: 5.139 ± 0.701
4.24ValLys: 4.24 ± 0.861
4.112ValLeu: 4.112 ± 0.502
2.313ValMet: 2.313 ± 0.694
5.268ValAsn: 5.268 ± 0.47
1.413ValPro: 1.413 ± 0.317
2.056ValGln: 2.056 ± 0.543
6.424ValArg: 6.424 ± 1.518
4.497ValSer: 4.497 ± 1.057
3.726ValThr: 3.726 ± 0.405
3.341ValVal: 3.341 ± 1.208
0.385ValTrp: 0.385 ± 0.234
3.212ValTyr: 3.212 ± 0.667
0.0ValXaa: 0.0 ± 0.0
Trp
0.128TrpAla: 0.128 ± 0.143
0.128TrpCys: 0.128 ± 0.166
0.257TrpAsp: 0.257 ± 0.184
0.257TrpGlu: 0.257 ± 0.156
0.257TrpPhe: 0.257 ± 0.167
0.128TrpGly: 0.128 ± 0.166
0.257TrpHis: 0.257 ± 0.135
1.285TrpIle: 1.285 ± 0.545
0.385TrpLys: 0.385 ± 0.235
0.899TrpLeu: 0.899 ± 0.327
0.385TrpMet: 0.385 ± 0.161
0.642TrpAsn: 0.642 ± 0.263
0.128TrpPro: 0.128 ± 0.183
0.514TrpGln: 0.514 ± 0.273
0.771TrpArg: 0.771 ± 0.361
0.514TrpSer: 0.514 ± 0.361
0.514TrpThr: 0.514 ± 0.274
0.128TrpVal: 0.128 ± 0.108
0.128TrpTrp: 0.128 ± 0.117
0.257TrpTyr: 0.257 ± 0.233
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.67TyrAla: 1.67 ± 0.385
0.257TyrCys: 0.257 ± 0.19
2.827TyrAsp: 2.827 ± 0.439
4.112TyrGlu: 4.112 ± 0.557
0.771TyrPhe: 0.771 ± 0.262
2.184TyrGly: 2.184 ± 0.497
0.257TyrHis: 0.257 ± 0.174
2.441TyrIle: 2.441 ± 0.601
2.441TyrLys: 2.441 ± 0.416
2.955TyrLeu: 2.955 ± 0.746
1.156TyrMet: 1.156 ± 0.479
3.983TyrAsn: 3.983 ± 0.622
1.028TyrPro: 1.028 ± 0.411
1.542TyrGln: 1.542 ± 0.246
2.313TyrArg: 2.313 ± 0.374
2.955TyrSer: 2.955 ± 0.801
2.441TyrThr: 2.441 ± 0.632
4.754TyrVal: 4.754 ± 0.588
0.0TyrTrp: 0.0 ± 0.0
2.056TyrTyr: 2.056 ± 0.716
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 10 proteins (7784 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski