Amino acid dipepetide frequency for Yellow head virus (YHV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.375AlaAla: 4.375 ± 0.493
1.193AlaCys: 1.193 ± 0.213
3.977AlaAsp: 3.977 ± 0.632
2.148AlaGlu: 2.148 ± 0.559
3.182AlaPhe: 3.182 ± 0.414
1.511AlaGly: 1.511 ± 1.184
2.068AlaHis: 2.068 ± 0.32
3.5AlaIle: 3.5 ± 0.423
2.943AlaLys: 2.943 ± 0.331
5.012AlaLeu: 5.012 ± 0.68
0.239AlaMet: 0.239 ± 0.704
4.375AlaAsn: 4.375 ± 0.768
2.148AlaPro: 2.148 ± 0.577
3.659AlaGln: 3.659 ± 0.474
3.182AlaArg: 3.182 ± 1.113
4.137AlaSer: 4.137 ± 0.464
4.852AlaThr: 4.852 ± 0.679
3.421AlaVal: 3.421 ± 0.436
0.239AlaTrp: 0.239 ± 0.135
3.977AlaTyr: 3.977 ± 0.314
0.0AlaXaa: 0.0 ± 0.0
Cys
1.83CysAla: 1.83 ± 0.202
0.318CysCys: 0.318 ± 0.11
1.352CysAsp: 1.352 ± 0.737
1.034CysGlu: 1.034 ± 0.177
1.352CysPhe: 1.352 ± 0.434
2.546CysGly: 2.546 ± 0.341
1.591CysHis: 1.591 ± 0.207
1.591CysIle: 1.591 ± 0.225
1.83CysLys: 1.83 ± 0.355
1.75CysLeu: 1.75 ± 0.28
0.636CysMet: 0.636 ± 0.08
2.307CysAsn: 2.307 ± 0.55
1.034CysPro: 1.034 ± 0.404
0.477CysGln: 0.477 ± 0.143
0.477CysArg: 0.477 ± 0.056
2.466CysSer: 2.466 ± 0.493
1.511CysThr: 1.511 ± 0.192
0.716CysVal: 0.716 ± 0.084
0.239CysTrp: 0.239 ± 0.097
1.75CysTyr: 1.75 ± 0.203
0.0CysXaa: 0.0 ± 0.0
Asp
2.864AspAla: 2.864 ± 0.435
1.75AspCys: 1.75 ± 0.357
2.625AspAsp: 2.625 ± 0.54
2.148AspGlu: 2.148 ± 0.577
2.625AspPhe: 2.625 ± 0.459
2.784AspGly: 2.784 ± 0.505
2.068AspHis: 2.068 ± 0.411
5.887AspIle: 5.887 ± 0.641
2.148AspLys: 2.148 ± 0.584
3.58AspLeu: 3.58 ± 0.433
1.034AspMet: 1.034 ± 0.177
3.5AspAsn: 3.5 ± 0.549
2.386AspPro: 2.386 ± 1.257
1.591AspGln: 1.591 ± 0.268
1.75AspArg: 1.75 ± 0.385
3.739AspSer: 3.739 ± 0.442
4.534AspThr: 4.534 ± 0.661
2.148AspVal: 2.148 ± 0.305
0.318AspTrp: 0.318 ± 0.229
2.943AspTyr: 2.943 ± 0.963
0.0AspXaa: 0.0 ± 0.0
Glu
3.341GluAla: 3.341 ± 0.742
1.273GluCys: 1.273 ± 0.176
2.307GluAsp: 2.307 ± 0.61
2.386GluGlu: 2.386 ± 0.428
1.989GluPhe: 1.989 ± 0.534
1.352GluGly: 1.352 ± 0.607
1.989GluHis: 1.989 ± 0.269
3.102GluIle: 3.102 ± 0.588
1.671GluLys: 1.671 ± 0.264
2.466GluLeu: 2.466 ± 0.729
0.398GluMet: 0.398 ± 0.09
1.989GluAsn: 1.989 ± 0.603
1.989GluPro: 1.989 ± 0.602
1.989GluGln: 1.989 ± 0.648
1.432GluArg: 1.432 ± 0.643
1.671GluSer: 1.671 ± 0.198
3.58GluThr: 3.58 ± 0.392
2.705GluVal: 2.705 ± 0.393
0.557GluTrp: 0.557 ± 0.171
2.784GluTyr: 2.784 ± 0.332
0.0GluXaa: 0.0 ± 0.0
Phe
3.898PheAla: 3.898 ± 0.532
1.114PheCys: 1.114 ± 0.126
2.943PheAsp: 2.943 ± 0.596
2.227PheGlu: 2.227 ± 0.274
2.148PhePhe: 2.148 ± 0.268
1.989PheGly: 1.989 ± 0.904
0.716PheHis: 0.716 ± 0.141
4.375PheIle: 4.375 ± 0.255
3.182PheLys: 3.182 ± 0.516
5.171PheLeu: 5.171 ± 0.694
0.875PheMet: 0.875 ± 0.127
2.546PheAsn: 2.546 ± 0.417
1.989PhePro: 1.989 ± 0.442
1.114PheGln: 1.114 ± 0.132
2.466PheArg: 2.466 ± 0.272
3.341PheSer: 3.341 ± 0.477
4.534PheThr: 4.534 ± 0.501
1.671PheVal: 1.671 ± 0.396
0.239PheTrp: 0.239 ± 0.071
2.943PheTyr: 2.943 ± 0.357
0.0PheXaa: 0.0 ± 0.0
Gly
3.023GlyAla: 3.023 ± 0.945
1.432GlyCys: 1.432 ± 0.582
2.705GlyAsp: 2.705 ± 0.63
1.193GlyGlu: 1.193 ± 0.356
1.671GlyPhe: 1.671 ± 0.393
1.671GlyGly: 1.671 ± 0.541
2.148GlyHis: 2.148 ± 0.363
5.807GlyIle: 5.807 ± 0.734
2.943GlyLys: 2.943 ± 2.605
3.023GlyLeu: 3.023 ± 0.653
0.636GlyMet: 0.636 ± 0.187
1.83GlyAsn: 1.83 ± 0.55
1.591GlyPro: 1.591 ± 0.285
1.909GlyGln: 1.909 ± 0.239
1.114GlyArg: 1.114 ± 0.219
3.818GlySer: 3.818 ± 0.641
3.818GlyThr: 3.818 ± 0.921
1.83GlyVal: 1.83 ± 0.355
0.398GlyTrp: 0.398 ± 0.676
2.784GlyTyr: 2.784 ± 0.854
0.0GlyXaa: 0.0 ± 0.0
His
1.909HisAla: 1.909 ± 0.552
1.034HisCys: 1.034 ± 0.245
1.75HisAsp: 1.75 ± 0.639
2.784HisGlu: 2.784 ± 0.559
1.989HisPhe: 1.989 ± 0.249
2.466HisGly: 2.466 ± 0.296
1.193HisHis: 1.193 ± 0.123
3.182HisIle: 3.182 ± 0.522
1.83HisLys: 1.83 ± 0.41
2.943HisLeu: 2.943 ± 0.669
0.318HisMet: 0.318 ± 0.079
3.341HisAsn: 3.341 ± 0.71
2.148HisPro: 2.148 ± 0.298
1.034HisGln: 1.034 ± 0.238
1.034HisArg: 1.034 ± 0.305
1.989HisSer: 1.989 ± 0.599
2.784HisThr: 2.784 ± 0.412
1.83HisVal: 1.83 ± 0.225
0.557HisTrp: 0.557 ± 0.066
1.114HisTyr: 1.114 ± 0.357
0.0HisXaa: 0.0 ± 0.0
Ile
5.727IleAla: 5.727 ± 0.877
1.989IleCys: 1.989 ± 0.217
5.012IleAsp: 5.012 ± 1.084
3.261IleGlu: 3.261 ± 1.069
3.898IlePhe: 3.898 ± 0.414
3.182IleGly: 3.182 ± 0.37
2.466IleHis: 2.466 ± 0.634
7.398IleIle: 7.398 ± 1.641
3.102IleLys: 3.102 ± 0.539
6.523IleLeu: 6.523 ± 0.21
1.273IleMet: 1.273 ± 0.159
5.648IleAsn: 5.648 ± 0.659
4.852IlePro: 4.852 ± 0.44
2.784IleGln: 2.784 ± 0.646
3.341IleArg: 3.341 ± 0.452
4.852IleSer: 4.852 ± 0.838
6.443IleThr: 6.443 ± 0.49
4.296IleVal: 4.296 ± 0.497
0.795IleTrp: 0.795 ± 0.31
5.012IleTyr: 5.012 ± 0.68
0.0IleXaa: 0.0 ± 0.0
Lys
2.307LysAla: 2.307 ± 0.516
0.557LysCys: 0.557 ± 0.066
2.227LysAsp: 2.227 ± 0.319
1.75LysGlu: 1.75 ± 0.254
3.182LysPhe: 3.182 ± 0.701
1.511LysGly: 1.511 ± 0.251
2.227LysHis: 2.227 ± 0.396
2.705LysIle: 2.705 ± 0.5
2.227LysLys: 2.227 ± 0.675
4.057LysLeu: 4.057 ± 0.535
1.193LysMet: 1.193 ± 0.588
2.068LysAsn: 2.068 ± 0.627
1.83LysPro: 1.83 ± 0.202
1.83LysGln: 1.83 ± 0.268
2.307LysArg: 2.307 ± 1.182
3.5LysSer: 3.5 ± 1.043
2.546LysThr: 2.546 ± 0.672
3.898LysVal: 3.898 ± 0.371
0.557LysTrp: 0.557 ± 0.171
3.182LysTyr: 3.182 ± 0.608
0.0LysXaa: 0.0 ± 0.0
Leu
4.375LeuAla: 4.375 ± 0.969
2.784LeuCys: 2.784 ± 0.444
4.693LeuAsp: 4.693 ± 0.465
2.943LeuGlu: 2.943 ± 0.408
4.455LeuPhe: 4.455 ± 0.612
3.341LeuGly: 3.341 ± 0.877
2.466LeuHis: 2.466 ± 0.356
7.478LeuIle: 7.478 ± 1.382
4.693LeuLys: 4.693 ± 0.609
7.0LeuLeu: 7.0 ± 0.961
1.352LeuMet: 1.352 ± 0.285
4.693LeuAsn: 4.693 ± 0.352
4.773LeuPro: 4.773 ± 1.64
3.58LeuGln: 3.58 ± 1.111
3.58LeuArg: 3.58 ± 1.011
7.875LeuSer: 7.875 ± 0.837
7.239LeuThr: 7.239 ± 1.342
2.307LeuVal: 2.307 ± 0.537
0.716LeuTrp: 0.716 ± 0.161
3.261LeuTyr: 3.261 ± 0.611
0.0LeuXaa: 0.0 ± 0.0
Met
1.511MetAla: 1.511 ± 0.558
0.318MetCys: 0.318 ± 0.079
0.557MetAsp: 0.557 ± 0.164
0.795MetGlu: 0.795 ± 0.632
0.159MetPhe: 0.159 ± 0.056
0.318MetGly: 0.318 ± 0.11
0.716MetHis: 0.716 ± 0.084
1.114MetIle: 1.114 ± 0.19
0.875MetLys: 0.875 ± 0.281
1.114MetLeu: 1.114 ± 0.132
0.398MetMet: 0.398 ± 0.083
0.955MetAsn: 0.955 ± 0.604
0.318MetPro: 0.318 ± 0.668
0.239MetGln: 0.239 ± 0.761
0.239MetArg: 0.239 ± 0.135
1.034MetSer: 1.034 ± 0.664
1.591MetThr: 1.591 ± 0.344
1.034MetVal: 1.034 ± 0.177
0.159MetTrp: 0.159 ± 0.056
0.875MetTyr: 0.875 ± 0.127
0.0MetXaa: 0.0 ± 0.0
Asn
4.216AsnAla: 4.216 ± 0.568
1.193AsnCys: 1.193 ± 0.192
3.341AsnAsp: 3.341 ± 0.481
2.625AsnGlu: 2.625 ± 0.496
2.466AsnPhe: 2.466 ± 0.467
4.455AsnGly: 4.455 ± 0.312
1.75AsnHis: 1.75 ± 0.32
5.33AsnIle: 5.33 ± 0.942
2.705AsnLys: 2.705 ± 0.304
3.977AsnLeu: 3.977 ± 0.539
1.114AsnMet: 1.114 ± 0.19
2.864AsnAsn: 2.864 ± 0.464
3.421AsnPro: 3.421 ± 1.183
2.068AsnGln: 2.068 ± 0.354
1.83AsnArg: 1.83 ± 1.224
3.659AsnSer: 3.659 ± 0.459
6.125AsnThr: 6.125 ± 0.876
3.659AsnVal: 3.659 ± 0.814
0.557AsnTrp: 0.557 ± 0.066
1.83AsnTyr: 1.83 ± 0.197
0.0AsnXaa: 0.0 ± 0.0
Pro
2.625ProAla: 2.625 ± 0.376
1.273ProCys: 1.273 ± 0.245
2.227ProAsp: 2.227 ± 0.488
1.671ProGlu: 1.671 ± 0.39
2.546ProPhe: 2.546 ± 0.442
2.466ProGly: 2.466 ± 1.615
1.75ProHis: 1.75 ± 0.515
3.023ProIle: 3.023 ± 0.412
2.466ProLys: 2.466 ± 0.452
3.023ProLeu: 3.023 ± 0.449
0.239ProMet: 0.239 ± 1.407
1.591ProAsn: 1.591 ± 0.645
2.546ProPro: 2.546 ± 0.559
1.432ProGln: 1.432 ± 0.583
2.784ProArg: 2.784 ± 0.505
5.091ProSer: 5.091 ± 0.935
3.818ProThr: 3.818 ± 0.833
3.5ProVal: 3.5 ± 0.838
0.318ProTrp: 0.318 ± 0.079
2.625ProTyr: 2.625 ± 0.285
0.0ProXaa: 0.0 ± 0.0
Gln
2.625GlnAla: 2.625 ± 0.537
1.193GlnCys: 1.193 ± 0.629
0.636GlnAsp: 0.636 ± 0.103
1.511GlnGlu: 1.511 ± 0.574
2.386GlnPhe: 2.386 ± 0.383
1.273GlnGly: 1.273 ± 0.583
1.034GlnHis: 1.034 ± 0.245
3.977GlnIle: 3.977 ± 0.442
1.034GlnLys: 1.034 ± 0.595
3.023GlnLeu: 3.023 ± 0.61
0.239GlnMet: 0.239 ± 0.263
0.795GlnAsn: 0.795 ± 0.237
1.671GlnPro: 1.671 ± 0.172
0.716GlnGln: 0.716 ± 0.141
2.068GlnArg: 2.068 ± 0.292
2.864GlnSer: 2.864 ± 1.825
3.023GlnThr: 3.023 ± 0.523
2.386GlnVal: 2.386 ± 0.488
0.0GlnTrp: 0.0 ± 0.0
1.989GlnTyr: 1.989 ± 0.442
0.0GlnXaa: 0.0 ± 0.0
Arg
1.034ArgAla: 1.034 ± 0.305
1.75ArgCys: 1.75 ± 0.254
2.784ArgAsp: 2.784 ± 0.354
1.75ArgGlu: 1.75 ± 0.385
2.864ArgPhe: 2.864 ± 0.335
2.625ArgGly: 2.625 ± 0.437
2.705ArgHis: 2.705 ± 0.303
2.625ArgIle: 2.625 ± 0.311
1.273ArgLys: 1.273 ± 0.39
3.421ArgLeu: 3.421 ± 1.204
0.557ArgMet: 0.557 ± 0.645
1.909ArgAsn: 1.909 ± 0.511
1.75ArgPro: 1.75 ± 0.599
0.875ArgGln: 0.875 ± 0.281
3.58ArgArg: 3.58 ± 1.944
3.261ArgSer: 3.261 ± 0.587
2.784ArgThr: 2.784 ± 1.259
3.977ArgVal: 3.977 ± 1.078
0.398ArgTrp: 0.398 ± 0.119
1.83ArgTyr: 1.83 ± 0.666
0.0ArgXaa: 0.0 ± 0.0
Ser
4.455SerAla: 4.455 ± 0.541
2.386SerCys: 2.386 ± 0.794
3.102SerAsp: 3.102 ± 0.39
3.261SerGlu: 3.261 ± 0.445
3.58SerPhe: 3.58 ± 0.499
2.943SerGly: 2.943 ± 0.669
2.227SerHis: 2.227 ± 0.687
5.727SerIle: 5.727 ± 1.157
1.909SerLys: 1.909 ± 0.606
8.432SerLeu: 8.432 ± 1.265
1.511SerMet: 1.511 ± 0.388
4.375SerAsn: 4.375 ± 0.564
2.943SerPro: 2.943 ± 0.676
2.068SerGln: 2.068 ± 0.248
2.943SerArg: 2.943 ± 0.596
6.284SerSer: 6.284 ± 0.411
6.205SerThr: 6.205 ± 0.758
3.5SerVal: 3.5 ± 0.838
0.08SerTrp: 0.08 ± 0.135
6.443SerTyr: 6.443 ± 0.698
0.0SerXaa: 0.0 ± 0.0
Thr
4.534ThrAla: 4.534 ± 0.378
2.148ThrCys: 2.148 ± 0.411
3.341ThrAsp: 3.341 ± 0.417
2.943ThrGlu: 2.943 ± 0.357
3.818ThrPhe: 3.818 ± 0.645
3.898ThrGly: 3.898 ± 0.414
4.137ThrHis: 4.137 ± 0.576
6.125ThrIle: 6.125 ± 0.832
4.534ThrLys: 4.534 ± 0.465
10.58ThrLeu: 10.58 ± 1.479
0.477ThrMet: 0.477 ± 0.115
3.58ThrAsn: 3.58 ± 0.439
4.852ThrPro: 4.852 ± 1.449
3.341ThrGln: 3.341 ± 1.115
3.58ThrArg: 3.58 ± 1.048
6.364ThrSer: 6.364 ± 0.71
7.796ThrThr: 7.796 ± 0.869
4.614ThrVal: 4.614 ± 0.68
1.114ThrTrp: 1.114 ± 0.126
5.091ThrTyr: 5.091 ± 0.459
0.0ThrXaa: 0.0 ± 0.0
Val
2.705ValAla: 2.705 ± 0.339
1.511ValCys: 1.511 ± 0.199
2.784ValAsp: 2.784 ± 0.416
2.466ValGlu: 2.466 ± 0.523
1.75ValPhe: 1.75 ± 0.181
2.943ValGly: 2.943 ± 0.53
1.591ValHis: 1.591 ± 0.253
4.057ValIle: 4.057 ± 0.493
1.671ValLys: 1.671 ± 0.546
3.898ValLeu: 3.898 ± 0.708
1.114ValMet: 1.114 ± 0.329
4.375ValAsn: 4.375 ± 0.965
2.864ValPro: 2.864 ± 0.493
1.114ValGln: 1.114 ± 0.339
2.705ValArg: 2.705 ± 0.348
4.057ValSer: 4.057 ± 0.443
5.171ValThr: 5.171 ± 0.607
2.546ValVal: 2.546 ± 0.629
0.318ValTrp: 0.318 ± 0.079
2.705ValTyr: 2.705 ± 0.658
0.0ValXaa: 0.0 ± 0.0
Trp
0.875TrpAla: 0.875 ± 0.579
0.159TrpCys: 0.159 ± 0.056
0.08TrpAsp: 0.08 ± 0.135
0.318TrpGlu: 0.318 ± 0.079
0.636TrpPhe: 0.636 ± 0.187
0.716TrpGly: 0.716 ± 0.226
0.318TrpHis: 0.318 ± 0.113
0.398TrpIle: 0.398 ± 0.09
0.477TrpLys: 0.477 ± 0.143
0.636TrpLeu: 0.636 ± 0.103
0.08TrpMet: 0.08 ± 0.135
0.875TrpAsn: 0.875 ± 0.258
0.239TrpPro: 0.239 ± 0.071
0.239TrpGln: 0.239 ± 0.071
0.159TrpArg: 0.159 ± 0.056
0.477TrpSer: 0.477 ± 0.498
0.398TrpThr: 0.398 ± 0.663
0.318TrpVal: 0.318 ± 0.11
0.0TrpTrp: 0.0 ± 0.0
0.318TrpTyr: 0.318 ± 0.11
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.909TyrAla: 1.909 ± 0.545
1.432TyrCys: 1.432 ± 0.452
3.977TyrAsp: 3.977 ± 0.754
1.75TyrGlu: 1.75 ± 0.254
2.784TyrPhe: 2.784 ± 0.574
1.352TyrGly: 1.352 ± 0.597
2.068TyrHis: 2.068 ± 0.326
4.455TyrIle: 4.455 ± 0.949
2.148TyrLys: 2.148 ± 0.363
3.977TyrLeu: 3.977 ± 0.954
0.557TyrMet: 0.557 ± 0.178
5.568TyrAsn: 5.568 ± 0.597
1.671TyrPro: 1.671 ± 0.386
2.386TyrGln: 2.386 ± 0.324
3.182TyrArg: 3.182 ± 0.339
3.977TyrSer: 3.977 ± 0.51
8.114TyrThr: 8.114 ± 0.83
1.989TyrVal: 1.989 ± 0.352
0.159TyrTrp: 0.159 ± 0.056
3.023TyrTyr: 3.023 ± 0.356
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (12572 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski