Amino acid dipepetide frequency for Wenling hepe-like virus 4

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.954AlaAla: 4.954 ± 1.691
1.858AlaCys: 1.858 ± 0.488
2.786AlaAsp: 2.786 ± 0.669
4.025AlaGlu: 4.025 ± 0.773
3.096AlaPhe: 3.096 ± 1.057
2.477AlaGly: 2.477 ± 1.401
1.238AlaHis: 1.238 ± 0.714
3.406AlaIle: 3.406 ± 0.803
3.406AlaLys: 3.406 ± 1.289
7.74AlaLeu: 7.74 ± 2.081
1.238AlaMet: 1.238 ± 0.701
2.477AlaAsn: 2.477 ± 0.444
3.406AlaPro: 3.406 ± 0.857
3.096AlaGln: 3.096 ± 1.326
3.096AlaArg: 3.096 ± 0.765
5.263AlaSer: 5.263 ± 2.332
2.167AlaThr: 2.167 ± 1.598
4.025AlaVal: 4.025 ± 0.632
0.31AlaTrp: 0.31 ± 0.35
2.477AlaTyr: 2.477 ± 0.724
0.0AlaXaa: 0.0 ± 0.0
Cys
1.858CysAla: 1.858 ± 0.94
0.0CysCys: 0.0 ± 0.0
1.858CysAsp: 1.858 ± 0.799
1.238CysGlu: 1.238 ± 0.492
2.477CysPhe: 2.477 ± 1.191
1.238CysGly: 1.238 ± 0.714
0.619CysHis: 0.619 ± 0.428
0.619CysIle: 0.619 ± 0.313
1.238CysLys: 1.238 ± 0.678
2.477CysLeu: 2.477 ± 0.694
0.31CysMet: 0.31 ± 0.423
1.238CysAsn: 1.238 ± 0.491
1.238CysPro: 1.238 ± 0.676
0.31CysGln: 0.31 ± 0.35
0.929CysArg: 0.929 ± 0.413
2.786CysSer: 2.786 ± 0.442
0.929CysThr: 0.929 ± 0.392
4.025CysVal: 4.025 ± 0.908
0.0CysTrp: 0.0 ± 0.0
0.929CysTyr: 0.929 ± 0.371
0.0CysXaa: 0.0 ± 0.0
Asp
2.167AspAla: 2.167 ± 0.966
1.858AspCys: 1.858 ± 0.488
3.406AspAsp: 3.406 ± 1.142
4.334AspGlu: 4.334 ± 0.874
4.025AspPhe: 4.025 ± 1.249
3.096AspGly: 3.096 ± 2.222
0.619AspHis: 0.619 ± 0.389
4.644AspIle: 4.644 ± 0.77
3.406AspLys: 3.406 ± 1.963
4.954AspLeu: 4.954 ± 1.366
0.619AspMet: 0.619 ± 0.357
1.238AspAsn: 1.238 ± 0.777
2.167AspPro: 2.167 ± 0.334
3.406AspGln: 3.406 ± 0.921
3.715AspArg: 3.715 ± 0.657
6.192AspSer: 6.192 ± 1.196
3.406AspThr: 3.406 ± 0.799
4.334AspVal: 4.334 ± 0.868
0.929AspTrp: 0.929 ± 0.413
2.477AspTyr: 2.477 ± 1.029
0.0AspXaa: 0.0 ± 0.0
Glu
2.477GluAla: 2.477 ± 1.214
1.548GluCys: 1.548 ± 0.892
2.786GluAsp: 2.786 ± 0.945
4.644GluGlu: 4.644 ± 2.134
3.406GluPhe: 3.406 ± 0.75
1.238GluGly: 1.238 ± 0.491
1.858GluHis: 1.858 ± 1.607
4.334GluIle: 4.334 ± 0.992
2.786GluLys: 2.786 ± 0.64
3.715GluLeu: 3.715 ± 0.795
2.167GluMet: 2.167 ± 0.71
2.477GluAsn: 2.477 ± 0.905
2.167GluPro: 2.167 ± 1.148
1.238GluGln: 1.238 ± 0.714
4.025GluArg: 4.025 ± 1.904
4.954GluSer: 4.954 ± 0.748
2.786GluThr: 2.786 ± 1.116
4.334GluVal: 4.334 ± 1.495
0.31GluTrp: 0.31 ± 0.178
3.096GluTyr: 3.096 ± 0.883
0.0GluXaa: 0.0 ± 0.0
Phe
5.573PheAla: 5.573 ± 1.122
1.548PheCys: 1.548 ± 0.531
2.786PheAsp: 2.786 ± 0.719
3.406PheGlu: 3.406 ± 1.137
3.096PhePhe: 3.096 ± 1.249
4.954PheGly: 4.954 ± 0.889
1.238PheHis: 1.238 ± 0.714
2.477PheIle: 2.477 ± 1.045
4.334PheLys: 4.334 ± 1.155
5.263PheLeu: 5.263 ± 1.485
1.238PheMet: 1.238 ± 0.578
2.786PheAsn: 2.786 ± 1.363
1.548PhePro: 1.548 ± 1.914
1.238PheGln: 1.238 ± 0.426
2.477PheArg: 2.477 ± 0.86
5.573PheSer: 5.573 ± 1.727
4.025PheThr: 4.025 ± 1.217
4.644PheVal: 4.644 ± 0.405
0.619PheTrp: 0.619 ± 0.836
1.548PheTyr: 1.548 ± 0.966
0.0PheXaa: 0.0 ± 0.0
Gly
2.786GlyAla: 2.786 ± 0.912
0.619GlyCys: 0.619 ± 0.357
4.025GlyAsp: 4.025 ± 1.047
4.644GlyGlu: 4.644 ± 1.046
2.786GlyPhe: 2.786 ± 1.096
1.858GlyGly: 1.858 ± 0.626
0.0GlyHis: 0.0 ± 0.0
1.858GlyIle: 1.858 ± 0.508
4.954GlyLys: 4.954 ± 0.828
3.406GlyLeu: 3.406 ± 0.91
0.929GlyMet: 0.929 ± 0.387
1.858GlyAsn: 1.858 ± 0.799
1.858GlyPro: 1.858 ± 0.57
1.548GlyGln: 1.548 ± 0.892
2.477GlyArg: 2.477 ± 1.083
4.954GlySer: 4.954 ± 1.477
3.715GlyThr: 3.715 ± 2.196
3.715GlyVal: 3.715 ± 1.016
0.929GlyTrp: 0.929 ± 0.74
4.334GlyTyr: 4.334 ± 0.797
0.0GlyXaa: 0.0 ± 0.0
His
1.858HisAla: 1.858 ± 0.799
1.238HisCys: 1.238 ± 0.714
2.167HisAsp: 2.167 ± 0.734
0.929HisGlu: 0.929 ± 0.535
0.929HisPhe: 0.929 ± 0.535
0.929HisGly: 0.929 ± 0.535
0.619HisHis: 0.619 ± 0.357
0.0HisIle: 0.0 ± 0.0
0.929HisLys: 0.929 ± 0.535
0.929HisLeu: 0.929 ± 0.535
0.619HisMet: 0.619 ± 0.635
1.548HisAsn: 1.548 ± 1.284
0.31HisPro: 0.31 ± 0.178
0.31HisGln: 0.31 ± 0.178
0.929HisArg: 0.929 ± 0.392
2.786HisSer: 2.786 ± 0.414
0.619HisThr: 0.619 ± 0.357
2.167HisVal: 2.167 ± 0.661
0.619HisTrp: 0.619 ± 0.428
0.619HisTyr: 0.619 ± 0.534
0.0HisXaa: 0.0 ± 0.0
Ile
3.406IleAla: 3.406 ± 0.704
0.619IleCys: 0.619 ± 0.635
2.167IleAsp: 2.167 ± 1.351
3.715IleGlu: 3.715 ± 1.276
3.096IlePhe: 3.096 ± 1.258
3.406IleGly: 3.406 ± 0.617
0.31IleHis: 0.31 ± 0.178
3.406IleIle: 3.406 ± 1.499
3.096IleLys: 3.096 ± 1.065
4.025IleLeu: 4.025 ± 1.764
0.929IleMet: 0.929 ± 0.431
3.096IleAsn: 3.096 ± 0.602
2.477IlePro: 2.477 ± 1.229
1.238IleGln: 1.238 ± 0.714
2.167IleArg: 2.167 ± 0.852
9.598IleSer: 9.598 ± 3.281
3.715IleThr: 3.715 ± 0.862
2.786IleVal: 2.786 ± 1.023
0.0IleTrp: 0.0 ± 0.0
2.786IleTyr: 2.786 ± 0.774
0.0IleXaa: 0.0 ± 0.0
Lys
4.025LysAla: 4.025 ± 0.808
0.929LysCys: 0.929 ± 0.535
4.644LysAsp: 4.644 ± 0.89
2.786LysGlu: 2.786 ± 0.869
3.406LysPhe: 3.406 ± 1.352
4.334LysGly: 4.334 ± 1.591
1.238LysHis: 1.238 ± 0.597
4.334LysIle: 4.334 ± 1.052
4.025LysLys: 4.025 ± 1.346
4.954LysLeu: 4.954 ± 1.63
0.619LysMet: 0.619 ± 0.346
4.025LysAsn: 4.025 ± 1.615
0.929LysPro: 0.929 ± 0.521
3.096LysGln: 3.096 ± 0.564
0.619LysArg: 0.619 ± 0.357
2.786LysSer: 2.786 ± 1.457
2.477LysThr: 2.477 ± 0.777
5.573LysVal: 5.573 ± 1.814
0.619LysTrp: 0.619 ± 0.356
3.715LysTyr: 3.715 ± 1.504
0.0LysXaa: 0.0 ± 0.0
Leu
6.502LeuAla: 6.502 ± 1.87
2.477LeuCys: 2.477 ± 1.083
7.74LeuAsp: 7.74 ± 1.055
4.644LeuGlu: 4.644 ± 1.532
4.644LeuPhe: 4.644 ± 0.39
2.786LeuGly: 2.786 ± 0.854
2.477LeuHis: 2.477 ± 1.029
4.334LeuIle: 4.334 ± 1.097
4.954LeuLys: 4.954 ± 1.724
7.74LeuLeu: 7.74 ± 0.876
1.548LeuMet: 1.548 ± 0.696
6.811LeuAsn: 6.811 ± 1.215
5.263LeuPro: 5.263 ± 1.692
1.548LeuGln: 1.548 ± 0.648
3.715LeuArg: 3.715 ± 1.355
5.573LeuSer: 5.573 ± 2.091
4.334LeuThr: 4.334 ± 1.398
4.334LeuVal: 4.334 ± 1.052
0.929LeuTrp: 0.929 ± 0.637
2.477LeuTyr: 2.477 ± 1.254
0.0LeuXaa: 0.0 ± 0.0
Met
1.548MetAla: 1.548 ± 0.967
0.31MetCys: 0.31 ± 0.178
1.548MetAsp: 1.548 ± 0.639
0.929MetGlu: 0.929 ± 0.64
0.929MetPhe: 0.929 ± 0.392
0.929MetGly: 0.929 ± 0.392
0.619MetHis: 0.619 ± 0.389
0.929MetIle: 0.929 ± 0.637
1.858MetLys: 1.858 ± 0.585
1.548MetLeu: 1.548 ± 0.531
0.31MetMet: 0.31 ± 0.178
1.858MetAsn: 1.858 ± 0.57
0.619MetPro: 0.619 ± 0.357
0.619MetGln: 0.619 ± 0.356
0.619MetArg: 0.619 ± 0.635
2.167MetSer: 2.167 ± 0.734
0.619MetThr: 0.619 ± 0.357
0.619MetVal: 0.619 ± 0.357
0.0MetTrp: 0.0 ± 0.0
0.619MetTyr: 0.619 ± 0.357
0.0MetXaa: 0.0 ± 0.0
Asn
1.858AsnAla: 1.858 ± 0.466
1.238AsnCys: 1.238 ± 0.451
1.548AsnAsp: 1.548 ± 0.804
1.548AsnGlu: 1.548 ± 0.441
4.025AsnPhe: 4.025 ± 0.694
3.096AsnGly: 3.096 ± 0.888
0.31AsnHis: 0.31 ± 0.643
3.406AsnIle: 3.406 ± 0.948
2.477AsnLys: 2.477 ± 0.678
4.954AsnLeu: 4.954 ± 1.163
1.858AsnMet: 1.858 ± 0.488
2.786AsnAsn: 2.786 ± 0.442
1.238AsnPro: 1.238 ± 0.587
1.858AsnGln: 1.858 ± 0.785
1.548AsnArg: 1.548 ± 0.701
4.644AsnSer: 4.644 ± 1.295
3.715AsnThr: 3.715 ± 1.619
6.192AsnVal: 6.192 ± 1.054
0.31AsnTrp: 0.31 ± 0.643
0.929AsnTyr: 0.929 ± 0.595
0.0AsnXaa: 0.0 ± 0.0
Pro
2.786ProAla: 2.786 ± 1.167
1.238ProCys: 1.238 ± 1.253
3.406ProAsp: 3.406 ± 0.905
1.238ProGlu: 1.238 ± 0.494
1.858ProPhe: 1.858 ± 0.821
1.548ProGly: 1.548 ± 0.682
1.238ProHis: 1.238 ± 0.597
2.167ProIle: 2.167 ± 1.351
1.548ProLys: 1.548 ± 0.639
3.715ProLeu: 3.715 ± 1.819
0.619ProMet: 0.619 ± 0.357
1.548ProAsn: 1.548 ± 0.684
1.238ProPro: 1.238 ± 0.627
1.238ProGln: 1.238 ± 0.627
1.858ProArg: 1.858 ± 0.488
1.858ProSer: 1.858 ± 0.85
3.406ProThr: 3.406 ± 1.386
3.715ProVal: 3.715 ± 0.846
0.929ProTrp: 0.929 ± 0.392
1.238ProTyr: 1.238 ± 0.29
0.0ProXaa: 0.0 ± 0.0
Gln
1.238GlnAla: 1.238 ± 0.862
1.548GlnCys: 1.548 ± 0.764
0.929GlnAsp: 0.929 ± 0.371
1.858GlnGlu: 1.858 ± 0.718
1.858GlnPhe: 1.858 ± 0.951
1.858GlnGly: 1.858 ± 0.743
1.548GlnHis: 1.548 ± 0.892
2.167GlnIle: 2.167 ± 0.966
1.858GlnLys: 1.858 ± 1.074
3.715GlnLeu: 3.715 ± 0.927
0.619GlnMet: 0.619 ± 0.357
1.548GlnAsn: 1.548 ± 0.66
1.548GlnPro: 1.548 ± 0.345
2.477GlnGln: 2.477 ± 0.531
1.858GlnArg: 1.858 ± 0.785
1.858GlnSer: 1.858 ± 0.799
1.858GlnThr: 1.858 ± 0.466
3.406GlnVal: 3.406 ± 1.259
0.0GlnTrp: 0.0 ± 0.0
2.167GlnTyr: 2.167 ± 1.022
0.0GlnXaa: 0.0 ± 0.0
Arg
2.786ArgAla: 2.786 ± 0.881
0.929ArgCys: 0.929 ± 0.392
2.786ArgAsp: 2.786 ± 0.745
4.644ArgGlu: 4.644 ± 1.706
3.406ArgPhe: 3.406 ± 1.21
3.096ArgGly: 3.096 ± 0.564
0.929ArgHis: 0.929 ± 0.433
1.858ArgIle: 1.858 ± 0.581
2.477ArgLys: 2.477 ± 1.136
3.715ArgLeu: 3.715 ± 1.115
0.0ArgMet: 0.0 ± 0.0
1.858ArgAsn: 1.858 ± 0.567
1.858ArgPro: 1.858 ± 0.585
3.406ArgGln: 3.406 ± 1.042
1.238ArgArg: 1.238 ± 0.714
2.477ArgSer: 2.477 ± 0.724
2.167ArgThr: 2.167 ± 0.595
4.954ArgVal: 4.954 ± 1.518
0.31ArgTrp: 0.31 ± 0.35
1.858ArgTyr: 1.858 ± 0.729
0.0ArgXaa: 0.0 ± 0.0
Ser
5.573SerAla: 5.573 ± 1.856
2.477SerCys: 2.477 ± 1.233
4.954SerAsp: 4.954 ± 0.88
5.882SerGlu: 5.882 ± 1.356
7.43SerPhe: 7.43 ± 2.458
5.573SerGly: 5.573 ± 2.389
1.548SerHis: 1.548 ± 0.509
6.811SerIle: 6.811 ± 1.778
4.025SerLys: 4.025 ± 1.25
5.263SerLeu: 5.263 ± 1.152
1.238SerMet: 1.238 ± 0.713
2.167SerAsn: 2.167 ± 0.892
3.406SerPro: 3.406 ± 0.504
1.858SerGln: 1.858 ± 0.57
4.025SerArg: 4.025 ± 1.183
7.74SerSer: 7.74 ± 3.214
4.334SerThr: 4.334 ± 2.451
7.43SerVal: 7.43 ± 1.103
0.31SerTrp: 0.31 ± 0.178
2.786SerTyr: 2.786 ± 0.496
0.0SerXaa: 0.0 ± 0.0
Thr
3.715ThrAla: 3.715 ± 1.718
1.548ThrCys: 1.548 ± 0.682
3.096ThrAsp: 3.096 ± 1.773
1.548ThrGlu: 1.548 ± 0.822
5.573ThrPhe: 5.573 ± 1.376
4.334ThrGly: 4.334 ± 1.054
0.31ThrHis: 0.31 ± 0.178
3.096ThrIle: 3.096 ± 0.933
3.096ThrLys: 3.096 ± 0.689
4.644ThrLeu: 4.644 ± 1.358
0.929ThrMet: 0.929 ± 0.392
1.858ThrAsn: 1.858 ± 0.716
1.548ThrPro: 1.548 ± 0.639
3.096ThrGln: 3.096 ± 1.627
2.786ThrArg: 2.786 ± 0.904
4.334ThrSer: 4.334 ± 1.181
3.096ThrThr: 3.096 ± 1.813
3.715ThrVal: 3.715 ± 1.459
1.238ThrTrp: 1.238 ± 0.492
1.548ThrTyr: 1.548 ± 0.406
0.0ThrXaa: 0.0 ± 0.0
Val
4.644ValAla: 4.644 ± 0.75
1.858ValCys: 1.858 ± 0.517
3.715ValAsp: 3.715 ± 0.639
2.477ValGlu: 2.477 ± 0.982
2.786ValPhe: 2.786 ± 1.121
4.644ValGly: 4.644 ± 1.912
3.096ValHis: 3.096 ± 1.491
2.786ValIle: 2.786 ± 0.642
6.811ValLys: 6.811 ± 1.853
6.811ValLeu: 6.811 ± 1.319
2.477ValMet: 2.477 ± 0.753
4.334ValAsn: 4.334 ± 0.411
4.644ValPro: 4.644 ± 1.27
1.858ValGln: 1.858 ± 0.466
5.882ValArg: 5.882 ± 1.114
5.882ValSer: 5.882 ± 1.652
5.882ValThr: 5.882 ± 1.456
5.573ValVal: 5.573 ± 0.986
0.0ValTrp: 0.0 ± 0.0
3.406ValTyr: 3.406 ± 1.764
0.0ValXaa: 0.0 ± 0.0
Trp
0.619TrpAla: 0.619 ± 0.684
0.619TrpCys: 0.619 ± 0.357
0.0TrpAsp: 0.0 ± 0.0
0.0TrpGlu: 0.0 ± 0.0
0.929TrpPhe: 0.929 ± 0.521
0.619TrpGly: 0.619 ± 0.575
0.31TrpHis: 0.31 ± 0.178
0.619TrpIle: 0.619 ± 0.356
0.0TrpLys: 0.0 ± 0.0
0.929TrpLeu: 0.929 ± 0.559
0.0TrpMet: 0.0 ± 0.0
0.929TrpAsn: 0.929 ± 0.732
0.0TrpPro: 0.0 ± 0.0
0.31TrpGln: 0.31 ± 0.178
1.238TrpArg: 1.238 ± 0.492
0.31TrpSer: 0.31 ± 0.405
0.31TrpThr: 0.31 ± 0.509
0.619TrpVal: 0.619 ± 0.454
0.0TrpTrp: 0.0 ± 0.0
0.0TrpTyr: 0.0 ± 0.0
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.167TyrAla: 2.167 ± 0.61
1.858TyrCys: 1.858 ± 0.374
4.644TyrAsp: 4.644 ± 1.093
1.858TyrGlu: 1.858 ± 0.782
0.619TyrPhe: 0.619 ± 0.7
1.238TyrGly: 1.238 ± 0.714
0.929TyrHis: 0.929 ± 0.569
2.786TyrIle: 2.786 ± 0.629
1.858TyrLys: 1.858 ± 0.508
4.644TyrLeu: 4.644 ± 1.705
0.619TyrMet: 0.619 ± 0.5
3.096TyrAsn: 3.096 ± 1.071
0.929TyrPro: 0.929 ± 0.521
2.167TyrGln: 2.167 ± 0.576
1.548TyrArg: 1.548 ± 0.639
3.096TyrSer: 3.096 ± 1.148
1.548TyrThr: 1.548 ± 0.758
3.406TyrVal: 3.406 ± 1.334
0.0TyrTrp: 0.0 ± 0.0
2.477TyrTyr: 2.477 ± 0.444
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3231 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski