Amino acid dipepetide frequency for Wenzhou Tick Virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.312AlaAla: 4.312 ± 0.942
1.897AlaCys: 1.897 ± 1.007
4.312AlaAsp: 4.312 ± 0.907
4.657AlaGlu: 4.657 ± 0.664
2.242AlaPhe: 2.242 ± 0.925
2.932AlaGly: 2.932 ± 1.188
0.517AlaHis: 0.517 ± 0.358
4.484AlaIle: 4.484 ± 0.415
3.277AlaLys: 3.277 ± 1.002
5.174AlaLeu: 5.174 ± 1.206
1.897AlaMet: 1.897 ± 0.494
2.932AlaAsn: 2.932 ± 1.198
2.415AlaPro: 2.415 ± 2.273
2.76AlaGln: 2.76 ± 1.209
2.587AlaArg: 2.587 ± 0.223
3.794AlaSer: 3.794 ± 0.7
3.794AlaThr: 3.794 ± 0.421
5.347AlaVal: 5.347 ± 1.217
1.38AlaTrp: 1.38 ± 0.66
3.277AlaTyr: 3.277 ± 0.408
0.0AlaXaa: 0.0 ± 0.0
Cys
1.552CysAla: 1.552 ± 0.742
1.035CysCys: 1.035 ± 0.258
0.345CysAsp: 0.345 ± 0.45
1.552CysGlu: 1.552 ± 0.386
1.207CysPhe: 1.207 ± 0.274
0.517CysGly: 0.517 ± 0.129
0.862CysHis: 0.862 ± 0.535
1.207CysIle: 1.207 ± 0.692
1.38CysLys: 1.38 ± 0.178
2.76CysLeu: 2.76 ± 0.96
0.345CysMet: 0.345 ± 0.185
0.862CysAsn: 0.862 ± 0.827
1.897CysPro: 1.897 ± 1.606
0.862CysGln: 0.862 ± 0.462
1.38CysArg: 1.38 ± 0.634
2.242CysSer: 2.242 ± 0.901
1.897CysThr: 1.897 ± 1.373
2.07CysVal: 2.07 ± 0.515
0.172CysTrp: 0.172 ± 0.092
0.517CysTyr: 0.517 ± 0.129
0.0CysXaa: 0.0 ± 0.0
Asp
3.449AspAla: 3.449 ± 2.471
2.242AspCys: 2.242 ± 1.511
2.242AspAsp: 2.242 ± 0.291
2.76AspGlu: 2.76 ± 0.314
1.552AspPhe: 1.552 ± 0.386
2.76AspGly: 2.76 ± 0.32
1.38AspHis: 1.38 ± 0.397
2.932AspIle: 2.932 ± 0.768
2.587AspLys: 2.587 ± 0.806
5.002AspLeu: 5.002 ± 1.378
1.207AspMet: 1.207 ± 1.126
1.552AspAsn: 1.552 ± 0.111
1.897AspPro: 1.897 ± 0.494
2.07AspGln: 2.07 ± 0.545
3.449AspArg: 3.449 ± 0.795
3.622AspSer: 3.622 ± 0.979
2.587AspThr: 2.587 ± 0.339
2.932AspVal: 2.932 ± 0.835
1.552AspTrp: 1.552 ± 1.504
1.207AspTyr: 1.207 ± 0.647
0.0AspXaa: 0.0 ± 0.0
Glu
5.864GluAla: 5.864 ± 1.027
1.38GluCys: 1.38 ± 0.429
4.657GluAsp: 4.657 ± 0.68
4.829GluGlu: 4.829 ± 0.915
1.897GluPhe: 1.897 ± 0.751
3.794GluGly: 3.794 ± 0.868
1.035GluHis: 1.035 ± 0.416
2.76GluIle: 2.76 ± 0.96
5.519GluLys: 5.519 ± 0.266
8.451GluLeu: 8.451 ± 2.567
1.897GluMet: 1.897 ± 0.768
2.07GluAsn: 2.07 ± 0.678
2.76GluPro: 2.76 ± 0.754
2.932GluGln: 2.932 ± 0.801
2.587GluArg: 2.587 ± 0.477
5.002GluSer: 5.002 ± 1.378
3.967GluThr: 3.967 ± 0.499
6.037GluVal: 6.037 ± 0.905
0.345GluTrp: 0.345 ± 0.45
0.862GluTyr: 0.862 ± 0.375
0.0GluXaa: 0.0 ± 0.0
Phe
2.932PheAla: 2.932 ± 0.489
0.862PheCys: 0.862 ± 0.274
1.897PheAsp: 1.897 ± 0.742
2.76PheGlu: 2.76 ± 0.895
2.242PhePhe: 2.242 ± 0.535
1.552PheGly: 1.552 ± 0.936
0.862PheHis: 0.862 ± 0.225
1.38PheIle: 1.38 ± 0.48
1.897PheLys: 1.897 ± 0.768
5.002PheLeu: 5.002 ± 0.543
1.38PheMet: 1.38 ± 0.705
1.897PheAsn: 1.897 ± 0.428
1.552PhePro: 1.552 ± 0.588
1.207PheGln: 1.207 ± 0.274
1.38PheArg: 1.38 ± 0.178
3.794PheSer: 3.794 ± 0.066
2.242PheThr: 2.242 ± 0.049
1.897PheVal: 1.897 ± 0.786
0.172PheTrp: 0.172 ± 0.092
1.552PheTyr: 1.552 ± 0.379
0.0PheXaa: 0.0 ± 0.0
Gly
2.932GlyAla: 2.932 ± 0.206
1.207GlyCys: 1.207 ± 0.98
2.587GlyAsp: 2.587 ± 1.314
2.76GlyGlu: 2.76 ± 0.515
1.897GlyPhe: 1.897 ± 0.786
3.277GlyGly: 3.277 ± 0.581
1.207GlyHis: 1.207 ± 0.274
2.07GlyIle: 2.07 ± 1.38
3.449GlyLys: 3.449 ± 0.683
7.071GlyLeu: 7.071 ± 1.335
1.897GlyMet: 1.897 ± 0.529
1.552GlyAsn: 1.552 ± 0.314
2.07GlyPro: 2.07 ± 0.678
1.897GlyGln: 1.897 ± 0.581
3.277GlyArg: 3.277 ± 1.467
5.002GlySer: 5.002 ± 0.354
4.829GlyThr: 4.829 ± 1.713
3.105GlyVal: 3.105 ± 0.673
0.517GlyTrp: 0.517 ± 0.277
1.38GlyTyr: 1.38 ± 0.723
0.172GlyXaa: 0.172 ± 0.092
His
1.035HisAla: 1.035 ± 0.555
1.035HisCys: 1.035 ± 0.555
0.862HisAsp: 0.862 ± 0.225
0.862HisGlu: 0.862 ± 0.274
0.345HisPhe: 0.345 ± 0.185
1.38HisGly: 1.38 ± 0.317
0.517HisHis: 0.517 ± 0.129
1.38HisIle: 1.38 ± 0.723
0.862HisLys: 0.862 ± 0.225
2.242HisLeu: 2.242 ± 0.523
1.207HisMet: 1.207 ± 0.706
0.345HisAsn: 0.345 ± 0.158
1.38HisPro: 1.38 ± 0.827
0.862HisGln: 0.862 ± 0.27
0.862HisArg: 0.862 ± 0.462
2.415HisSer: 2.415 ± 0.547
1.035HisThr: 1.035 ± 0.231
2.242HisVal: 2.242 ± 0.901
0.345HisTrp: 0.345 ± 0.45
0.69HisTyr: 0.69 ± 0.33
0.0HisXaa: 0.0 ± 0.0
Ile
2.932IleAla: 2.932 ± 0.693
1.552IleCys: 1.552 ± 0.585
1.035IleAsp: 1.035 ± 0.416
2.242IleGlu: 2.242 ± 0.535
2.242IlePhe: 2.242 ± 0.631
1.725IleGly: 1.725 ± 0.102
1.552IleHis: 1.552 ± 0.111
2.242IleIle: 2.242 ± 0.049
5.692IleLys: 5.692 ± 0.482
4.139IleLeu: 4.139 ± 0.744
1.38IleMet: 1.38 ± 0.258
1.38IleAsn: 1.38 ± 0.429
1.035IlePro: 1.035 ± 0.555
2.242IleGln: 2.242 ± 0.523
2.587IleArg: 2.587 ± 1.087
4.139IleSer: 4.139 ± 0.474
2.76IleThr: 2.76 ± 1.013
5.347IleVal: 5.347 ± 0.537
0.69IleTrp: 0.69 ± 0.37
1.725IleTyr: 1.725 ± 0.577
0.0IleXaa: 0.0 ± 0.0
Lys
5.864LysAla: 5.864 ± 0.085
0.345LysCys: 0.345 ± 0.185
3.794LysAsp: 3.794 ± 0.7
5.002LysGlu: 5.002 ± 0.785
1.897LysPhe: 1.897 ± 0.529
2.76LysGly: 2.76 ± 0.668
1.552LysHis: 1.552 ± 0.386
2.932LysIle: 2.932 ± 0.327
5.002LysLys: 5.002 ± 0.956
8.451LysLeu: 8.451 ± 0.989
1.552LysMet: 1.552 ± 0.506
2.242LysAsn: 2.242 ± 1.077
3.277LysPro: 3.277 ± 0.192
1.38LysGln: 1.38 ± 0.48
2.932LysArg: 2.932 ± 0.689
4.484LysSer: 4.484 ± 1.204
4.484LysThr: 4.484 ± 0.616
5.519LysVal: 5.519 ± 1.031
0.69LysTrp: 0.69 ± 0.159
2.242LysTyr: 2.242 ± 0.934
0.0LysXaa: 0.0 ± 0.0
Leu
6.726LeuAla: 6.726 ± 0.863
2.587LeuCys: 2.587 ± 0.443
4.312LeuAsp: 4.312 ± 0.882
8.796LeuGlu: 8.796 ± 1.776
4.312LeuPhe: 4.312 ± 0.071
8.106LeuGly: 8.106 ± 1.345
3.277LeuHis: 3.277 ± 0.876
4.312LeuIle: 4.312 ± 1.176
8.451LeuLys: 8.451 ± 0.787
11.211LeuLeu: 11.211 ± 1.747
3.277LeuMet: 3.277 ± 0.352
3.967LeuAsn: 3.967 ± 0.218
5.347LeuPro: 5.347 ± 1.229
3.967LeuGln: 3.967 ± 0.701
7.589LeuArg: 7.589 ± 1.241
10.176LeuSer: 10.176 ± 0.799
5.864LeuThr: 5.864 ± 0.252
6.382LeuVal: 6.382 ± 1.442
0.517LeuTrp: 0.517 ± 0.378
3.277LeuTyr: 3.277 ± 0.828
0.0LeuXaa: 0.0 ± 0.0
Met
2.932MetAla: 2.932 ± 0.327
0.517MetCys: 0.517 ± 0.358
1.207MetAsp: 1.207 ± 0.471
0.69MetGlu: 0.69 ± 0.37
0.862MetPhe: 0.862 ± 0.274
1.552MetGly: 1.552 ± 0.633
0.172MetHis: 0.172 ± 0.428
1.38MetIle: 1.38 ± 0.48
2.07MetLys: 2.07 ± 0.851
3.449MetLeu: 3.449 ± 0.852
0.69MetMet: 0.69 ± 0.159
1.035MetAsn: 1.035 ± 0.306
1.207MetPro: 1.207 ± 0.392
2.07MetGln: 2.07 ± 0.463
1.207MetArg: 1.207 ± 0.65
2.07MetSer: 2.07 ± 0.701
1.38MetThr: 1.38 ± 0.48
1.725MetVal: 1.725 ± 0.539
0.345MetTrp: 0.345 ± 0.483
0.517MetTyr: 0.517 ± 0.277
0.0MetXaa: 0.0 ± 0.0
Asn
1.035AsnAla: 1.035 ± 0.306
1.207AsnCys: 1.207 ± 0.692
1.897AsnAsp: 1.897 ± 1.225
0.69AsnGlu: 0.69 ± 0.355
0.69AsnPhe: 0.69 ± 0.33
2.07AsnGly: 2.07 ± 2.103
1.035AsnHis: 1.035 ± 0.306
2.242AsnIle: 2.242 ± 0.449
1.725AsnLys: 1.725 ± 0.539
6.899AsnLeu: 6.899 ± 0.76
1.035AsnMet: 1.035 ± 0.555
0.69AsnAsn: 0.69 ± 0.159
1.725AsnPro: 1.725 ± 0.102
0.862AsnGln: 0.862 ± 0.375
1.897AsnArg: 1.897 ± 0.159
3.794AsnSer: 3.794 ± 1.059
2.76AsnThr: 2.76 ± 0.379
1.035AsnVal: 1.035 ± 0.348
0.69AsnTrp: 0.69 ± 0.159
0.69AsnTyr: 0.69 ± 0.317
0.0AsnXaa: 0.0 ± 0.0
Pro
2.242ProAla: 2.242 ± 0.631
1.207ProCys: 1.207 ± 0.26
2.587ProAsp: 2.587 ± 0.339
5.002ProGlu: 5.002 ± 0.385
1.897ProPhe: 1.897 ± 0.968
3.105ProGly: 3.105 ± 1.946
0.69ProHis: 0.69 ± 0.317
1.897ProIle: 1.897 ± 0.159
2.587ProLys: 2.587 ± 0.676
4.312ProLeu: 4.312 ± 0.823
1.207ProMet: 1.207 ± 0.392
1.38ProAsn: 1.38 ± 0.429
1.207ProPro: 1.207 ± 0.692
0.517ProGln: 0.517 ± 0.358
1.897ProArg: 1.897 ± 0.463
3.967ProSer: 3.967 ± 1.745
2.415ProThr: 2.415 ± 1.042
1.725ProVal: 1.725 ± 1.134
0.69ProTrp: 0.69 ± 0.33
0.862ProTyr: 0.862 ± 0.274
0.0ProXaa: 0.0 ± 0.0
Gln
2.07GlnAla: 2.07 ± 0.991
1.207GlnCys: 1.207 ± 0.392
2.07GlnAsp: 2.07 ± 0.946
2.587GlnGlu: 2.587 ± 0.871
2.587GlnPhe: 2.587 ± 0.506
1.725GlnGly: 1.725 ± 0.876
0.862GlnHis: 0.862 ± 0.462
1.725GlnIle: 1.725 ± 0.451
2.07GlnLys: 2.07 ± 0.066
4.312GlnLeu: 4.312 ± 0.55
1.035GlnMet: 1.035 ± 0.306
1.552GlnAsn: 1.552 ± 0.57
1.38GlnPro: 1.38 ± 0.317
1.552GlnGln: 1.552 ± 0.61
1.897GlnArg: 1.897 ± 0.428
1.552GlnSer: 1.552 ± 0.596
2.07GlnThr: 2.07 ± 1.11
1.38GlnVal: 1.38 ± 0.258
0.517GlnTrp: 0.517 ± 0.808
1.035GlnTyr: 1.035 ± 0.715
0.0GlnXaa: 0.0 ± 0.0
Arg
3.794ArgAla: 3.794 ± 1.802
1.725ArgCys: 1.725 ± 0.397
3.105ArgAsp: 3.105 ± 0.808
3.105ArgGlu: 3.105 ± 0.499
1.725ArgPhe: 1.725 ± 0.245
1.38ArgGly: 1.38 ± 0.317
1.552ArgHis: 1.552 ± 0.379
3.105ArgIle: 3.105 ± 0.918
3.277ArgLys: 3.277 ± 0.956
7.934ArgLeu: 7.934 ± 1.654
1.38ArgMet: 1.38 ± 0.429
2.242ArgAsn: 2.242 ± 0.523
1.897ArgPro: 1.897 ± 0.428
1.897ArgGln: 1.897 ± 0.768
3.105ArgArg: 3.105 ± 0.499
4.139ArgSer: 4.139 ± 0.479
3.277ArgThr: 3.277 ± 0.352
2.76ArgVal: 2.76 ± 0.166
0.69ArgTrp: 0.69 ± 0.528
1.38ArgTyr: 1.38 ± 0.74
0.0ArgXaa: 0.0 ± 0.0
Ser
5.174SerAla: 5.174 ± 1.602
1.207SerCys: 1.207 ± 0.428
3.794SerAsp: 3.794 ± 0.925
5.519SerGlu: 5.519 ± 0.876
3.794SerPhe: 3.794 ± 0.421
5.692SerGly: 5.692 ± 0.836
1.207SerHis: 1.207 ± 0.26
4.657SerIle: 4.657 ± 1.117
6.037SerLys: 6.037 ± 1.181
8.451SerLeu: 8.451 ± 1.655
1.38SerMet: 1.38 ± 0.397
3.622SerAsn: 3.622 ± 0.365
3.105SerPro: 3.105 ± 1.266
2.415SerGln: 2.415 ± 1.042
4.829SerArg: 4.829 ± 1.567
13.28SerSer: 13.28 ± 1.457
5.519SerThr: 5.519 ± 0.629
6.554SerVal: 6.554 ± 0.877
1.552SerTrp: 1.552 ± 1.544
2.415SerTyr: 2.415 ± 0.82
0.172SerXaa: 0.172 ± 0.092
Thr
2.932ThrAla: 2.932 ± 0.765
1.552ThrCys: 1.552 ± 1.24
2.76ThrAsp: 2.76 ± 0.597
5.347ThrGlu: 5.347 ± 2.071
2.587ThrPhe: 2.587 ± 0.223
4.139ThrGly: 4.139 ± 1.012
1.38ThrHis: 1.38 ± 0.429
2.415ThrIle: 2.415 ± 0.65
2.76ThrLys: 2.76 ± 0.762
7.244ThrLeu: 7.244 ± 1.215
1.897ThrMet: 1.897 ± 0.581
2.242ThrAsn: 2.242 ± 0.595
2.415ThrPro: 2.415 ± 0.517
1.207ThrGln: 1.207 ± 0.274
3.277ThrArg: 3.277 ± 0.293
5.692ThrSer: 5.692 ± 0.461
3.967ThrThr: 3.967 ± 0.732
4.484ThrVal: 4.484 ± 0.694
0.862ThrTrp: 0.862 ± 0.274
1.725ThrTyr: 1.725 ± 0.384
0.0ThrXaa: 0.0 ± 0.0
Val
5.174ValAla: 5.174 ± 0.233
0.69ValCys: 0.69 ± 0.33
3.622ValAsp: 3.622 ± 0.365
6.382ValGlu: 6.382 ± 0.529
1.897ValPhe: 1.897 ± 0.428
2.932ValGly: 2.932 ± 0.282
1.035ValHis: 1.035 ± 0.258
2.76ValIle: 2.76 ± 0.651
5.347ValLys: 5.347 ± 0.322
6.209ValLeu: 6.209 ± 0.547
1.207ValMet: 1.207 ± 0.392
1.897ValAsn: 1.897 ± 0.786
3.622ValPro: 3.622 ± 0.57
2.76ValGln: 2.76 ± 0.668
3.622ValArg: 3.622 ± 0.979
7.244ValSer: 7.244 ± 1.55
4.139ValThr: 4.139 ± 0.755
4.829ValVal: 4.829 ± 0.536
0.345ValTrp: 0.345 ± 0.158
1.207ValTyr: 1.207 ± 0.26
0.172ValXaa: 0.172 ± 0.092
Trp
0.172TrpAla: 0.172 ± 0.428
0.172TrpCys: 0.172 ± 0.225
0.172TrpAsp: 0.172 ± 0.225
1.035TrpGlu: 1.035 ± 0.475
1.035TrpPhe: 1.035 ± 0.904
1.207TrpGly: 1.207 ± 0.521
0.172TrpHis: 0.172 ± 0.092
0.172TrpIle: 0.172 ± 0.225
0.862TrpLys: 0.862 ± 0.736
0.517TrpLeu: 0.517 ± 0.277
0.517TrpMet: 0.517 ± 0.378
0.345TrpAsn: 0.345 ± 0.383
0.517TrpPro: 0.517 ± 0.378
0.517TrpGln: 0.517 ± 0.129
1.035TrpArg: 1.035 ± 0.231
1.552TrpSer: 1.552 ± 1.082
0.862TrpThr: 0.862 ± 0.27
1.207TrpVal: 1.207 ± 0.521
0.345TrpTrp: 0.345 ± 0.45
0.517TrpTyr: 0.517 ± 0.358
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.862TyrAla: 0.862 ± 0.225
1.035TyrCys: 1.035 ± 0.306
1.725TyrAsp: 1.725 ± 0.75
2.07TyrGlu: 2.07 ± 0.066
1.552TyrPhe: 1.552 ± 0.379
1.552TyrGly: 1.552 ± 0.57
1.035TyrHis: 1.035 ± 0.416
2.587TyrIle: 2.587 ± 0.443
1.725TyrLys: 1.725 ± 0.539
3.622TyrLeu: 3.622 ± 0.523
0.517TyrMet: 0.517 ± 0.129
0.862TyrAsn: 0.862 ± 0.225
0.69TyrPro: 0.69 ± 0.816
1.035TyrGln: 1.035 ± 0.416
1.897TyrArg: 1.897 ± 0.154
2.415TyrSer: 2.415 ± 0.603
1.035TyrThr: 1.035 ± 0.258
0.517TyrVal: 0.517 ± 0.277
0.517TyrTrp: 0.517 ± 0.358
0.862TyrTyr: 0.862 ± 0.274
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.172XaaGln: 0.172 ± 0.092
0.172XaaArg: 0.172 ± 0.092
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.172XaaTyr: 0.172 ± 0.092
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5799 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski