Amino acid dipeptide frequency for Wenling crustacean virus 10

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all amino acids equally frequent, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
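As a rough illustration of the calculation, the sketch below counts overlapping dipeptides within each protein and reports them in per mille, next to the uniform expectation of 2.5‰. The function name `dipeptide_frequencies`, the one-letter input alphabet, the choice not to count pairs across protein boundaries, and the toy sequences are all assumptions made for this example; they are not taken from Proteome-pI.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids; anything else is mapped to "X" (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_frequencies(proteins):
    """Return {dipeptide: frequency in per mille} for a list of protein sequences.

    Dipeptides are overlapping pairs of adjacent residues, counted within each
    protein only (no pair spans two proteins). This convention is an assumption.
    """
    counts = Counter()
    for seq in proteins:
        cleaned = [aa if aa in STANDARD else "X" for aa in seq.upper()]
        counts.update(a + b for a, b in zip(cleaned, cleaned[1:]))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Uniform expectation: each of the 400 standard dipeptides at 1/400 = 2.5 per mille.
EXPECTED_PER_MILLE = 1000.0 / 400

toy_proteins = ["MKLLSA", "MKLS"]  # hypothetical sequences, not from the virus proteome
freqs = dipeptide_frequencies(toy_proteins)
for pair, value in sorted(freqs.items()):
    label = "over" if value > EXPECTED_PER_MILLE else "under"
    print(f"{pair}: {value:.1f} per mille ({label}-represented)")
```

With such short toy input every observed pair is far above 2.5‰; the comparison only becomes informative for proteomes with thousands of residues.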

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.
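The conversion can be sketched as follows, using two values copied from the table below and comparing them with the uniform expectation of 1/400. The helper name `per_mille_to_fraction` is hypothetical and only serves this example.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3

UNIFORM_FRACTION = 1 / 400  # 0.25 %, i.e. 2.5 per mille

# Values copied from the table (LeuSer and AlaMet rows).
for name, value in [("LeuSer", 13.165), ("AlaMet", 0.263)]:
    fraction = per_mille_to_fraction(value)
    ratio = fraction / UNIFORM_FRACTION
    print(f"{name}: {fraction:.6f} ({ratio:.2f}x the uniform expectation)")
```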

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
1.843AlaAla: 1.843 ± 1.73
1.58AlaCys: 1.58 ± 0.418
2.896AlaAsp: 2.896 ± 1.389
2.896AlaGlu: 2.896 ± 0.37
1.316AlaPhe: 1.316 ± 0.541
1.843AlaGly: 1.843 ± 0.684
1.053AlaHis: 1.053 ± 0.97
3.686AlaIle: 3.686 ± 1.816
3.949AlaLys: 3.949 ± 0.965
6.319AlaLeu: 6.319 ± 1.531
0.263AlaMet: 0.263 ± 0.155
2.633AlaAsn: 2.633 ± 0.721
1.316AlaPro: 1.316 ± 0.499
1.053AlaGln: 1.053 ± 0.97
2.37AlaArg: 2.37 ± 0.602
4.476AlaSer: 4.476 ± 0.705
2.633AlaThr: 2.633 ± 0.533
1.316AlaVal: 1.316 ± 0.279
1.58AlaTrp: 1.58 ± 0.546
2.633AlaTyr: 2.633 ± 1.663
0.0AlaXaa: 0.0 ± 0.0
Cys
0.79CysAla: 0.79 ± 0.595
0.527CysCys: 0.527 ± 0.33
0.79CysAsp: 0.79 ± 0.283
1.58CysGlu: 1.58 ± 0.78
1.316CysPhe: 1.316 ± 1.434
0.79CysGly: 0.79 ± 0.595
0.263CysHis: 0.263 ± 0.155
1.58CysIle: 1.58 ± 0.637
1.843CysLys: 1.843 ± 1.184
2.37CysLeu: 2.37 ± 0.927
0.263CysMet: 0.263 ± 0.155
0.527CysAsn: 0.527 ± 0.33
0.527CysPro: 0.527 ± 0.26
0.0CysGln: 0.0 ± 0.0
0.79CysArg: 0.79 ± 0.469
0.79CysSer: 0.79 ± 0.729
0.0CysThr: 0.0 ± 0.0
0.79CysVal: 0.79 ± 0.332
0.79CysTrp: 0.79 ± 0.332
1.316CysTyr: 1.316 ± 0.344
0.0CysXaa: 0.0 ± 0.0
Asp
2.37AspAla: 2.37 ± 0.855
0.79AspCys: 0.79 ± 0.595
5.266AspAsp: 5.266 ± 4.529
4.213AspGlu: 4.213 ± 2.244
3.16AspPhe: 3.16 ± 0.86
2.633AspGly: 2.633 ± 0.525
1.843AspHis: 1.843 ± 0.433
2.106AspIle: 2.106 ± 0.678
4.213AspLys: 4.213 ± 1.215
7.109AspLeu: 7.109 ± 1.409
0.527AspMet: 0.527 ± 0.54
2.633AspAsn: 2.633 ± 1.182
2.896AspPro: 2.896 ± 0.81
1.843AspGln: 1.843 ± 1.246
1.053AspArg: 1.053 ± 0.272
2.106AspSer: 2.106 ± 0.629
2.896AspThr: 2.896 ± 0.815
1.58AspVal: 1.58 ± 0.682
1.316AspTrp: 1.316 ± 0.674
1.58AspTyr: 1.58 ± 0.637
0.0AspXaa: 0.0 ± 0.0
Glu
2.896GluAla: 2.896 ± 0.898
0.527GluCys: 0.527 ± 0.26
4.476GluAsp: 4.476 ± 2.459
6.056GluGlu: 6.056 ± 1.669
2.37GluPhe: 2.37 ± 0.602
4.739GluGly: 4.739 ± 1.618
1.58GluHis: 1.58 ± 0.871
3.686GluIle: 3.686 ± 0.586
6.056GluLys: 6.056 ± 0.892
6.846GluLeu: 6.846 ± 0.655
1.843GluMet: 1.843 ± 0.57
1.58GluAsn: 1.58 ± 0.546
1.053GluPro: 1.053 ± 0.663
2.896GluGln: 2.896 ± 0.758
3.686GluArg: 3.686 ± 0.887
5.529GluSer: 5.529 ± 0.875
2.896GluThr: 2.896 ± 0.876
4.476GluVal: 4.476 ± 1.158
1.58GluTrp: 1.58 ± 0.523
0.79GluTyr: 0.79 ± 0.343
0.0GluXaa: 0.0 ± 0.0
Phe
1.053PheAla: 1.053 ± 0.578
1.053PheCys: 1.053 ± 0.578
0.79PheAsp: 0.79 ± 0.564
2.896PheGlu: 2.896 ± 0.63
1.58PhePhe: 1.58 ± 0.637
2.106PheGly: 2.106 ± 0.642
1.316PheHis: 1.316 ± 0.521
1.316PheIle: 1.316 ± 0.279
2.896PheLys: 2.896 ± 0.782
5.529PheLeu: 5.529 ± 0.322
0.79PheMet: 0.79 ± 0.332
0.263PheAsn: 0.263 ± 0.155
3.16PhePro: 3.16 ± 0.974
1.316PheGln: 1.316 ± 0.775
1.843PheArg: 1.843 ± 0.456
6.056PheSer: 6.056 ± 0.698
1.58PheThr: 1.58 ± 0.566
1.843PheVal: 1.843 ± 0.634
0.263PheTrp: 0.263 ± 0.155
1.843PheTyr: 1.843 ± 0.691
0.0PheXaa: 0.0 ± 0.0
Gly
2.633GlyAla: 2.633 ± 0.731
0.79GlyCys: 0.79 ± 0.343
2.633GlyAsp: 2.633 ± 0.358
3.686GlyGlu: 3.686 ± 1.158
3.686GlyPhe: 3.686 ± 1.653
3.686GlyGly: 3.686 ± 1.187
1.58GlyHis: 1.58 ± 0.566
2.633GlyIle: 2.633 ± 0.905
3.16GlyLys: 3.16 ± 1.819
7.109GlyLeu: 7.109 ± 1.436
1.053GlyMet: 1.053 ± 0.464
1.843GlyAsn: 1.843 ± 0.542
1.053GlyPro: 1.053 ± 0.375
2.106GlyGln: 2.106 ± 0.879
3.423GlyArg: 3.423 ± 0.463
5.266GlySer: 5.266 ± 1.418
3.686GlyThr: 3.686 ± 1.306
4.213GlyVal: 4.213 ± 0.852
2.37GlyTrp: 2.37 ± 0.839
1.316GlyTyr: 1.316 ± 0.318
0.0GlyXaa: 0.0 ± 0.0
His
0.79HisAla: 0.79 ± 0.353
0.263HisCys: 0.263 ± 0.155
0.79HisAsp: 0.79 ± 0.69
1.053HisGlu: 1.053 ± 0.528
1.316HisPhe: 1.316 ± 0.521
1.58HisGly: 1.58 ± 0.677
0.79HisHis: 0.79 ± 0.332
1.316HisIle: 1.316 ± 0.521
0.79HisLys: 0.79 ± 0.283
2.633HisLeu: 2.633 ± 0.559
0.0HisMet: 0.0 ± 0.0
1.053HisAsn: 1.053 ± 0.375
1.843HisPro: 1.843 ± 0.51
1.316HisGln: 1.316 ± 0.775
0.79HisArg: 0.79 ± 0.455
1.843HisSer: 1.843 ± 0.448
1.58HisThr: 1.58 ± 0.58
1.843HisVal: 1.843 ± 0.548
0.79HisTrp: 0.79 ± 0.283
0.79HisTyr: 0.79 ± 0.283
0.0HisXaa: 0.0 ± 0.0
Ile
3.16IleAla: 3.16 ± 0.677
1.843IleCys: 1.843 ± 0.645
2.37IleAsp: 2.37 ± 0.852
2.37IleGlu: 2.37 ± 1.407
1.843IlePhe: 1.843 ± 0.645
4.213IleGly: 4.213 ± 0.914
1.843IleHis: 1.843 ± 0.753
2.896IleIle: 2.896 ± 0.864
5.266IleLys: 5.266 ± 0.327
5.529IleLeu: 5.529 ± 2.202
2.37IleMet: 2.37 ± 0.967
2.106IleAsn: 2.106 ± 1.24
2.37IlePro: 2.37 ± 0.604
4.213IleGln: 4.213 ± 0.928
1.053IleArg: 1.053 ± 0.659
5.529IleSer: 5.529 ± 1.734
3.949IleThr: 3.949 ± 0.886
4.213IleVal: 4.213 ± 1.352
0.79IleTrp: 0.79 ± 0.343
1.843IleTyr: 1.843 ± 0.781
0.0IleXaa: 0.0 ± 0.0
Lys
2.37LysAla: 2.37 ± 1.036
1.053LysCys: 1.053 ± 0.305
3.949LysAsp: 3.949 ± 2.005
7.372LysGlu: 7.372 ± 2.272
1.58LysPhe: 1.58 ± 0.824
5.266LysGly: 5.266 ± 1.374
2.106LysHis: 2.106 ± 0.581
7.372LysIle: 7.372 ± 1.883
5.266LysLys: 5.266 ± 1.581
6.056LysLeu: 6.056 ± 2.384
0.527LysMet: 0.527 ± 0.302
1.58LysAsn: 1.58 ± 1.602
2.896LysPro: 2.896 ± 0.864
2.896LysGln: 2.896 ± 0.815
2.633LysArg: 2.633 ± 0.916
4.476LysSer: 4.476 ± 1.305
5.793LysThr: 5.793 ± 0.466
5.266LysVal: 5.266 ± 1.931
1.843LysTrp: 1.843 ± 0.48
2.37LysTyr: 2.37 ± 0.987
0.0LysXaa: 0.0 ± 0.0
Leu
5.793LeuAla: 5.793 ± 1.037
1.843LeuCys: 1.843 ± 0.578
3.686LeuAsp: 3.686 ± 0.476
6.582LeuGlu: 6.582 ± 1.658
4.739LeuPhe: 4.739 ± 0.74
5.266LeuGly: 5.266 ± 1.17
2.106LeuHis: 2.106 ± 0.669
6.846LeuIle: 6.846 ± 1.994
11.585LeuLys: 11.585 ± 2.859
8.425LeuLeu: 8.425 ± 1.185
3.686LeuMet: 3.686 ± 1.84
3.423LeuAsn: 3.423 ± 0.97
5.003LeuPro: 5.003 ± 1.869
3.423LeuGln: 3.423 ± 0.492
5.793LeuArg: 5.793 ± 2.088
13.165LeuSer: 13.165 ± 1.249
10.269LeuThr: 10.269 ± 1.476
7.109LeuVal: 7.109 ± 1.726
1.316LeuTrp: 1.316 ± 0.775
3.423LeuTyr: 3.423 ± 1.241
0.0LeuXaa: 0.0 ± 0.0
Met
2.106MetAla: 2.106 ± 0.529
0.0MetCys: 0.0 ± 0.0
0.79MetAsp: 0.79 ± 0.366
1.316MetGlu: 1.316 ± 0.425
1.316MetPhe: 1.316 ± 0.541
0.79MetGly: 0.79 ± 0.353
0.0MetHis: 0.0 ± 0.0
1.316MetIle: 1.316 ± 0.318
1.843MetLys: 1.843 ± 0.743
2.106MetLeu: 2.106 ± 0.868
1.58MetMet: 1.58 ± 0.678
0.79MetAsn: 0.79 ± 0.353
0.79MetPro: 0.79 ± 0.465
0.527MetGln: 0.527 ± 0.485
1.053MetArg: 1.053 ± 0.375
1.053MetSer: 1.053 ± 0.503
1.58MetThr: 1.58 ± 0.264
1.316MetVal: 1.316 ± 0.425
0.527MetTrp: 0.527 ± 0.33
1.053MetTyr: 1.053 ± 0.421
0.0MetXaa: 0.0 ± 0.0
Asn
0.79AsnAla: 0.79 ± 0.63
1.053AsnCys: 1.053 ± 1.18
2.633AsnAsp: 2.633 ± 0.841
0.527AsnGlu: 0.527 ± 0.33
1.053AsnPhe: 1.053 ± 0.659
2.106AsnGly: 2.106 ± 0.868
0.527AsnHis: 0.527 ± 0.26
1.58AsnIle: 1.58 ± 0.687
1.843AsnLys: 1.843 ± 0.433
5.793AsnLeu: 5.793 ± 1.366
1.843AsnMet: 1.843 ± 0.542
0.79AsnAsn: 0.79 ± 0.283
1.58AsnPro: 1.58 ± 0.637
2.37AsnGln: 2.37 ± 0.483
0.527AsnArg: 0.527 ± 0.31
1.843AsnSer: 1.843 ± 0.647
2.633AsnThr: 2.633 ± 0.655
1.053AsnVal: 1.053 ± 0.62
1.843AsnTrp: 1.843 ± 0.48
1.053AsnTyr: 1.053 ± 0.528
0.0AsnXaa: 0.0 ± 0.0
Pro
1.843ProAla: 1.843 ± 0.994
0.79ProCys: 0.79 ± 0.343
3.686ProAsp: 3.686 ± 1.124
3.949ProGlu: 3.949 ± 0.679
1.58ProPhe: 1.58 ± 0.637
1.843ProGly: 1.843 ± 0.797
0.263ProHis: 0.263 ± 0.155
2.896ProIle: 2.896 ± 1.133
2.633ProLys: 2.633 ± 1.216
5.529ProLeu: 5.529 ± 0.649
1.316ProMet: 1.316 ± 0.579
2.106ProAsn: 2.106 ± 1.392
2.896ProPro: 2.896 ± 0.589
1.053ProGln: 1.053 ± 0.503
1.843ProArg: 1.843 ± 0.48
5.793ProSer: 5.793 ± 1.153
1.843ProThr: 1.843 ± 0.41
2.633ProVal: 2.633 ± 0.849
0.79ProTrp: 0.79 ± 0.366
0.79ProTyr: 0.79 ± 0.469
0.0ProXaa: 0.0 ± 0.0
Gln
3.423GlnAla: 3.423 ± 1.359
0.527GlnCys: 0.527 ± 0.672
2.633GlnAsp: 2.633 ± 1.216
3.16GlnGlu: 3.16 ± 0.866
1.316GlnPhe: 1.316 ± 0.775
2.896GlnGly: 2.896 ± 0.399
0.79GlnHis: 0.79 ± 0.353
2.37GlnIle: 2.37 ± 2.038
2.633GlnLys: 2.633 ± 0.96
4.213GlnLeu: 4.213 ± 1.086
0.527GlnMet: 0.527 ± 0.31
1.053GlnAsn: 1.053 ± 0.412
1.58GlnPro: 1.58 ± 0.523
1.053GlnGln: 1.053 ± 0.663
1.053GlnArg: 1.053 ± 0.305
3.423GlnSer: 3.423 ± 0.542
2.106GlnThr: 2.106 ± 0.678
2.633GlnVal: 2.633 ± 1.229
1.316GlnTrp: 1.316 ± 0.532
0.79GlnTyr: 0.79 ± 0.283
0.0GlnXaa: 0.0 ± 0.0
Arg
2.106ArgAla: 2.106 ± 0.67
0.527ArgCys: 0.527 ± 0.485
2.633ArgAsp: 2.633 ± 0.936
2.896ArgGlu: 2.896 ± 0.772
3.423ArgPhe: 3.423 ± 0.861
1.843ArgGly: 1.843 ± 0.781
1.58ArgHis: 1.58 ± 0.675
3.423ArgIle: 3.423 ± 1.063
2.37ArgLys: 2.37 ± 0.649
6.319ArgLeu: 6.319 ± 0.918
1.316ArgMet: 1.316 ± 0.548
1.053ArgAsn: 1.053 ± 0.272
1.053ArgPro: 1.053 ± 0.62
2.37ArgGln: 2.37 ± 0.996
3.16ArgArg: 3.16 ± 1.194
0.79ArgSer: 0.79 ± 0.465
4.476ArgThr: 4.476 ± 1.461
1.843ArgVal: 1.843 ± 0.817
0.263ArgTrp: 0.263 ± 0.376
0.527ArgTyr: 0.527 ± 0.33
0.0ArgXaa: 0.0 ± 0.0
Ser
2.896SerAla: 2.896 ± 1.12
1.843SerCys: 1.843 ± 0.773
5.266SerAsp: 5.266 ± 1.101
5.793SerGlu: 5.793 ± 0.965
2.106SerPhe: 2.106 ± 0.425
5.266SerGly: 5.266 ± 1.264
2.37SerHis: 2.37 ± 0.625
4.213SerIle: 4.213 ± 0.704
4.739SerLys: 4.739 ± 1.09
11.058SerLeu: 11.058 ± 1.222
0.79SerMet: 0.79 ± 0.518
3.423SerAsn: 3.423 ± 1.003
5.266SerPro: 5.266 ± 0.918
4.739SerGln: 4.739 ± 1.328
4.739SerArg: 4.739 ± 1.28
10.795SerSer: 10.795 ± 1.66
6.056SerThr: 6.056 ± 1.377
3.423SerVal: 3.423 ± 1.346
2.106SerTrp: 2.106 ± 0.699
2.633SerTyr: 2.633 ± 1.23
0.0SerXaa: 0.0 ± 0.0
Thr
2.896ThrAla: 2.896 ± 0.785
1.316ThrCys: 1.316 ± 0.521
1.053ThrAsp: 1.053 ± 0.434
3.423ThrGlu: 3.423 ± 0.545
3.16ThrPhe: 3.16 ± 1.273
3.949ThrGly: 3.949 ± 0.718
1.58ThrHis: 1.58 ± 0.361
5.529ThrIle: 5.529 ± 2.013
3.16ThrLys: 3.16 ± 1.16
7.899ThrLeu: 7.899 ± 1.872
1.316ThrMet: 1.316 ± 0.715
3.16ThrAsn: 3.16 ± 0.945
5.003ThrPro: 5.003 ± 1.836
1.58ThrGln: 1.58 ± 0.637
2.896ThrArg: 2.896 ± 0.617
7.372ThrSer: 7.372 ± 1.371
5.003ThrThr: 5.003 ± 0.996
3.949ThrVal: 3.949 ± 0.482
1.58ThrTrp: 1.58 ± 0.93
1.316ThrTyr: 1.316 ± 0.578
0.0ThrXaa: 0.0 ± 0.0
Val
3.949ValAla: 3.949 ± 0.722
1.053ValCys: 1.053 ± 0.272
3.686ValAsp: 3.686 ± 1.263
2.37ValGlu: 2.37 ± 0.604
1.58ValPhe: 1.58 ± 1.01
2.633ValGly: 2.633 ± 0.975
0.79ValHis: 0.79 ± 0.864
2.106ValIle: 2.106 ± 0.586
4.739ValLys: 4.739 ± 1.026
8.162ValLeu: 8.162 ± 2.251
0.263ValMet: 0.263 ± 0.155
1.843ValAsn: 1.843 ± 0.476
3.949ValPro: 3.949 ± 1.039
2.37ValGln: 2.37 ± 0.815
3.16ValArg: 3.16 ± 0.804
4.476ValSer: 4.476 ± 1.015
4.213ValThr: 4.213 ± 1.055
5.003ValVal: 5.003 ± 1.153
0.527ValTrp: 0.527 ± 0.302
1.58ValTyr: 1.58 ± 0.361
0.0ValXaa: 0.0 ± 0.0
Trp
2.896TrpAla: 2.896 ± 0.994
0.0TrpCys: 0.0 ± 0.0
1.053TrpAsp: 1.053 ± 0.603
1.843TrpGlu: 1.843 ± 0.781
0.263TrpPhe: 0.263 ± 0.346
2.106TrpGly: 2.106 ± 0.642
0.263TrpHis: 0.263 ± 0.155
1.58TrpIle: 1.58 ± 0.418
1.843TrpLys: 1.843 ± 0.589
1.316TrpLeu: 1.316 ± 0.775
0.79TrpMet: 0.79 ± 0.366
0.79TrpAsn: 0.79 ± 0.353
0.0TrpPro: 0.0 ± 0.0
0.263TrpGln: 0.263 ± 0.376
0.79TrpArg: 0.79 ± 0.332
1.58TrpSer: 1.58 ± 0.705
2.37TrpThr: 2.37 ± 1.029
1.843TrpVal: 1.843 ± 0.687
0.263TrpTrp: 0.263 ± 0.155
0.263TrpTyr: 0.263 ± 0.155
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.053TyrAla: 1.053 ± 0.421
0.263TyrCys: 0.263 ± 0.376
1.053TyrAsp: 1.053 ± 0.653
1.58TyrGlu: 1.58 ± 0.566
0.79TyrPhe: 0.79 ± 0.564
2.37TyrGly: 2.37 ± 0.786
0.527TyrHis: 0.527 ± 0.26
1.58TyrIle: 1.58 ± 0.361
1.843TyrLys: 1.843 ± 0.263
3.16TyrLeu: 3.16 ± 0.83
0.527TyrMet: 0.527 ± 0.31
0.79TyrAsn: 0.79 ± 0.469
1.843TyrPro: 1.843 ± 0.781
2.106TyrGln: 2.106 ± 0.341
1.316TyrArg: 1.316 ± 0.775
3.16TyrSer: 3.16 ± 0.534
1.58TyrThr: 1.58 ± 0.48
2.106TyrVal: 2.106 ± 0.544
0.263TyrTrp: 0.263 ± 0.155
0.527TyrTyr: 0.527 ± 0.471
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 5 proteins (3799 amino acids)
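The table rows above, as they appear in this text, each combine the cell value with a label of the form "FirstSecond: value ± error". A minimal sketch for reading them into records is given below; the regular expression and the name `parse_row` are assumptions based only on the row layout shown here, not on any format documented by Proteome-pI.

```python
import re

ROW = re.compile(
    r"^(?P<cell>\d+(?:\.\d+)?)"                               # value shown in the table cell
    r"(?P<first>[A-Z][a-z]{2})(?P<second>[A-Z][a-z]{2}): "    # three-letter codes of the pair
    r"(?P<mean>\d+(?:\.\d+)?) ± (?P<error>\d+(?:\.\d+)?)$"    # value and bootstrap error
)

def parse_row(line):
    """Parse one table row; return None for row labels such as 'Ala'."""
    match = ROW.match(line.strip())
    if match is None:
        return None
    return {
        "dipeptide": match["first"] + match["second"],
        "per_mille": float(match["mean"]),
        "error": float(match["error"]),
    }

rows = ["Ala", "1.843AlaAla: 1.843 ± 1.73", "13.165LeuSer: 13.165 ± 1.249"]
parsed = [r for r in map(parse_row, rows) if r is not None]
print(parsed)
```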

Note: The error has been estimated by bootstrapping (x100) at the protein level
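One plausible reading of this note is sketched below: whole proteins are resampled with replacement 100 times, the dipeptide frequency is recomputed for each resample, and the spread of those estimates is reported as the error. The exact procedure used by Proteome-pI is not given here, so the use of the standard deviation, the function names, the fixed seed, and the toy sequences are all assumptions.

```python
import random
import statistics
from collections import Counter

def pair_frequency(proteins, pair):
    """Frequency (per mille) of one dipeptide across a list of sequences."""
    counts = Counter()
    for seq in proteins:
        counts.update(a + b for a, b in zip(seq, seq[1:]))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0

def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the pair frequency over protein-level resamples."""
    rng = random.Random(seed)
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_frequency(sample, pair))
    return statistics.stdev(estimates)

# Hypothetical one-letter sequences standing in for a 5-protein proteome.
toy_proteins = ["MKLLSA", "MKLS", "MSSLLK", "MALS", "MKKLS"]
print(pair_frequency(toy_proteins, "LS"), bootstrap_error(toy_proteins, "LS"))
```

Resampling at the protein level, rather than at the residue level, means that a proteome with only a few proteins can give large errors for dipeptides concentrated in a single protein.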

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski