Amino acid dipeptide frequency for Wuhan heteroptera virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain plain fractions.
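As an illustration of how such per mille values can be obtained, here is a minimal Python sketch. It assumes one-letter sequences and counts overlapping residue pairs within each protein; the exact counting convention used by the database is not stated on this page. The names dipeptide_per_mille, classify and toy_proteins are hypothetical and exist only for this example.

```python
from collections import Counter

# Uniform expectation over the 400 standard dipeptides: 1/400 = 0.25% = 2.5 per mille.
UNIFORM_PER_MILLE = 1000.0 / 400


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: pairs are counted inside each protein and never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


def classify(per_mille):
    """Label a frequency relative to the uniform 2.5 per mille expectation."""
    if per_mille > UNIFORM_PER_MILLE:
        return "over"   # corresponds to red in the table
    if per_mille < UNIFORM_PER_MILLE:
        return "under"  # corresponds to blue in the table
    return "expected"


toy_proteins = ["AAAL", "LA"]  # hypothetical one-letter sequences
freqs = dipeptide_per_mille(toy_proteins)
fraction_aa = freqs["AA"] * 1e-3  # per mille converted back to a plain fraction
```

With these toy sequences there are four dipeptides in total (AA twice, AL once, LA once), so AA comes out at 500‰, which is a fraction of 0.5.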

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 2.854 ± 1.257
AlaCys: 1.019 ± 0.376
AlaAsp: 2.243 ± 0.745
AlaGlu: 1.427 ± 0.427
AlaPhe: 3.466 ± 0.473
AlaGly: 1.019 ± 0.508
AlaHis: 0.612 ± 0.407
AlaIle: 3.67 ± 1.086
AlaLys: 3.874 ± 0.601
AlaLeu: 5.097 ± 0.759
AlaMet: 0.815 ± 0.323
AlaAsn: 3.262 ± 0.3
AlaPro: 1.019 ± 0.485
AlaGln: 1.223 ± 0.422
AlaArg: 1.835 ± 0.676
AlaSer: 1.427 ± 0.555
AlaThr: 2.65 ± 0.813
AlaVal: 2.854 ± 0.674
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.223 ± 0.654
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.019 ± 0.552
CysCys: 0.204 ± 0.11
CysAsp: 0.815 ± 0.441
CysGlu: 0.408 ± 0.221
CysPhe: 0.612 ± 0.331
CysGly: 0.815 ± 0.58
CysHis: 0.0 ± 0.0
CysIle: 1.223 ± 0.465
CysLys: 1.223 ± 0.498
CysLeu: 1.835 ± 0.441
CysMet: 0.815 ± 0.447
CysAsn: 1.223 ± 0.871
CysPro: 0.612 ± 0.643
CysGln: 0.204 ± 0.11
CysArg: 0.408 ± 0.221
CysSer: 1.835 ± 0.676
CysThr: 0.408 ± 0.221
CysVal: 1.019 ± 0.389
CysTrp: 0.204 ± 0.412
CysTyr: 1.223 ± 1.508
CysXaa: 0.0 ± 0.0
Asp
AspAla: 2.65 ± 0.725
AspCys: 0.612 ± 0.331
AspAsp: 2.446 ± 0.534
AspGlu: 2.243 ± 1.214
AspPhe: 4.485 ± 1.225
AspGly: 2.039 ± 0.402
AspHis: 1.223 ± 0.663
AspIle: 3.058 ± 1.13
AspLys: 4.689 ± 0.832
AspLeu: 6.728 ± 1.4
AspMet: 1.427 ± 0.399
AspAsn: 1.835 ± 0.441
AspPro: 1.427 ± 0.772
AspGln: 1.019 ± 0.486
AspArg: 2.65 ± 0.854
AspSer: 5.708 ± 1.484
AspThr: 2.65 ± 0.859
AspVal: 2.854 ± 0.534
AspTrp: 0.815 ± 0.262
AspTyr: 2.039 ± 0.604
AspXaa: 0.0 ± 0.0
Glu
GluAla: 1.019 ± 0.552
GluCys: 0.815 ± 0.262
GluAsp: 2.854 ± 0.534
GluGlu: 2.65 ± 1.093
GluPhe: 2.039 ± 0.69
GluGly: 2.446 ± 0.838
GluHis: 1.019 ± 0.552
GluIle: 4.893 ± 1.557
GluLys: 2.65 ± 0.79
GluLeu: 5.708 ± 2.021
GluMet: 1.835 ± 0.691
GluAsn: 4.077 ± 1.297
GluPro: 1.631 ± 0.467
GluGln: 2.039 ± 0.96
GluArg: 1.223 ± 0.662
GluSer: 2.854 ± 0.738
GluThr: 3.262 ± 1.099
GluVal: 2.65 ± 0.945
GluTrp: 0.204 ± 0.11
GluTyr: 2.039 ± 0.338
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.67 ± 0.904
PheCys: 1.427 ± 0.702
PheAsp: 2.039 ± 0.383
PheGlu: 1.427 ± 0.558
PhePhe: 3.058 ± 2.181
PheGly: 3.466 ± 1.133
PheHis: 1.427 ± 0.629
PheIle: 4.689 ± 2.671
PheLys: 2.854 ± 0.804
PheLeu: 6.116 ± 2.131
PheMet: 1.631 ± 0.996
PheAsn: 5.505 ± 1.339
PhePro: 2.039 ± 0.636
PheGln: 1.427 ± 0.965
PheArg: 3.058 ± 1.921
PheSer: 4.893 ± 0.804
PheThr: 4.485 ± 2.382
PheVal: 5.708 ± 1.45
PheTrp: 0.0 ± 0.0
PheTyr: 2.039 ± 1.071
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 0.815 ± 0.441
GlyCys: 0.612 ± 0.254
GlyAsp: 2.243 ± 0.485
GlyGlu: 2.65 ± 0.859
GlyPhe: 2.446 ± 1.209
GlyGly: 3.874 ± 0.943
GlyHis: 0.612 ± 0.29
GlyIle: 2.65 ± 1.076
GlyLys: 2.446 ± 0.757
GlyLeu: 3.67 ± 0.77
GlyMet: 0.408 ± 0.221
GlyAsn: 1.019 ± 0.313
GlyPro: 1.835 ± 0.466
GlyGln: 1.427 ± 1.067
GlyArg: 1.223 ± 0.361
GlySer: 3.67 ± 0.576
GlyThr: 1.835 ± 0.575
GlyVal: 3.67 ± 1.057
GlyTrp: 0.204 ± 0.11
GlyTyr: 1.631 ± 0.762
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.408 ± 0.221
HisCys: 0.612 ± 0.331
HisAsp: 1.223 ± 0.662
HisGlu: 0.408 ± 0.299
HisPhe: 2.039 ± 0.64
HisGly: 1.223 ± 0.361
HisHis: 0.408 ± 0.221
HisIle: 1.427 ± 0.798
HisLys: 1.223 ± 0.604
HisLeu: 1.427 ± 0.584
HisMet: 0.0 ± 0.0
HisAsn: 1.631 ± 0.57
HisPro: 1.223 ± 0.604
HisGln: 0.204 ± 0.38
HisArg: 1.019 ± 0.693
HisSer: 1.631 ± 0.628
HisThr: 0.815 ± 0.36
HisVal: 0.815 ± 0.441
HisTrp: 0.204 ± 0.11
HisTyr: 1.427 ± 0.633
HisXaa: 0.0 ± 0.0
Ile
IleAla: 4.689 ± 1.159
IleCys: 1.223 ± 0.619
IleAsp: 4.281 ± 0.763
IleGlu: 4.077 ± 1.338
IlePhe: 3.67 ± 1.949
IleGly: 2.446 ± 0.412
IleHis: 1.223 ± 0.4
IleIle: 6.32 ± 0.889
IleLys: 4.689 ± 1.093
IleLeu: 6.932 ± 1.809
IleMet: 1.427 ± 0.592
IleAsn: 4.689 ± 0.555
IlePro: 4.893 ± 0.672
IleGln: 2.243 ± 0.517
IleArg: 2.446 ± 0.93
IleSer: 8.563 ± 1.552
IleThr: 4.077 ± 1.041
IleVal: 5.912 ± 0.985
IleTrp: 0.815 ± 0.58
IleTyr: 5.097 ± 1.4
IleXaa: 0.0 ± 0.0
Lys
LysAla: 2.446 ± 0.538
LysCys: 0.204 ± 0.11
LysAsp: 3.262 ± 0.49
LysGlu: 3.058 ± 0.928
LysPhe: 5.708 ± 1.316
LysGly: 1.835 ± 0.713
LysHis: 1.223 ± 0.502
LysIle: 5.097 ± 1.392
LysLys: 3.466 ± 0.566
LysLeu: 7.543 ± 2.498
LysMet: 1.223 ± 0.467
LysAsn: 5.912 ± 1.329
LysPro: 3.262 ± 1.388
LysGln: 1.631 ± 0.64
LysArg: 3.058 ± 0.95
LysSer: 4.281 ± 1.319
LysThr: 4.689 ± 1.26
LysVal: 3.058 ± 0.871
LysTrp: 0.612 ± 0.336
LysTyr: 2.446 ± 0.758
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 3.262 ± 0.838
LeuCys: 1.223 ± 0.889
LeuAsp: 4.893 ± 1.045
LeuGlu: 6.524 ± 1.513
LeuPhe: 5.708 ± 1.922
LeuGly: 4.077 ± 1.395
LeuHis: 2.854 ± 1.174
LeuIle: 7.747 ± 0.964
LeuLys: 7.951 ± 1.391
LeuLeu: 10.601 ± 1.258
LeuMet: 2.65 ± 0.819
LeuAsn: 6.524 ± 1.238
LeuPro: 5.301 ± 0.595
LeuGln: 3.67 ± 1.31
LeuArg: 3.874 ± 0.998
LeuSer: 6.728 ± 1.805
LeuThr: 5.912 ± 1.344
LeuVal: 9.174 ± 2.227
LeuTrp: 1.223 ± 0.465
LeuTyr: 4.077 ± 1.588
LeuXaa: 0.0 ± 0.0
Met
MetAla: 0.408 ± 0.299
MetCys: 0.204 ± 0.11
MetAsp: 1.223 ± 0.391
MetGlu: 1.835 ± 0.648
MetPhe: 1.019 ± 0.654
MetGly: 0.408 ± 0.299
MetHis: 0.204 ± 0.412
MetIle: 1.835 ± 0.996
MetLys: 1.835 ± 0.776
MetLeu: 2.446 ± 0.51
MetMet: 1.019 ± 0.54
MetAsn: 0.408 ± 0.341
MetPro: 0.204 ± 0.357
MetGln: 0.815 ± 0.4
MetArg: 1.427 ± 0.313
MetSer: 1.835 ± 0.53
MetThr: 1.223 ± 0.465
MetVal: 1.223 ± 1.352
MetTrp: 0.612 ± 0.447
MetTyr: 1.427 ± 0.478
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.262 ± 0.506
AsnCys: 1.631 ± 0.424
AsnAsp: 1.835 ± 0.755
AsnGlu: 3.874 ± 1.084
AsnPhe: 5.708 ± 1.083
AsnGly: 1.631 ± 0.525
AsnHis: 1.019 ± 0.313
AsnIle: 5.505 ± 1.13
AsnLys: 5.097 ± 1.033
AsnLeu: 6.932 ± 1.416
AsnMet: 1.427 ± 0.472
AsnAsn: 6.728 ± 3.067
AsnPro: 2.446 ± 0.34
AsnGln: 1.019 ± 0.552
AsnArg: 2.446 ± 0.758
AsnSer: 5.505 ± 2.046
AsnThr: 4.281 ± 1.273
AsnVal: 5.301 ± 1.669
AsnTrp: 0.408 ± 0.221
AsnTyr: 2.446 ± 0.445
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.243 ± 0.342
ProCys: 0.408 ± 0.534
ProAsp: 2.243 ± 0.542
ProGlu: 1.835 ± 0.886
ProPhe: 2.243 ± 1.622
ProGly: 2.65 ± 0.383
ProHis: 0.408 ± 0.322
ProIle: 4.077 ± 0.872
ProLys: 3.262 ± 1.422
ProLeu: 3.67 ± 0.969
ProMet: 0.612 ± 0.448
ProAsn: 1.631 ± 0.407
ProPro: 3.058 ± 0.577
ProGln: 1.631 ± 0.429
ProArg: 1.835 ± 0.737
ProSer: 4.281 ± 0.808
ProThr: 2.65 ± 2.018
ProVal: 2.65 ± 0.649
ProTrp: 0.0 ± 0.0
ProTyr: 2.446 ± 0.723
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 0.612 ± 0.323
GlnCys: 0.612 ± 0.254
GlnAsp: 2.243 ± 0.755
GlnGlu: 0.815 ± 0.441
GlnPhe: 1.019 ± 0.486
GlnGly: 0.815 ± 0.441
GlnHis: 1.223 ± 0.646
GlnIle: 2.039 ± 0.844
GlnLys: 2.65 ± 0.567
GlnLeu: 2.039 ± 1.32
GlnMet: 1.223 ± 1.595
GlnAsn: 1.835 ± 1.258
GlnPro: 0.612 ± 0.331
GlnGln: 1.835 ± 0.645
GlnArg: 0.815 ± 0.563
GlnSer: 2.446 ± 0.758
GlnThr: 1.631 ± 0.799
GlnVal: 1.631 ± 0.69
GlnTrp: 0.204 ± 0.345
GlnTyr: 1.631 ± 0.683
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 0.612 ± 0.708
ArgCys: 0.408 ± 0.221
ArgAsp: 1.835 ± 0.772
ArgGlu: 1.427 ± 0.352
ArgPhe: 2.65 ± 1.306
ArgGly: 2.039 ± 0.779
ArgHis: 0.408 ± 0.221
ArgIle: 3.466 ± 0.717
ArgLys: 3.262 ± 0.686
ArgLeu: 4.485 ± 0.972
ArgMet: 0.408 ± 0.221
ArgAsn: 2.039 ± 0.625
ArgPro: 1.223 ± 0.502
ArgGln: 0.815 ± 0.561
ArgArg: 1.631 ± 0.539
ArgSer: 4.077 ± 0.705
ArgThr: 1.631 ± 0.427
ArgVal: 2.446 ± 0.991
ArgTrp: 1.019 ± 0.486
ArgTyr: 2.446 ± 0.704
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.466 ± 1.25
SerCys: 1.631 ± 0.883
SerAsp: 5.912 ± 1.501
SerGlu: 3.67 ± 1.406
SerPhe: 4.077 ± 1.022
SerGly: 2.446 ± 0.861
SerHis: 1.835 ± 0.848
SerIle: 6.32 ± 2.281
SerLys: 3.67 ± 0.934
SerLeu: 10.398 ± 0.896
SerMet: 0.815 ± 0.381
SerAsn: 6.932 ± 0.873
SerPro: 4.689 ± 1.162
SerGln: 1.427 ± 0.772
SerArg: 1.631 ± 0.688
SerSer: 8.97 ± 2.459
SerThr: 6.116 ± 2.155
SerVal: 8.359 ± 1.79
SerTrp: 0.815 ± 0.365
SerTyr: 2.65 ± 0.478
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 2.243 ± 0.96
ThrCys: 0.204 ± 0.11
ThrAsp: 4.281 ± 1.334
ThrGlu: 3.67 ± 0.758
ThrPhe: 4.281 ± 1.88
ThrGly: 2.039 ± 1.172
ThrHis: 0.815 ± 0.262
ThrIle: 4.077 ± 0.852
ThrLys: 3.874 ± 0.563
ThrLeu: 5.097 ± 1.43
ThrMet: 1.631 ± 0.774
ThrAsn: 4.077 ± 0.929
ThrPro: 2.65 ± 0.988
ThrGln: 1.427 ± 0.662
ThrArg: 0.815 ± 0.58
ThrSer: 6.116 ± 2.346
ThrThr: 5.708 ± 4.172
ThrVal: 4.689 ± 1.242
ThrTrp: 0.204 ± 0.345
ThrTyr: 2.446 ± 0.96
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.077 ± 0.87
ValCys: 1.427 ± 0.352
ValAsp: 3.67 ± 1.22
ValGlu: 3.466 ± 0.643
ValPhe: 2.854 ± 1.375
ValGly: 2.65 ± 0.855
ValHis: 1.223 ± 0.326
ValIle: 4.893 ± 1.38
ValLys: 3.262 ± 1.519
ValLeu: 7.543 ± 1.282
ValMet: 1.019 ± 0.384
ValAsn: 5.301 ± 1.075
ValPro: 3.466 ± 1.112
ValGln: 1.631 ± 0.553
ValArg: 4.281 ± 0.857
ValSer: 7.136 ± 1.095
ValThr: 4.689 ± 0.866
ValVal: 4.485 ± 0.771
ValTrp: 1.223 ± 1.466
ValTyr: 4.077 ± 2.142
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.408 ± 0.221
TrpCys: 0.204 ± 0.11
TrpAsp: 0.204 ± 0.11
TrpGlu: 0.408 ± 0.221
TrpPhe: 0.815 ± 0.577
TrpGly: 0.204 ± 0.11
TrpHis: 0.204 ± 0.11
TrpIle: 1.019 ± 0.58
TrpLys: 0.204 ± 0.11
TrpLeu: 1.427 ± 0.461
TrpMet: 0.0 ± 0.0
TrpAsn: 0.815 ± 0.365
TrpPro: 0.0 ± 0.0
TrpGln: 0.408 ± 0.511
TrpArg: 0.204 ± 0.11
TrpSer: 0.612 ± 0.528
TrpThr: 0.408 ± 0.299
TrpVal: 1.019 ± 0.585
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.815 ± 0.421
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.427 ± 0.378
TyrCys: 1.427 ± 0.478
TyrAsp: 2.854 ± 1.237
TyrGlu: 2.039 ± 0.374
TyrPhe: 3.058 ± 1.051
TyrGly: 0.815 ± 0.365
TyrHis: 1.427 ± 0.633
TyrIle: 5.708 ± 1.693
TyrLys: 1.835 ± 0.836
TyrLeu: 4.281 ± 1.104
TyrMet: 1.019 ± 0.384
TyrAsn: 3.058 ± 0.577
TyrPro: 2.243 ± 1.42
TyrGln: 1.835 ± 0.706
TyrArg: 2.446 ± 0.499
TyrSer: 3.262 ± 0.862
TyrThr: 1.427 ± 1.377
TyrVal: 2.854 ± 0.947
TyrTrp: 0.612 ± 0.254
TyrTyr: 3.058 ± 2.089
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 8 proteins (4906 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
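A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the sample standard deviation across replicates is reported as the error. Whether the database uses the standard deviation or another spread measure is not stated here, and the function names pair_per_mille and bootstrap_error are hypothetical.

```python
import random
from collections import Counter
from statistics import stdev


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide, in per mille, over overlapping pairs in all proteins."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread of the per mille frequency across protein-level bootstrap replicates."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size fixed.
        resampled = rng.choices(proteins, k=len(proteins))
        estimates.append(pair_per_mille(resampled, pair))
    # Assumption: the reported error is the sample standard deviation.
    return stdev(estimates)
```

Resampling whole proteins rather than individual residues keeps each protein's internal composition intact, so the error reflects how much the estimate depends on which proteins happen to be in the proteome. With only 8 proteins, as here, the resulting errors can be large relative to the values.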

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski