Amino acid dipepetide frequency for Wuhan Insect virus 2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.104AlaAla: 3.104 ± 2.359
1.035AlaCys: 1.035 ± 0.375
1.863AlaAsp: 1.863 ± 0.259
2.69AlaGlu: 2.69 ± 0.162
1.863AlaPhe: 1.863 ± 0.259
2.69AlaGly: 2.69 ± 2.005
0.621AlaHis: 0.621 ± 0.183
4.346AlaIle: 4.346 ± 1.284
4.139AlaLys: 4.139 ± 1.55
3.311AlaLeu: 3.311 ± 1.068
0.621AlaMet: 0.621 ± 0.399
3.725AlaAsn: 3.725 ± 0.537
1.863AlaPro: 1.863 ± 0.985
1.656AlaGln: 1.656 ± 1.498
2.483AlaArg: 2.483 ± 0.775
1.863AlaSer: 1.863 ± 1.415
2.276AlaThr: 2.276 ± 0.697
2.483AlaVal: 2.483 ± 0.598
0.0AlaTrp: 0.0 ± 0.0
1.863AlaTyr: 1.863 ± 1.415
0.0AlaXaa: 0.0 ± 0.0
Cys
0.414CysAla: 0.414 ± 0.541
1.035CysCys: 1.035 ± 0.299
1.035CysAsp: 1.035 ± 0.263
1.035CysGlu: 1.035 ± 0.299
0.207CysPhe: 0.207 ± 0.133
0.0CysGly: 0.0 ± 0.0
0.414CysHis: 0.414 ± 0.1
1.242CysIle: 1.242 ± 0.643
2.07CysLys: 2.07 ± 0.778
1.242CysLeu: 1.242 ± 0.299
0.828CysMet: 0.828 ± 0.453
2.276CysAsn: 2.276 ± 1.378
0.414CysPro: 0.414 ± 0.295
0.621CysGln: 0.621 ± 0.214
0.621CysArg: 0.621 ± 0.214
2.276CysSer: 2.276 ± 0.723
1.863CysThr: 1.863 ± 0.378
1.449CysVal: 1.449 ± 0.565
0.0CysTrp: 0.0 ± 0.0
1.242CysTyr: 1.242 ± 0.643
0.0CysXaa: 0.0 ± 0.0
Asp
3.311AspAla: 3.311 ± 0.558
1.035AspCys: 1.035 ± 0.498
4.553AspAsp: 4.553 ± 1.186
5.381AspGlu: 5.381 ± 1.266
2.07AspPhe: 2.07 ± 0.867
2.897AspGly: 2.897 ± 1.295
0.207AspHis: 0.207 ± 0.148
7.036AspIle: 7.036 ± 0.428
5.795AspLys: 5.795 ± 1.001
5.588AspLeu: 5.588 ± 1.404
3.518AspMet: 3.518 ± 1.828
4.139AspAsn: 4.139 ± 0.437
1.242AspPro: 1.242 ± 0.473
1.242AspGln: 1.242 ± 0.799
1.242AspArg: 1.242 ± 0.299
2.69AspSer: 2.69 ± 0.705
4.139AspThr: 4.139 ± 0.997
3.311AspVal: 3.311 ± 1.1
0.414AspTrp: 0.414 ± 0.266
2.07AspTyr: 2.07 ± 0.865
0.0AspXaa: 0.0 ± 0.0
Glu
3.518GluAla: 3.518 ± 0.256
1.242GluCys: 1.242 ± 0.885
2.276GluAsp: 2.276 ± 0.697
4.553GluGlu: 4.553 ± 0.516
2.07GluPhe: 2.07 ± 0.527
2.483GluGly: 2.483 ± 0.046
2.07GluHis: 2.07 ± 0.598
5.381GluIle: 5.381 ± 0.324
6.209GluLys: 6.209 ± 0.874
7.036GluLeu: 7.036 ± 2.287
1.242GluMet: 1.242 ± 0.367
4.139GluAsn: 4.139 ± 0.847
1.035GluPro: 1.035 ± 0.481
1.656GluGln: 1.656 ± 0.634
2.69GluArg: 2.69 ± 0.641
4.346GluSer: 4.346 ± 0.805
4.76GluThr: 4.76 ± 1.355
2.897GluVal: 2.897 ± 0.252
0.414GluTrp: 0.414 ± 0.266
3.932GluTyr: 3.932 ± 1.118
0.0GluXaa: 0.0 ± 0.0
Phe
1.035PheAla: 1.035 ± 0.433
1.035PheCys: 1.035 ± 0.375
2.07PheAsp: 2.07 ± 0.499
1.863PheGlu: 1.863 ± 0.55
0.828PhePhe: 0.828 ± 0.353
2.897PheGly: 2.897 ± 0.293
0.621PheHis: 0.621 ± 0.214
2.07PheIle: 2.07 ± 0.591
3.104PheLys: 3.104 ± 0.654
3.311PheLeu: 3.311 ± 0.169
1.449PheMet: 1.449 ± 0.421
5.381PheAsn: 5.381 ± 1.881
0.207PhePro: 0.207 ± 0.133
0.414PheGln: 0.414 ± 0.532
1.656PheArg: 1.656 ± 0.399
3.311PheSer: 3.311 ± 0.793
2.276PheThr: 2.276 ± 0.73
1.863PheVal: 1.863 ± 0.641
0.414PheTrp: 0.414 ± 0.266
1.656PheTyr: 1.656 ± 0.298
0.0PheXaa: 0.0 ± 0.0
Gly
1.242GlyAla: 1.242 ± 1.563
0.828GlyCys: 0.828 ± 0.199
2.276GlyAsp: 2.276 ± 0.358
3.311GlyGlu: 3.311 ± 0.877
1.242GlyPhe: 1.242 ± 0.548
2.276GlyGly: 2.276 ± 0.192
1.242GlyHis: 1.242 ± 0.334
4.553GlyIle: 4.553 ± 0.91
3.518GlyLys: 3.518 ± 0.877
4.967GlyLeu: 4.967 ± 0.092
2.07GlyMet: 2.07 ± 0.767
4.76GlyAsn: 4.76 ± 0.585
0.621GlyPro: 0.621 ± 0.443
1.656GlyGln: 1.656 ± 1.464
2.276GlyArg: 2.276 ± 0.583
6.209GlySer: 6.209 ± 0.798
3.932GlyThr: 3.932 ± 1.317
3.725GlyVal: 3.725 ± 0.734
1.242GlyTrp: 1.242 ± 0.299
2.276GlyTyr: 2.276 ± 0.918
0.0GlyXaa: 0.0 ± 0.0
His
1.035HisAla: 1.035 ± 0.299
0.207HisCys: 0.207 ± 0.148
1.035HisAsp: 1.035 ± 0.299
0.621HisGlu: 0.621 ± 0.183
0.207HisPhe: 0.207 ± 0.133
0.414HisGly: 0.414 ± 0.1
0.0HisHis: 0.0 ± 0.0
1.863HisIle: 1.863 ± 0.487
1.035HisLys: 1.035 ± 0.375
0.828HisLeu: 0.828 ± 0.453
1.242HisMet: 1.242 ± 0.367
1.035HisAsn: 1.035 ± 0.263
0.414HisPro: 0.414 ± 0.541
0.414HisGln: 0.414 ± 0.532
0.621HisArg: 0.621 ± 0.214
1.242HisSer: 1.242 ± 0.427
0.828HisThr: 0.828 ± 0.304
1.035HisVal: 1.035 ± 0.299
0.0HisTrp: 0.0 ± 0.0
0.828HisTyr: 0.828 ± 0.304
0.0HisXaa: 0.0 ± 0.0
Ile
3.725IleAla: 3.725 ± 0.369
1.242IleCys: 1.242 ± 0.643
5.588IleAsp: 5.588 ± 0.356
5.174IleGlu: 5.174 ± 1.276
2.69IlePhe: 2.69 ± 0.849
4.967IleGly: 4.967 ± 1.264
0.621IleHis: 0.621 ± 0.214
7.45IleIle: 7.45 ± 0.536
9.313IleLys: 9.313 ± 1.821
6.623IleLeu: 6.623 ± 1.524
3.104IleMet: 3.104 ± 0.188
3.932IleAsn: 3.932 ± 0.968
3.311IlePro: 3.311 ± 1.1
2.483IleGln: 2.483 ± 0.825
3.932IleArg: 3.932 ± 1.081
6.829IleSer: 6.829 ± 1.757
3.932IleThr: 3.932 ± 2.164
4.967IleVal: 4.967 ± 0.734
0.828IleTrp: 0.828 ± 0.533
4.346IleTyr: 4.346 ± 0.551
0.0IleXaa: 0.0 ± 0.0
Lys
3.932LysAla: 3.932 ± 0.933
2.276LysCys: 2.276 ± 0.918
4.139LysAsp: 4.139 ± 0.491
5.174LysGlu: 5.174 ± 0.46
2.276LysPhe: 2.276 ± 0.759
4.967LysGly: 4.967 ± 0.717
1.863LysHis: 1.863 ± 0.448
6.623LysIle: 6.623 ± 1.206
6.416LysLys: 6.416 ± 0.411
6.416LysLeu: 6.416 ± 0.31
2.483LysMet: 2.483 ± 0.733
7.45LysAsn: 7.45 ± 0.702
3.311LysPro: 3.311 ± 0.877
1.656LysGln: 1.656 ± 0.443
4.346LysArg: 4.346 ± 1.029
7.036LysSer: 7.036 ± 1.127
4.553LysThr: 4.553 ± 0.232
2.897LysVal: 2.897 ± 0.783
0.414LysTrp: 0.414 ± 0.1
4.967LysTyr: 4.967 ± 0.647
0.0LysXaa: 0.0 ± 0.0
Leu
4.346LeuAla: 4.346 ± 2.089
2.07LeuCys: 2.07 ± 0.778
7.45LeuAsp: 7.45 ± 0.702
5.795LeuGlu: 5.795 ± 1.103
3.104LeuPhe: 3.104 ± 0.188
2.07LeuGly: 2.07 ± 0.767
0.828LeuHis: 0.828 ± 0.199
6.416LeuIle: 6.416 ± 0.87
4.967LeuLys: 4.967 ± 0.677
4.967LeuLeu: 4.967 ± 1.185
3.104LeuMet: 3.104 ± 0.406
4.346LeuAsn: 4.346 ± 0.427
3.311LeuPro: 3.311 ± 0.169
2.483LeuGln: 2.483 ± 0.615
2.897LeuArg: 2.897 ± 1.167
9.106LeuSer: 9.106 ± 0.767
2.897LeuThr: 2.897 ± 0.664
4.139LeuVal: 4.139 ± 0.997
0.621LeuTrp: 0.621 ± 0.443
3.725LeuTyr: 3.725 ± 0.382
0.0LeuXaa: 0.0 ± 0.0
Met
1.863MetAla: 1.863 ± 0.796
0.828MetCys: 0.828 ± 0.304
2.483MetAsp: 2.483 ± 0.74
1.449MetGlu: 1.449 ± 0.391
1.242MetPhe: 1.242 ± 0.367
2.897MetGly: 2.897 ± 0.664
0.414MetHis: 0.414 ± 0.532
2.276MetIle: 2.276 ± 0.967
3.725MetLys: 3.725 ± 0.97
2.897MetLeu: 2.897 ± 0.559
1.035MetMet: 1.035 ± 0.263
1.449MetAsn: 1.449 ± 0.485
1.035MetPro: 1.035 ± 0.678
0.207MetGln: 0.207 ± 0.148
1.656MetArg: 1.656 ± 0.609
2.69MetSer: 2.69 ± 0.813
2.483MetThr: 2.483 ± 0.49
1.656MetVal: 1.656 ± 0.399
0.207MetTrp: 0.207 ± 0.148
1.242MetTyr: 1.242 ± 0.367
0.0MetXaa: 0.0 ± 0.0
Asn
2.07AsnAla: 2.07 ± 0.821
1.035AsnCys: 1.035 ± 0.498
5.795AsnAsp: 5.795 ± 1.018
5.588AsnGlu: 5.588 ± 1.879
2.69AsnPhe: 2.69 ± 0.849
2.897AsnGly: 2.897 ± 0.568
0.828AsnHis: 0.828 ± 0.353
9.313AsnIle: 9.313 ± 2.29
5.588AsnLys: 5.588 ± 0.854
5.588AsnLeu: 5.588 ± 0.123
3.932AsnMet: 3.932 ± 0.968
5.795AsnAsn: 5.795 ± 1.396
2.276AsnPro: 2.276 ± 0.583
1.863AsnGln: 1.863 ± 0.55
2.69AsnArg: 2.69 ± 0.105
4.967AsnSer: 4.967 ± 0.566
3.725AsnThr: 3.725 ± 0.519
2.69AsnVal: 2.69 ± 0.641
0.621AsnTrp: 0.621 ± 0.214
2.69AsnTyr: 2.69 ± 1.212
0.0AsnXaa: 0.0 ± 0.0
Pro
1.242ProAla: 1.242 ± 1.572
0.414ProCys: 0.414 ± 0.266
1.863ProAsp: 1.863 ± 0.19
2.276ProGlu: 2.276 ± 0.116
0.828ProPhe: 0.828 ± 0.353
2.483ProGly: 2.483 ± 0.654
0.828ProHis: 0.828 ± 0.304
2.69ProIle: 2.69 ± 0.105
1.656ProLys: 1.656 ± 0.846
2.897ProLeu: 2.897 ± 1.2
1.035ProMet: 1.035 ± 0.738
1.449ProAsn: 1.449 ± 0.565
0.207ProPro: 0.207 ± 0.133
0.828ProGln: 0.828 ± 0.199
0.621ProArg: 0.621 ± 0.527
3.518ProSer: 3.518 ± 0.878
1.242ProThr: 1.242 ± 0.299
0.828ProVal: 0.828 ± 0.433
0.207ProTrp: 0.207 ± 0.148
0.828ProTyr: 0.828 ± 0.199
0.0ProXaa: 0.0 ± 0.0
Gln
0.621GlnAla: 0.621 ± 0.399
0.207GlnCys: 0.207 ± 0.133
1.863GlnAsp: 1.863 ± 0.487
2.483GlnGlu: 2.483 ± 0.689
1.242GlnPhe: 1.242 ± 0.965
0.828GlnGly: 0.828 ± 0.304
0.207GlnHis: 0.207 ± 0.133
2.07GlnIle: 2.07 ± 0.666
2.483GlnLys: 2.483 ± 0.733
1.656GlnLeu: 1.656 ± 0.96
0.207GlnMet: 0.207 ± 0.57
2.483GlnAsn: 2.483 ± 0.913
0.414GlnPro: 0.414 ± 0.1
0.414GlnGln: 0.414 ± 0.295
1.242GlnArg: 1.242 ± 0.965
1.863GlnSer: 1.863 ± 1.582
1.242GlnThr: 1.242 ± 0.427
0.621GlnVal: 0.621 ± 0.214
0.207GlnTrp: 0.207 ± 0.133
2.07GlnTyr: 2.07 ± 0.499
0.0GlnXaa: 0.0 ± 0.0
Arg
1.656ArgAla: 1.656 ± 0.96
0.621ArgCys: 0.621 ± 0.214
2.276ArgAsp: 2.276 ± 0.788
2.483ArgGlu: 2.483 ± 0.654
1.656ArgPhe: 1.656 ± 0.609
1.863ArgGly: 1.863 ± 0.19
0.828ArgHis: 0.828 ± 0.199
2.897ArgIle: 2.897 ± 0.252
3.311ArgLys: 3.311 ± 0.169
4.553ArgLeu: 4.553 ± 1.097
1.035ArgMet: 1.035 ± 0.611
3.104ArgAsn: 3.104 ± 0.569
1.242ArgPro: 1.242 ± 0.563
1.242ArgGln: 1.242 ± 0.427
3.104ArgArg: 3.104 ± 0.717
4.346ArgSer: 4.346 ± 0.896
3.104ArgThr: 3.104 ± 0.569
2.276ArgVal: 2.276 ± 0.723
0.207ArgTrp: 0.207 ± 0.148
1.449ArgTyr: 1.449 ± 0.543
0.0ArgXaa: 0.0 ± 0.0
Ser
3.104SerAla: 3.104 ± 0.732
1.863SerCys: 1.863 ± 0.487
4.967SerAsp: 4.967 ± 1.144
5.795SerGlu: 5.795 ± 0.148
4.553SerPhe: 4.553 ± 1.432
6.209SerGly: 6.209 ± 0.737
0.621SerHis: 0.621 ± 0.527
6.002SerIle: 6.002 ± 0.718
6.829SerLys: 6.829 ± 1.249
5.795SerLeu: 5.795 ± 0.262
3.104SerMet: 3.104 ± 0.534
4.346SerAsn: 4.346 ± 0.551
1.863SerPro: 1.863 ± 0.922
1.242SerGln: 1.242 ± 0.367
4.346SerArg: 4.346 ± 0.305
8.692SerSer: 8.692 ± 1.045
6.829SerThr: 6.829 ± 0.86
5.174SerVal: 5.174 ± 0.197
1.242SerTrp: 1.242 ± 0.643
2.897SerTyr: 2.897 ± 0.807
0.0SerXaa: 0.0 ± 0.0
Thr
3.932ThrAla: 3.932 ± 3.208
1.242ThrCys: 1.242 ± 0.885
4.346ThrAsp: 4.346 ± 0.839
1.656ThrGlu: 1.656 ± 0.235
2.69ThrPhe: 2.69 ± 0.968
5.588ThrGly: 5.588 ± 1.136
0.621ThrHis: 0.621 ± 0.214
4.967ThrIle: 4.967 ± 0.734
3.932ThrLys: 3.932 ± 0.35
4.553ThrLeu: 4.553 ± 0.232
1.035ThrMet: 1.035 ± 0.433
3.104ThrAsn: 3.104 ± 0.168
1.863ThrPro: 1.863 ± 0.796
1.449ThrGln: 1.449 ± 0.353
3.518ThrArg: 3.518 ± 0.944
4.346ThrSer: 4.346 ± 0.896
3.932ThrThr: 3.932 ± 0.863
4.139ThrVal: 4.139 ± 0.445
0.621ThrTrp: 0.621 ± 0.183
2.897ThrTyr: 2.897 ± 0.551
0.0ThrXaa: 0.0 ± 0.0
Val
2.483ValAla: 2.483 ± 0.733
0.621ValCys: 0.621 ± 0.443
3.725ValAsp: 3.725 ± 0.369
3.518ValGlu: 3.518 ± 0.476
3.104ValPhe: 3.104 ± 0.897
2.483ValGly: 2.483 ± 0.228
0.621ValHis: 0.621 ± 0.443
2.483ValIle: 2.483 ± 0.855
4.553ValLys: 4.553 ± 0.555
3.311ValLeu: 3.311 ± 0.169
1.449ValMet: 1.449 ± 0.28
3.518ValAsn: 3.518 ± 0.691
1.449ValPro: 1.449 ± 0.343
1.449ValGln: 1.449 ± 0.391
1.656ValArg: 1.656 ± 0.399
5.381ValSer: 5.381 ± 0.324
4.139ValThr: 4.139 ± 0.367
3.104ValVal: 3.104 ± 0.499
0.0ValTrp: 0.0 ± 0.0
2.897ValTyr: 2.897 ± 0.698
0.0ValXaa: 0.0 ± 0.0
Trp
0.207TrpAla: 0.207 ± 0.133
0.207TrpCys: 0.207 ± 0.148
0.207TrpAsp: 0.207 ± 0.148
0.414TrpGlu: 0.414 ± 0.1
0.621TrpPhe: 0.621 ± 0.183
0.828TrpGly: 0.828 ± 0.353
0.414TrpHis: 0.414 ± 0.266
1.242TrpIle: 1.242 ± 0.563
0.0TrpLys: 0.0 ± 0.0
0.207TrpLeu: 0.207 ± 0.148
0.0TrpMet: 0.0 ± 0.0
0.621TrpAsn: 0.621 ± 0.214
0.828TrpPro: 0.828 ± 0.199
0.207TrpGln: 0.207 ± 0.133
0.207TrpArg: 0.207 ± 0.148
0.621TrpSer: 0.621 ± 0.399
0.621TrpThr: 0.621 ± 0.443
0.414TrpVal: 0.414 ± 0.1
0.207TrpTrp: 0.207 ± 0.148
0.414TrpTyr: 0.414 ± 0.295
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.656TyrAla: 1.656 ± 0.235
1.242TyrCys: 1.242 ± 0.37
2.07TyrAsp: 2.07 ± 0.666
2.69TyrGlu: 2.69 ± 0.681
2.276TyrPhe: 2.276 ± 0.624
2.483TyrGly: 2.483 ± 0.046
1.035TyrHis: 1.035 ± 0.375
4.139TyrIle: 4.139 ± 0.997
4.553TyrLys: 4.553 ± 0.89
2.897TyrLeu: 2.897 ± 0.707
1.035TyrMet: 1.035 ± 1.007
5.588TyrAsn: 5.588 ± 0.356
1.035TyrPro: 1.035 ± 0.433
1.242TyrGln: 1.242 ± 0.299
1.449TyrArg: 1.449 ± 0.543
4.346TyrSer: 4.346 ± 0.427
1.656TyrThr: 1.656 ± 0.399
2.276TyrVal: 2.276 ± 0.624
0.621TyrTrp: 0.621 ± 0.214
3.311TyrTyr: 3.311 ± 0.318
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (4833 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski