Amino acid dipeptide frequency for Wuhan mosquito virus 1

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
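As an illustration of the calculation described above, here is a minimal Python sketch that counts overlapping dipeptides and reports their frequencies in per mille. It is not the Proteome-pI implementation: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count pairs only within each protein (never across protein boundaries) are assumptions made for this example.

```python
from collections import Counter

# One-letter codes of the 20 standard amino acids; anything else becomes X (Xaa).
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping adjacent pairs.

    Hypothetical helper for illustration only.
    """
    counts = Counter()
    for seq in proteins:
        # Map non-standard or unknown residues to X.
        seq = "".join(c if c in STANDARD else "X" for c in seq.upper())
        # Overlapping pairs inside one protein; pairs never span two proteins
        # (an assumption of this sketch).
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input, not the Wuhan mosquito virus 1 proteome.
freqs = dipeptide_per_mille(["MKKLA", "MKB"])

# Uniform expectation for 20 x 20 dipeptides: 1/400 = 0.25% = 2.5 per mille.
uniform_expectation = 1000.0 / 400
```

With real data, the input would be the protein sequences of the proteome, and each value in `freqs` could be compared against `uniform_expectation`.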

All values are given in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
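The unit conversion, and the comparison against the uniform expectation, can be sketched in Python as follows. The helper names are hypothetical, and treating 2.5‰ (that is, 1/400) as the threshold between over- and underrepresented dipeptides is an assumption based on the description above, not a documented rule of the database.

```python
# Uniform expectation for 400 dipeptides: 0.25% = 2.5 per mille (assumed threshold).
UNIFORM_PER_MILLE = 1000.0 / 400


def per_mille_to_fraction(value_per_mille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def classify(value_per_mille, expected=UNIFORM_PER_MILLE):
    """Label a dipeptide frequency relative to the expected value."""
    if value_per_mille > expected:
        return "over"
    if value_per_mille < expected:
        return "under"
    return "as expected"


# Values copied from the table: AspIle = 8.013, AlaTrp = 0.0.
asp_ile_fraction = per_mille_to_fraction(8.013)
asp_ile_label = classify(8.013)
ala_trp_label = classify(0.0)
```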

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 3.526 ± 0.274
AlaCys: 1.282 ± 0.245
AlaAsp: 3.526 ± 1.231
AlaGlu: 2.564 ± 0.352
AlaPhe: 2.885 ± 1.388
AlaGly: 1.923 ± 0.749
AlaHis: 0.641 ± 0.258
AlaIle: 4.167 ± 0.38
AlaLys: 3.205 ± 1.1
AlaLeu: 2.885 ± 1.191
AlaMet: 0.962 ± 0.397
AlaAsn: 2.244 ± 0.242
AlaPro: 0.962 ± 0.201
AlaGln: 2.885 ± 1.09
AlaArg: 1.923 ± 0.603
AlaSer: 2.885 ± 1.09
AlaThr: 2.564 ± 1.742
AlaVal: 3.846 ± 1.692
AlaTrp: 0.0 ± 0.0
AlaTyr: 1.923 ± 0.484
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 1.282 ± 0.366
CysCys: 0.641 ± 0.258
CysAsp: 1.282 ± 0.989
CysGlu: 2.244 ± 0.689
CysPhe: 0.641 ± 0.742
CysGly: 1.603 ± 1.139
CysHis: 0.0 ± 0.0
CysIle: 0.962 ± 0.201
CysLys: 1.282 ± 0.515
CysLeu: 2.244 ± 0.422
CysMet: 0.641 ± 0.258
CysAsn: 2.244 ± 0.624
CysPro: 0.0 ± 0.0
CysGln: 0.962 ± 0.911
CysArg: 2.244 ± 0.422
CysSer: 2.564 ± 0.63
CysThr: 2.564 ± 0.664
CysVal: 0.962 ± 0.911
CysTrp: 0.321 ± 0.371
CysTyr: 1.603 ± 0.252
CysXaa: 0.0 ± 0.0

Asp
AspAla: 1.603 ± 0.252
AspCys: 1.282 ± 0.425
AspAsp: 4.808 ± 1.39
AspGlu: 2.885 ± 0.916
AspPhe: 1.603 ± 0.252
AspGly: 1.923 ± 0.773
AspHis: 0.962 ± 0.5
AspIle: 8.013 ± 0.972
AspLys: 6.731 ± 0.412
AspLeu: 4.487 ± 2.104
AspMet: 1.603 ± 0.353
AspAsn: 4.808 ± 0.193
AspPro: 1.282 ± 0.515
AspGln: 1.923 ± 0.603
AspArg: 1.923 ± 0.902
AspSer: 3.846 ± 1.205
AspThr: 1.923 ± 0.484
AspVal: 4.167 ± 0.84
AspTrp: 1.282 ± 0.245
AspTyr: 3.846 ± 0.639
AspXaa: 0.0 ± 0.0

Glu
GluAla: 2.244 ± 1.753
GluCys: 0.321 ± 0.15
GluAsp: 2.244 ± 0.624
GluGlu: 3.526 ± 0.274
GluPhe: 1.603 ± 0.808
GluGly: 3.205 ± 1.289
GluHis: 2.244 ± 0.809
GluIle: 5.769 ± 1.047
GluLys: 2.885 ± 0.588
GluLeu: 3.846 ± 0.915
GluMet: 3.526 ± 0.761
GluAsn: 1.923 ± 0.902
GluPro: 2.244 ± 0.32
GluGln: 1.923 ± 0.749
GluArg: 2.564 ± 0.489
GluSer: 4.808 ± 0.409
GluThr: 4.808 ± 1.312
GluVal: 4.487 ± 0.261
GluTrp: 0.641 ± 0.301
GluTyr: 3.526 ± 0.281
GluXaa: 0.0 ± 0.0

Phe
PheAla: 2.244 ± 2.432
PheCys: 1.603 ± 0.437
PheAsp: 2.244 ± 0.242
PheGlu: 1.923 ± 0.603
PhePhe: 0.0 ± 0.0
PheGly: 2.885 ± 0.611
PheHis: 0.321 ± 0.15
PheIle: 6.731 ± 1.315
PheLys: 2.564 ± 1.031
PheLeu: 1.282 ± 0.425
PheMet: 1.282 ± 0.515
PheAsn: 1.923 ± 0.403
PhePro: 0.641 ± 0.301
PheGln: 0.962 ± 0.451
PheArg: 0.962 ± 0.397
PheSer: 3.526 ± 1.231
PheThr: 1.603 ± 0.353
PheVal: 0.641 ± 0.301
PheTrp: 0.321 ± 0.15
PheTyr: 0.641 ± 0.258
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 2.564 ± 1.2
GlyCys: 3.205 ± 2.23
GlyAsp: 3.205 ± 0.705
GlyGlu: 2.885 ± 1.191
GlyPhe: 1.282 ± 0.601
GlyGly: 2.885 ± 0.611
GlyHis: 1.282 ± 0.366
GlyIle: 2.885 ± 0.604
GlyLys: 4.808 ± 0.468
GlyLeu: 4.167 ± 2.502
GlyMet: 2.564 ± 0.732
GlyAsn: 2.564 ± 0.63
GlyPro: 0.641 ± 0.641
GlyGln: 1.282 ± 0.848
GlyArg: 2.244 ± 0.723
GlySer: 5.769 ± 0.959
GlyThr: 2.885 ± 1.499
GlyVal: 3.205 ± 0.891
GlyTrp: 0.962 ± 0.201
GlyTyr: 2.564 ± 0.63
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.962 ± 0.397
HisCys: 0.321 ± 0.15
HisAsp: 1.923 ± 0.749
HisGlu: 1.282 ± 0.601
HisPhe: 0.0 ± 0.0
HisGly: 0.962 ± 0.451
HisHis: 0.962 ± 0.397
HisIle: 1.923 ± 0.484
HisLys: 0.641 ± 0.258
HisLeu: 4.167 ± 0.829
HisMet: 1.282 ± 0.245
HisAsn: 2.244 ± 0.624
HisPro: 0.641 ± 0.258
HisGln: 0.641 ± 0.301
HisArg: 0.321 ± 0.497
HisSer: 2.244 ± 0.809
HisThr: 0.962 ± 0.201
HisVal: 1.282 ± 0.245
HisTrp: 0.0 ± 0.0
HisTyr: 0.962 ± 0.397
HisXaa: 0.0 ± 0.0

Ile
IleAla: 4.487 ± 0.34
IleCys: 1.282 ± 0.425
IleAsp: 4.808 ± 1.39
IleGlu: 6.09 ± 0.616
IlePhe: 1.603 ± 0.849
IleGly: 6.09 ± 1.768
IleHis: 1.603 ± 0.353
IleIle: 5.128 ± 0.34
IleLys: 6.41 ± 0.837
IleLeu: 4.167 ± 1.27
IleMet: 2.244 ± 0.422
IleAsn: 5.449 ± 1.183
IlePro: 2.244 ± 0.624
IleGln: 1.923 ± 0.484
IleArg: 3.846 ± 0.806
IleSer: 9.936 ± 1.876
IleThr: 5.128 ± 0.422
IleVal: 3.846 ± 0.968
IleTrp: 0.641 ± 0.258
IleTyr: 2.564 ± 0.17
IleXaa: 0.0 ± 0.0

Lys
LysAla: 4.167 ± 0.829
LysCys: 1.603 ± 0.437
LysAsp: 2.244 ± 0.32
LysGlu: 2.885 ± 0.604
LysPhe: 5.128 ± 1.704
LysGly: 5.128 ± 1.712
LysHis: 1.923 ± 0.196
LysIle: 4.167 ± 0.414
LysLys: 4.808 ± 1.074
LysLeu: 5.128 ± 1.954
LysMet: 3.846 ± 0.343
LysAsn: 4.167 ± 1.001
LysPro: 2.885 ± 0.484
LysGln: 2.885 ± 1.09
LysArg: 3.526 ± 2.01
LysSer: 5.128 ± 1.185
LysThr: 4.487 ± 0.806
LysVal: 4.487 ± 0.893
LysTrp: 1.282 ± 0.515
LysTyr: 4.808 ± 1.804
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 3.526 ± 0.768
LeuCys: 2.885 ± 0.916
LeuAsp: 4.487 ± 0.844
LeuGlu: 6.09 ± 1.194
LeuPhe: 1.923 ± 1.73
LeuGly: 5.128 ± 1.033
LeuHis: 2.244 ± 0.242
LeuIle: 4.167 ± 0.532
LeuLys: 4.487 ± 1.21
LeuLeu: 4.808 ± 0.585
LeuMet: 1.923 ± 0.196
LeuAsn: 5.449 ± 1.183
LeuPro: 3.205 ± 0.705
LeuGln: 1.923 ± 0.773
LeuArg: 5.128 ± 1.628
LeuSer: 8.333 ± 2.536
LeuThr: 4.167 ± 1.062
LeuVal: 4.808 ± 0.789
LeuTrp: 0.962 ± 0.451
LeuTyr: 2.885 ± 0.588
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.923 ± 0.196
MetCys: 0.321 ± 0.15
MetAsp: 2.564 ± 0.17
MetGlu: 1.282 ± 0.771
MetPhe: 0.962 ± 0.621
MetGly: 0.641 ± 0.258
MetHis: 1.282 ± 0.245
MetIle: 3.526 ± 1.2
MetLys: 2.244 ± 0.32
MetLeu: 4.487 ± 1.21
MetMet: 0.641 ± 0.258
MetAsn: 1.923 ± 0.773
MetPro: 2.244 ± 0.668
MetGln: 1.282 ± 0.366
MetArg: 1.282 ± 0.245
MetSer: 2.564 ± 1.161
MetThr: 2.885 ± 1.392
MetVal: 1.923 ± 0.403
MetTrp: 0.0 ± 0.0
MetTyr: 2.244 ± 1.309
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 2.564 ± 0.767
AsnCys: 1.603 ± 0.875
AsnAsp: 3.205 ± 1.503
AsnGlu: 3.205 ± 0.39
AsnPhe: 2.885 ± 0.913
AsnGly: 3.526 ± 0.833
AsnHis: 1.282 ± 0.245
AsnIle: 3.846 ± 0.208
AsnLys: 4.808 ± 1.087
AsnLeu: 4.808 ± 0.902
AsnMet: 1.923 ± 0.999
AsnAsn: 2.244 ± 0.809
AsnPro: 1.603 ± 0.252
AsnGln: 1.603 ± 0.252
AsnArg: 1.603 ± 0.252
AsnSer: 2.885 ± 0.913
AsnThr: 1.923 ± 0.484
AsnVal: 8.013 ± 0.422
AsnTrp: 0.962 ± 0.451
AsnTyr: 1.923 ± 0.403
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 1.282 ± 0.848
ProCys: 0.321 ± 0.15
ProAsp: 2.564 ± 0.852
ProGlu: 2.244 ± 0.32
ProPhe: 1.923 ± 0.902
ProGly: 0.962 ± 0.911
ProHis: 0.321 ± 0.15
ProIle: 3.846 ± 0.208
ProLys: 1.923 ± 0.773
ProLeu: 0.962 ± 0.5
ProMet: 1.282 ± 0.962
ProAsn: 1.923 ± 0.484
ProPro: 0.962 ± 0.5
ProGln: 0.641 ± 0.301
ProArg: 0.962 ± 0.921
ProSer: 1.923 ± 0.484
ProThr: 1.282 ± 0.601
ProVal: 0.962 ± 0.451
ProTrp: 0.0 ± 0.0
ProTyr: 0.641 ± 0.742
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 0.962 ± 0.5
GlnCys: 1.282 ± 0.515
GlnAsp: 1.282 ± 0.245
GlnGlu: 1.282 ± 0.245
GlnPhe: 1.603 ± 1.331
GlnGly: 0.962 ± 0.201
GlnHis: 0.0 ± 0.0
GlnIle: 1.603 ± 1.102
GlnLys: 1.603 ± 0.5
GlnLeu: 2.885 ± 0.913
GlnMet: 1.923 ± 0.749
GlnAsn: 1.923 ± 0.403
GlnPro: 0.641 ± 0.424
GlnGln: 1.603 ± 0.752
GlnArg: 0.641 ± 0.301
GlnSer: 2.885 ± 0.593
GlnThr: 2.885 ± 1.191
GlnVal: 2.564 ± 1.742
GlnTrp: 0.0 ± 0.0
GlnTyr: 2.244 ± 0.723
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 2.564 ± 1.097
ArgCys: 0.641 ± 0.258
ArgAsp: 2.885 ± 0.524
ArgGlu: 3.526 ± 0.66
ArgPhe: 1.603 ± 0.621
ArgGly: 1.923 ± 0.902
ArgHis: 1.603 ± 0.353
ArgIle: 1.923 ± 0.403
ArgLys: 2.885 ± 0.913
ArgLeu: 2.244 ± 0.32
ArgMet: 3.205 ± 0.856
ArgAsn: 1.282 ± 0.962
ArgPro: 0.641 ± 0.301
ArgGln: 1.923 ± 0.47
ArgArg: 1.282 ± 0.601
ArgSer: 4.487 ± 0.261
ArgThr: 2.885 ± 0.593
ArgVal: 5.449 ± 0.614
ArgTrp: 0.321 ± 0.497
ArgTyr: 0.962 ± 0.201
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.846 ± 0.741
SerCys: 3.526 ± 2.551
SerAsp: 5.449 ± 1.183
SerGlu: 5.128 ± 1.01
SerPhe: 2.885 ± 1.353
SerGly: 3.846 ± 1.442
SerHis: 1.603 ± 0.752
SerIle: 7.372 ± 2.158
SerLys: 8.333 ± 0.615
SerLeu: 9.615 ± 0.721
SerMet: 2.885 ± 1.027
SerAsn: 4.167 ± 1.106
SerPro: 1.282 ± 0.425
SerGln: 2.564 ± 0.352
SerArg: 3.526 ± 0.281
SerSer: 8.974 ± 1.14
SerThr: 5.769 ± 1.333
SerVal: 3.846 ± 0.516
SerTrp: 0.641 ± 0.301
SerTyr: 3.846 ± 0.741
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 1.603 ± 0.808
ThrCys: 0.962 ± 0.451
ThrAsp: 5.128 ± 1.464
ThrGlu: 0.962 ± 0.201
ThrPhe: 1.282 ± 1.281
ThrGly: 3.526 ± 1.091
ThrHis: 1.603 ± 0.5
ThrIle: 4.167 ± 1.506
ThrLys: 5.128 ± 1.507
ThrLeu: 5.769 ± 0.968
ThrMet: 1.282 ± 0.771
ThrAsn: 2.564 ± 0.767
ThrPro: 1.923 ± 0.749
ThrGln: 0.962 ± 0.451
ThrArg: 4.487 ± 1.459
ThrSer: 5.449 ± 1.513
ThrThr: 1.923 ± 0.749
ThrVal: 3.205 ± 0.39
ThrTrp: 0.641 ± 0.258
ThrTyr: 3.846 ± 0.208
ThrXaa: 0.0 ± 0.0

Val
ValAla: 2.564 ± 1.638
ValCys: 1.923 ± 1.579
ValAsp: 5.769 ± 1.975
ValGlu: 3.205 ± 0.39
ValPhe: 1.923 ± 0.603
ValGly: 3.846 ± 0.516
ValHis: 2.244 ± 0.624
ValIle: 4.808 ± 1.058
ValLys: 5.128 ± 0.673
ValLeu: 4.808 ± 0.882
ValMet: 1.923 ± 0.773
ValAsn: 4.167 ± 1.502
ValPro: 1.282 ± 0.366
ValGln: 1.603 ± 0.808
ValArg: 2.885 ± 0.913
ValSer: 5.128 ± 0.829
ValThr: 3.526 ± 0.658
ValVal: 4.808 ± 0.882
ValTrp: 0.0 ± 0.0
ValTyr: 4.487 ± 0.64
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.962 ± 0.451
TrpCys: 0.962 ± 0.201
TrpAsp: 0.321 ± 0.371
TrpGlu: 0.962 ± 0.201
TrpPhe: 0.321 ± 0.15
TrpGly: 0.641 ± 0.301
TrpHis: 0.0 ± 0.0
TrpIle: 0.321 ± 0.15
TrpLys: 0.321 ± 0.497
TrpLeu: 0.641 ± 0.258
TrpMet: 0.321 ± 0.15
TrpAsn: 0.321 ± 0.15
TrpPro: 0.641 ± 0.301
TrpGln: 0.0 ± 0.0
TrpArg: 0.321 ± 0.15
TrpSer: 0.962 ± 0.621
TrpThr: 0.321 ± 0.15
TrpVal: 0.321 ± 0.15
TrpTrp: 0.0 ± 0.0
TrpTyr: 0.321 ± 0.371
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 1.923 ± 0.955
TyrCys: 0.321 ± 0.15
TyrAsp: 0.962 ± 0.451
TyrGlu: 4.167 ± 1.298
TyrPhe: 2.244 ± 1.052
TyrGly: 2.244 ± 0.689
TyrHis: 1.603 ± 0.353
TyrIle: 4.167 ± 0.532
TyrLys: 4.487 ± 0.261
TyrLeu: 5.128 ± 0.422
TyrMet: 0.641 ± 0.301
TyrAsn: 2.885 ± 0.02
TyrPro: 0.962 ± 0.201
TyrGln: 1.282 ± 0.989
TyrArg: 2.885 ± 0.524
TyrSer: 4.808 ± 1.804
TyrThr: 1.923 ± 0.47
TyrVal: 3.205 ± 0.624
TyrTrp: 0.0 ± 0.0
TyrTyr: 2.244 ± 0.624
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 3 proteins (3121 amino acids)

Note: The error was estimated by bootstrapping (×100) at the protein level.
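A protein-level bootstrap of this kind can be sketched in Python as below. The page states only that the error comes from 100 bootstrap replicates at the protein level, so the details here are assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the sample standard deviation of the replicates is reported as the error. The function names and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_per_mille(proteins, pair):
    """Frequency of one dipeptide in per mille, counting overlapping pairs."""
    counts = Counter()
    for seq in proteins:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    return 1000.0 * counts[pair] / total if total else 0.0


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Standard deviation of the dipeptide frequency over bootstrap replicates.

    Each replicate resamples whole proteins with replacement (assumed procedure).
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    values = []
    for _ in range(replicates):
        sample = rng.choices(proteins, k=len(proteins))
        values.append(pair_per_mille(sample, pair))
    return statistics.stdev(values)


# Toy proteome of three short sequences, not the real data.
toy = ["MKKLAV", "MSSLKK", "MKAVLS"]
kk_error = bootstrap_error(toy, "KK")
```

With only three proteins, as in this proteome, each replicate draws from a very small pool, which is one plausible reason why some of the reported errors are large relative to the values.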

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski