Amino acid dipeptide frequency for Wuhan House Fly Virus 2

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
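The counting described above can be sketched in a few lines of Python. This is a minimal illustration, not the database's own code: the function name `dipeptide_per_mille`, the toy sequences, and the choice to count overlapping pairs within each protein (never across protein boundaries) and to normalise by the total number of counted pairs are all assumptions, and the database's exact normalisation may differ.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} for a list of sequences.

    Assumption: overlapping pairs are counted inside each protein and the
    result is normalised by the total number of counted pairs.
    """
    counts = Counter()
    for seq in proteins:
        # Map anything outside the 20 standard residues to X (Xaa).
        seq = "".join(c if c in AMINO_ACIDS else "X" for c in seq.upper())
        # Overlapping pairs; a pair never spans two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    alphabet = AMINO_ACIDS + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy sequences, only to show the call.
freqs = dipeptide_per_mille(["MKLLSA", "MSLLK"])
print(len(freqs))             # number of table cells, including Xaa
print(round(freqs["LL"], 3))  # LeuLeu frequency in per mille
```

The returned dictionary has one entry per cell of the 21×21 table, so dipeptides that never occur (for example every pair involving Xaa in this proteome) appear with a frequency of 0.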

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
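As a small worked example of the unit conversion, the sketch below turns two per mille values from the table into plain fractions and compares them with the uniform expectation of 1/400. The helper names and the simple greater-than or less-than rule are assumptions for illustration; the page does not state the exact rule behind its red and blue marking.

```python
EXPECTED_PER_MILLE = 1000.0 / 400  # uniform expectation: 0.25% = 2.5 per mille


def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=EXPECTED_PER_MILLE):
    """Label a per mille value relative to the uniform expectation (assumed rule)."""
    if value > expected:
        return "overrepresented"
    if value < expected:
        return "underrepresented"
    return "as expected"


# Values taken from the table: LeuSer and AlaTrp.
for name, value in [("LeuSer", 12.407), ("AlaTrp", 0.248)]:
    print(name, per_mille_to_fraction(value), classify(value))
```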

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is frequency ± error, in ‰.

| 1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 3.474 ± 2.089 | 2.233 ± 0.835 | 3.722 ± 0.501 | 3.97 ± 1.314 | 2.233 ± 0.835 | 2.481 ± 0.71 | 0.744 ± 0.414 | 3.722 ± 0.586 | 2.481 ± 0.71 | 6.7 ± 1.013 | 1.489 ± 0.248 | 2.233 ± 0.58 | 2.73 ± 1.154 | 1.985 ± 0.58 | 2.73 ± 0.626 | 4.715 ± 0.858 | 2.481 ± 0.295 | 3.226 ± 0.85 | 0.248 ± 0.15 | 1.241 ± 0.392 | 0.0 ± 0.0 |
| Cys | 1.985 ± 0.615 | 0.496 ± 0.311 | 0.744 ± 0.414 | 0.0 ± 0.0 | 1.241 ± 0.48 | 0.496 ± 0.277 | 0.248 ± 0.349 | 2.481 ± 0.582 | 1.241 ± 1.368 | 2.233 ± 0.774 | 0.248 ± 0.15 | 0.993 ± 0.615 | 1.241 ± 0.384 | 1.241 ± 0.396 | 1.489 ± 0.291 | 0.248 ± 0.15 | 1.241 ± 0.628 | 0.744 ± 0.342 | 0.248 ± 0.15 | 0.744 ± 0.449 | 0.0 ± 0.0 |
| Asp | 2.481 ± 0.965 | 1.241 ± 0.396 | 2.233 ± 1.221 | 2.73 ± 0.982 | 2.73 ± 0.775 | 2.481 ± 0.46 | 1.737 ± 0.397 | 4.963 ± 1.451 | 4.963 ± 0.439 | 4.218 ± 0.881 | 1.737 ± 0.408 | 2.481 ± 0.671 | 2.481 ± 1.099 | 2.73 ± 1.695 | 0.993 ± 0.366 | 3.97 ± 0.68 | 1.489 ± 1.035 | 1.985 ± 1.176 | 1.241 ± 0.342 | 0.993 ± 0.752 | 0.0 ± 0.0 |
| Glu | 3.226 ± 1.147 | 1.489 ± 0.458 | 2.978 ± 0.874 | 2.978 ± 0.508 | 1.737 ± 0.226 | 3.474 ± 0.985 | 1.489 ± 0.439 | 4.218 ± 1.55 | 2.73 ± 0.387 | 5.211 ± 1.462 | 2.481 ± 0.899 | 2.73 ± 0.383 | 1.737 ± 0.824 | 2.233 ± 0.963 | 1.985 ± 0.647 | 4.963 ± 1.558 | 3.97 ± 1.192 | 2.233 ± 1.03 | 0.744 ± 0.342 | 1.489 ± 0.957 | 0.0 ± 0.0 |
| Phe | 1.241 ± 1.246 | 0.993 ± 0.366 | 2.233 ± 0.464 | 1.737 ± 0.577 | 1.241 ± 0.433 | 4.467 ± 1.159 | 1.241 ± 0.545 | 2.978 ± 0.741 | 1.985 ± 0.957 | 5.955 ± 1.185 | 2.233 ± 0.669 | 2.73 ± 1.074 | 3.474 ± 1.487 | 1.489 ± 0.684 | 3.474 ± 0.656 | 5.955 ± 1.522 | 2.481 ± 0.98 | 1.985 ± 0.44 | 0.993 ± 0.257 | 2.233 ± 1.1 | 0.0 ± 0.0 |
| Gly | 2.233 ± 0.505 | 1.241 ± 0.48 | 1.737 ± 1.048 | 3.226 ± 0.777 | 3.474 ± 1.632 | 1.985 ± 1.198 | 1.241 ± 0.749 | 4.467 ± 1.168 | 2.233 ± 0.803 | 4.467 ± 1.527 | 0.993 ± 0.672 | 1.489 ± 0.684 | 1.985 ± 0.63 | 1.985 ± 0.797 | 1.985 ± 0.782 | 4.715 ± 1.181 | 3.474 ± 0.804 | 2.233 ± 1.123 | 1.241 ± 0.492 | 1.985 ± 0.779 | 0.0 ± 0.0 |
| His | 1.489 ± 0.659 | 0.496 ± 0.277 | 0.744 ± 0.288 | 0.744 ± 0.449 | 1.737 ± 0.749 | 0.496 ± 0.277 | 0.496 ± 0.303 | 1.489 ± 0.861 | 1.241 ± 0.433 | 2.73 ± 0.699 | 0.744 ± 0.449 | 1.489 ± 0.472 | 2.233 ± 0.403 | 1.241 ± 0.545 | 0.744 ± 0.318 | 2.481 ± 0.92 | 0.496 ± 0.476 | 1.737 ± 0.765 | 0.744 ± 0.601 | 1.489 ± 0.49 | 0.0 ± 0.0 |
| Ile | 4.963 ± 1.121 | 0.744 ± 0.449 | 2.978 ± 1.148 | 2.73 ± 0.843 | 4.218 ± 0.673 | 2.73 ± 0.699 | 2.233 ± 1.417 | 6.203 ± 1.122 | 6.948 ± 0.948 | 5.707 ± 1.164 | 1.985 ± 0.714 | 4.218 ± 1.279 | 3.97 ± 1.177 | 2.481 ± 1.114 | 2.481 ± 0.569 | 6.7 ± 2.161 | 3.722 ± 0.903 | 3.722 ± 0.414 | 0.744 ± 0.601 | 1.737 ± 1.385 | 0.0 ± 0.0 |
| Lys | 3.722 ± 1.651 | 0.993 ± 0.752 | 2.978 ± 0.497 | 5.459 ± 1.112 | 4.963 ± 1.312 | 4.467 ± 0.805 | 1.985 ± 0.772 | 3.97 ± 1.092 | 3.97 ± 1.209 | 7.444 ± 1.161 | 0.993 ± 0.432 | 2.481 ± 0.831 | 1.489 ± 0.458 | 2.233 ± 0.595 | 2.481 ± 1.385 | 6.203 ± 1.207 | 2.481 ± 0.899 | 6.7 ± 2.022 | 0.993 ± 0.599 | 1.737 ± 0.692 | 0.0 ± 0.0 |
| Leu | 4.963 ± 1.01 | 1.489 ± 0.637 | 5.955 ± 1.99 | 4.963 ± 0.837 | 4.715 ± 1.269 | 2.978 ± 1.218 | 2.233 ± 0.406 | 7.444 ± 1.049 | 9.181 ± 0.798 | 10.918 ± 1.147 | 2.481 ± 0.762 | 5.211 ± 1.213 | 3.97 ± 1.206 | 4.467 ± 0.719 | 4.467 ± 0.869 | 12.407 ± 0.986 | 4.963 ± 0.975 | 4.715 ± 1.568 | 1.241 ± 0.545 | 2.481 ± 1.523 | 0.0 ± 0.0 |
| Met | 1.489 ± 0.837 | 0.744 ± 0.47 | 2.73 ± 1.319 | 0.744 ± 0.342 | 1.737 ± 0.577 | 1.737 ± 0.577 | 0.496 ± 0.277 | 1.489 ± 0.472 | 0.993 ± 0.366 | 2.73 ± 0.764 | 0.496 ± 0.299 | 0.993 ± 1.137 | 1.489 ± 1.285 | 0.993 ± 0.257 | 1.985 ± 0.976 | 3.226 ± 0.841 | 1.737 ± 0.856 | 0.993 ± 0.432 | 0.248 ± 0.349 | 0.993 ± 0.432 | 0.0 ± 0.0 |
| Asn | 3.474 ± 1.084 | 0.248 ± 0.15 | 0.744 ± 0.352 | 1.985 ± 0.328 | 0.744 ± 0.342 | 0.993 ± 0.599 | 1.489 ± 0.609 | 3.226 ± 0.994 | 2.233 ± 0.808 | 5.707 ± 1.944 | 2.233 ± 1.113 | 1.985 ± 0.619 | 4.218 ± 0.61 | 1.737 ± 0.328 | 1.985 ± 0.489 | 4.963 ± 0.96 | 1.737 ± 0.738 | 3.474 ± 1.657 | 0.993 ± 0.599 | 1.737 ± 0.642 | 0.0 ± 0.0 |
| Pro | 2.73 ± 0.579 | 0.496 ± 0.303 | 3.722 ± 0.687 | 2.978 ± 1.02 | 2.978 ± 0.341 | 1.489 ± 0.831 | 2.73 ± 1.022 | 3.226 ± 0.83 | 4.467 ± 0.671 | 3.722 ± 0.973 | 0.744 ± 0.601 | 2.233 ± 0.774 | 1.985 ± 0.44 | 1.489 ± 0.602 | 1.737 ± 0.226 | 6.7 ± 1.332 | 2.481 ± 0.64 | 2.73 ± 0.764 | 0.0 ± 0.0 | 1.737 ± 0.816 | 0.0 ± 0.0 |
| Gln | 3.474 ± 0.93 | 0.744 ± 0.288 | 2.233 ± 0.807 | 2.233 ± 1.123 | 1.985 ± 0.948 | 0.993 ± 0.564 | 0.496 ± 0.299 | 3.226 ± 0.835 | 2.481 ± 0.474 | 4.467 ± 1.05 | 0.993 ± 0.257 | 0.993 ± 0.627 | 1.737 ± 1.09 | 0.496 ± 0.311 | 1.489 ± 0.576 | 2.233 ± 0.193 | 0.744 ± 0.352 | 3.226 ± 0.814 | 0.248 ± 0.15 | 1.489 ± 0.306 | 0.0 ± 0.0 |
| Arg | 2.978 ± 0.792 | 1.241 ± 0.392 | 1.737 ± 0.465 | 3.722 ± 1.129 | 2.233 ± 0.464 | 2.233 ± 0.48 | 0.248 ± 0.34 | 2.73 ± 0.955 | 1.985 ± 1.473 | 2.978 ± 1.047 | 1.489 ± 0.248 | 2.481 ± 0.548 | 1.489 ± 0.831 | 2.481 ± 0.87 | 0.744 ± 0.449 | 4.218 ± 1.217 | 2.73 ± 0.685 | 2.233 ± 0.48 | 0.248 ± 0.15 | 0.993 ± 0.623 | 0.0 ± 0.0 |
| Ser | 3.226 ± 1.217 | 2.233 ± 0.483 | 5.211 ± 1.046 | 4.467 ± 1.544 | 5.459 ± 1.004 | 5.955 ± 1.736 | 2.233 ± 0.541 | 5.211 ± 1.058 | 6.452 ± 1.138 | 11.166 ± 1.304 | 3.226 ± 0.603 | 4.218 ± 0.873 | 4.218 ± 1.049 | 2.978 ± 0.915 | 5.211 ± 1.625 | 10.67 ± 2.167 | 4.467 ± 0.671 | 6.948 ± 0.89 | 1.489 ± 0.379 | 2.978 ± 0.758 | 0.0 ± 0.0 |
| Thr | 2.73 ± 0.387 | 0.248 ± 0.15 | 2.73 ± 0.605 | 3.722 ± 1.025 | 1.985 ± 0.957 | 2.73 ± 0.564 | 2.233 ± 0.406 | 2.73 ± 0.878 | 4.715 ± 0.799 | 6.452 ± 2.048 | 0.744 ± 0.318 | 1.241 ± 0.48 | 3.474 ± 0.175 | 1.489 ± 1.109 | 2.481 ± 0.569 | 3.226 ± 1.17 | 3.97 ± 0.512 | 3.226 ± 0.4 | 1.241 ± 0.23 | 1.985 ± 0.328 | 0.0 ± 0.0 |
| Val | 3.474 ± 0.978 | 1.737 ± 0.787 | 1.737 ± 0.937 | 2.73 ± 0.682 | 2.73 ± 0.699 | 3.226 ± 0.584 | 0.496 ± 0.277 | 3.97 ± 1.325 | 4.218 ± 0.687 | 3.97 ± 0.585 | 1.489 ± 0.488 | 4.218 ± 0.566 | 3.722 ± 0.861 | 1.241 ± 0.48 | 1.985 ± 0.852 | 5.707 ± 0.694 | 5.459 ± 0.874 | 3.226 ± 0.837 | 0.744 ± 0.449 | 2.233 ± 0.48 | 0.0 ± 0.0 |
| Trp | 0.248 ± 0.15 | 0.0 ± 0.0 | 0.744 ± 1.019 | 1.489 ± 0.616 | 0.744 ± 0.288 | 0.993 ± 0.599 | 0.248 ± 0.34 | 0.993 ± 0.427 | 0.744 ± 0.449 | 0.993 ± 0.366 | 0.744 ± 0.449 | 0.248 ± 0.15 | 0.496 ± 0.277 | 0.0 ± 0.0 | 0.496 ± 0.529 | 1.737 ± 0.492 | 1.737 ± 0.756 | 1.241 ± 0.749 | 0.248 ± 0.15 | 0.248 ± 0.15 | 0.0 ± 0.0 |
| Tyr | 1.241 ± 0.433 | 0.496 ± 0.299 | 1.985 ± 0.782 | 1.489 ± 0.248 | 2.233 ± 0.193 | 1.985 ± 0.505 | 0.744 ± 0.47 | 2.481 ± 1.245 | 2.481 ± 0.264 | 3.722 ± 1.123 | 0.248 ± 0.34 | 1.241 ± 0.696 | 2.233 ± 0.788 | 0.993 ± 0.623 | 0.248 ± 0.15 | 2.978 ± 0.605 | 1.489 ± 0.437 | 1.737 ± 0.592 | 0.496 ± 0.299 | 0.744 ± 0.449 | 0.0 ± 0.0 |
| Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 5 proteins (4031 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
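A protein-level bootstrap of this kind can be sketched as follows. This is only an illustration under stated assumptions: whole proteins are resampled with replacement, the frequency is recomputed for each replicate, and the error is taken as the standard deviation of the replicate estimates. The page does not say which statistic is reported, and the function names, seed and toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def dipeptide_counts(seq):
    """Count overlapping dipeptides within a single protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Assumption: proteins, not residues, are the resampling unit, and the
    error is the standard deviation over the bootstrap replicates.
    """
    rng = random.Random(seed)
    per_protein = [dipeptide_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        # Draw as many proteins as in the original set, with replacement.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.pstdev(estimates)


# Hypothetical toy proteome, only to show the call.
toy = ["MKLLSA", "MSLLK", "MAAK"]
print(bootstrap_error(toy, "LL"))
```

With only a handful of proteins, as in this proteome, the replicates differ mainly in which proteins happen to be drawn several times, which is why some errors in the table are as large as the frequencies themselves.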

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski