Amino acid dipepetide frequency for Drosophila immigrans Nora virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.989AlaAla: 3.989 ± 0.682
0.798AlaCys: 0.798 ± 0.481
3.191AlaAsp: 3.191 ± 0.679
4.521AlaGlu: 4.521 ± 1.138
3.457AlaPhe: 3.457 ± 0.994
3.457AlaGly: 3.457 ± 0.794
1.064AlaHis: 1.064 ± 0.405
5.319AlaIle: 5.319 ± 1.397
3.723AlaLys: 3.723 ± 1.586
6.383AlaLeu: 6.383 ± 1.664
1.33AlaMet: 1.33 ± 0.307
3.989AlaAsn: 3.989 ± 1.44
3.457AlaPro: 3.457 ± 1.637
1.862AlaGln: 1.862 ± 0.888
1.862AlaArg: 1.862 ± 0.831
3.457AlaSer: 3.457 ± 0.917
3.457AlaThr: 3.457 ± 1.224
3.989AlaVal: 3.989 ± 0.677
1.064AlaTrp: 1.064 ± 0.405
2.394AlaTyr: 2.394 ± 0.452
0.0AlaXaa: 0.0 ± 0.0
Cys
0.266CysAla: 0.266 ± 0.16
0.266CysCys: 0.266 ± 0.244
1.064CysAsp: 1.064 ± 0.642
1.33CysGlu: 1.33 ± 0.543
0.266CysPhe: 0.266 ± 0.16
0.798CysGly: 0.798 ± 0.481
0.266CysHis: 0.266 ± 0.373
0.798CysIle: 0.798 ± 0.272
0.532CysLys: 0.532 ± 0.202
1.064CysLeu: 1.064 ± 0.405
0.532CysMet: 0.532 ± 0.403
1.33CysAsn: 1.33 ± 0.525
0.0CysPro: 0.0 ± 0.0
0.266CysGln: 0.266 ± 0.16
0.266CysArg: 0.266 ± 0.16
0.266CysSer: 0.266 ± 0.244
0.532CysThr: 0.532 ± 0.202
1.064CysVal: 1.064 ± 0.398
0.0CysTrp: 0.0 ± 0.0
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.596AspAla: 1.596 ± 0.35
1.064AspCys: 1.064 ± 0.398
2.66AspAsp: 2.66 ± 0.787
4.521AspGlu: 4.521 ± 1.061
2.128AspPhe: 2.128 ± 1.027
1.596AspGly: 1.596 ± 0.372
1.33AspHis: 1.33 ± 0.236
3.989AspIle: 3.989 ± 1.456
2.926AspLys: 2.926 ± 0.852
5.585AspLeu: 5.585 ± 0.748
1.064AspMet: 1.064 ± 0.654
3.191AspAsn: 3.191 ± 0.817
1.862AspPro: 1.862 ± 0.645
1.596AspGln: 1.596 ± 0.631
1.33AspArg: 1.33 ± 0.447
3.191AspSer: 3.191 ± 0.362
3.191AspThr: 3.191 ± 0.362
3.457AspVal: 3.457 ± 1.327
0.798AspTrp: 0.798 ± 0.323
1.862AspTyr: 1.862 ± 0.76
0.0AspXaa: 0.0 ± 0.0
Glu
4.787GluAla: 4.787 ± 0.769
0.798GluCys: 0.798 ± 0.272
2.66GluAsp: 2.66 ± 0.526
8.245GluGlu: 8.245 ± 0.634
3.191GluPhe: 3.191 ± 0.513
2.394GluGly: 2.394 ± 0.796
0.798GluHis: 0.798 ± 0.628
5.053GluIle: 5.053 ± 1.914
6.383GluLys: 6.383 ± 3.137
7.447GluLeu: 7.447 ± 0.618
1.596GluMet: 1.596 ± 0.48
3.457GluAsn: 3.457 ± 1.532
1.064GluPro: 1.064 ± 0.417
5.851GluGln: 5.851 ± 1.097
2.926GluArg: 2.926 ± 0.668
3.723GluSer: 3.723 ± 0.71
3.723GluThr: 3.723 ± 0.779
5.585GluVal: 5.585 ± 1.697
1.33GluTrp: 1.33 ± 0.802
2.926GluTyr: 2.926 ± 1.423
0.0GluXaa: 0.0 ± 0.0
Phe
2.394PheAla: 2.394 ± 0.457
0.532PheCys: 0.532 ± 0.321
2.394PheAsp: 2.394 ± 0.739
3.457PheGlu: 3.457 ± 1.022
1.33PhePhe: 1.33 ± 1.363
3.191PheGly: 3.191 ± 1.095
1.064PheHis: 1.064 ± 0.642
3.723PheIle: 3.723 ± 0.596
3.989PheLys: 3.989 ± 0.838
2.394PheLeu: 2.394 ± 0.452
0.532PheMet: 0.532 ± 0.309
2.128PheAsn: 2.128 ± 0.635
1.862PhePro: 1.862 ± 0.808
2.394PheGln: 2.394 ± 0.508
1.064PheArg: 1.064 ± 0.548
2.66PheSer: 2.66 ± 0.614
2.66PheThr: 2.66 ± 0.548
2.66PheVal: 2.66 ± 0.826
0.266PheTrp: 0.266 ± 0.519
1.33PheTyr: 1.33 ± 0.447
0.0PheXaa: 0.0 ± 0.0
Gly
2.128GlyAla: 2.128 ± 0.404
0.0GlyCys: 0.0 ± 0.0
2.926GlyAsp: 2.926 ± 0.647
5.053GlyGlu: 5.053 ± 1.416
1.862GlyPhe: 1.862 ± 0.363
2.66GlyGly: 2.66 ± 1.789
0.532GlyHis: 0.532 ± 0.202
3.723GlyIle: 3.723 ± 0.983
2.128GlyLys: 2.128 ± 0.399
6.117GlyLeu: 6.117 ± 1.64
1.596GlyMet: 1.596 ± 0.694
2.128GlyAsn: 2.128 ± 0.81
2.394GlyPro: 2.394 ± 0.938
1.862GlyGln: 1.862 ± 0.849
3.191GlyArg: 3.191 ± 0.725
4.255GlySer: 4.255 ± 1.417
4.255GlyThr: 4.255 ± 0.383
3.989GlyVal: 3.989 ± 0.493
0.266GlyTrp: 0.266 ± 0.16
1.33GlyTyr: 1.33 ± 0.668
0.0GlyXaa: 0.0 ± 0.0
His
1.596HisAla: 1.596 ± 0.607
0.266HisCys: 0.266 ± 0.16
0.266HisAsp: 0.266 ± 0.16
0.532HisGlu: 0.532 ± 0.202
0.798HisPhe: 0.798 ± 0.272
0.266HisGly: 0.266 ± 0.244
0.266HisHis: 0.266 ± 0.373
1.33HisIle: 1.33 ± 0.236
1.064HisLys: 1.064 ± 0.199
1.33HisLeu: 1.33 ± 0.452
0.0HisMet: 0.0 ± 0.0
0.266HisAsn: 0.266 ± 0.244
0.532HisPro: 0.532 ± 0.202
0.798HisGln: 0.798 ± 0.323
0.798HisArg: 0.798 ± 0.481
1.064HisSer: 1.064 ± 0.352
1.33HisThr: 1.33 ± 0.969
0.798HisVal: 0.798 ± 0.418
0.532HisTrp: 0.532 ± 0.202
0.798HisTyr: 0.798 ± 0.275
0.0HisXaa: 0.0 ± 0.0
Ile
5.053IleAla: 5.053 ± 0.648
0.798IleCys: 0.798 ± 0.323
3.457IleAsp: 3.457 ± 0.342
6.383IleGlu: 6.383 ± 1.741
2.926IlePhe: 2.926 ± 0.679
3.191IleGly: 3.191 ± 1.088
0.532IleHis: 0.532 ± 0.403
3.457IleIle: 3.457 ± 1.345
4.521IleLys: 4.521 ± 1.715
4.255IleLeu: 4.255 ± 1.282
1.33IleMet: 1.33 ± 0.778
2.926IleAsn: 2.926 ± 0.834
3.723IlePro: 3.723 ± 2.207
4.521IleGln: 4.521 ± 1.297
2.394IleArg: 2.394 ± 0.452
4.255IleSer: 4.255 ± 0.685
4.787IleThr: 4.787 ± 1.005
5.851IleVal: 5.851 ± 0.795
0.798IleTrp: 0.798 ± 0.323
1.862IleTyr: 1.862 ± 0.842
0.0IleXaa: 0.0 ± 0.0
Lys
3.191LysAla: 3.191 ± 0.997
0.532LysCys: 0.532 ± 0.321
2.926LysAsp: 2.926 ± 1.059
5.053LysGlu: 5.053 ± 2.836
3.191LysPhe: 3.191 ± 1.268
2.926LysGly: 2.926 ± 0.334
1.064LysHis: 1.064 ± 0.199
4.787LysIle: 4.787 ± 0.665
5.319LysLys: 5.319 ± 3.877
6.915LysLeu: 6.915 ± 1.333
1.862LysMet: 1.862 ± 0.914
4.521LysAsn: 4.521 ± 1.515
3.723LysPro: 3.723 ± 1.676
4.787LysGln: 4.787 ± 1.995
3.989LysArg: 3.989 ± 1.601
4.521LysSer: 4.521 ± 0.497
5.053LysThr: 5.053 ± 1.319
6.117LysVal: 6.117 ± 1.336
2.394LysTrp: 2.394 ± 1.128
1.862LysTyr: 1.862 ± 0.298
0.0LysXaa: 0.0 ± 0.0
Leu
4.787LeuAla: 4.787 ± 0.99
0.798LeuCys: 0.798 ± 0.481
5.585LeuAsp: 5.585 ± 0.864
5.053LeuGlu: 5.053 ± 1.899
2.66LeuPhe: 2.66 ± 0.858
3.723LeuGly: 3.723 ± 0.881
1.33LeuHis: 1.33 ± 0.452
5.319LeuIle: 5.319 ± 1.763
7.713LeuLys: 7.713 ± 1.452
6.117LeuLeu: 6.117 ± 2.418
1.33LeuMet: 1.33 ± 0.452
3.723LeuAsn: 3.723 ± 0.532
5.585LeuPro: 5.585 ± 1.496
4.521LeuGln: 4.521 ± 0.991
3.723LeuArg: 3.723 ± 0.662
7.181LeuSer: 7.181 ± 1.423
5.585LeuThr: 5.585 ± 0.828
5.319LeuVal: 5.319 ± 0.997
1.064LeuTrp: 1.064 ± 0.642
2.394LeuTyr: 2.394 ± 1.163
0.0LeuXaa: 0.0 ± 0.0
Met
2.926MetAla: 2.926 ± 0.643
0.0MetCys: 0.0 ± 0.0
0.532MetAsp: 0.532 ± 0.321
2.128MetGlu: 2.128 ± 0.661
0.532MetPhe: 0.532 ± 0.745
1.064MetGly: 1.064 ± 0.398
0.532MetHis: 0.532 ± 0.202
0.798MetIle: 0.798 ± 0.323
1.064MetLys: 1.064 ± 0.405
1.862MetLeu: 1.862 ± 0.363
0.0MetMet: 0.0 ± 0.0
1.064MetAsn: 1.064 ± 0.623
0.798MetPro: 0.798 ± 0.275
0.798MetGln: 0.798 ± 0.272
1.064MetArg: 1.064 ± 0.561
1.596MetSer: 1.596 ± 0.368
1.596MetThr: 1.596 ± 0.544
0.798MetVal: 0.798 ± 0.275
0.266MetTrp: 0.266 ± 0.16
0.532MetTyr: 0.532 ± 0.202
0.0MetXaa: 0.0 ± 0.0
Asn
2.926AsnAla: 2.926 ± 0.789
0.798AsnCys: 0.798 ± 0.272
2.394AsnAsp: 2.394 ± 0.505
3.191AsnGlu: 3.191 ± 1.547
3.457AsnPhe: 3.457 ± 2.076
2.926AsnGly: 2.926 ± 0.585
0.798AsnHis: 0.798 ± 0.553
2.128AsnIle: 2.128 ± 0.408
5.319AsnLys: 5.319 ± 1.926
5.319AsnLeu: 5.319 ± 1.115
0.266AsnMet: 0.266 ± 0.374
1.862AsnAsn: 1.862 ± 0.777
2.926AsnPro: 2.926 ± 1.068
3.191AsnGln: 3.191 ± 1.224
3.989AsnArg: 3.989 ± 1.745
3.723AsnSer: 3.723 ± 0.207
4.255AsnThr: 4.255 ± 0.905
2.926AsnVal: 2.926 ± 1.078
0.0AsnTrp: 0.0 ± 0.0
2.128AsnTyr: 2.128 ± 0.718
0.0AsnXaa: 0.0 ± 0.0
Pro
2.926ProAla: 2.926 ± 1.378
0.266ProCys: 0.266 ± 0.16
2.128ProAsp: 2.128 ± 0.417
1.862ProGlu: 1.862 ± 0.808
1.862ProPhe: 1.862 ± 0.491
2.394ProGly: 2.394 ± 0.614
0.532ProHis: 0.532 ± 0.321
5.053ProIle: 5.053 ± 1.037
3.457ProLys: 3.457 ± 1.317
2.66ProLeu: 2.66 ± 1.221
1.064ProMet: 1.064 ± 0.642
1.596ProAsn: 1.596 ± 0.35
0.266ProPro: 0.266 ± 0.244
2.394ProGln: 2.394 ± 0.9
1.064ProArg: 1.064 ± 0.975
2.128ProSer: 2.128 ± 0.546
2.66ProThr: 2.66 ± 1.538
3.723ProVal: 3.723 ± 1.366
1.33ProTrp: 1.33 ± 0.61
1.862ProTyr: 1.862 ± 0.665
0.0ProXaa: 0.0 ± 0.0
Gln
4.255GlnAla: 4.255 ± 1.447
0.798GlnCys: 0.798 ± 0.275
0.798GlnAsp: 0.798 ± 0.666
3.191GlnGlu: 3.191 ± 0.918
1.862GlnPhe: 1.862 ± 0.355
1.596GlnGly: 1.596 ± 0.694
0.532GlnHis: 0.532 ± 0.321
1.33GlnIle: 1.33 ± 0.452
6.117GlnLys: 6.117 ± 2.01
3.457GlnLeu: 3.457 ± 1.568
0.798GlnMet: 0.798 ± 0.358
3.723GlnAsn: 3.723 ± 0.577
2.128GlnPro: 2.128 ± 0.404
2.128GlnGln: 2.128 ± 0.339
2.926GlnArg: 2.926 ± 0.772
2.66GlnSer: 2.66 ± 1.216
3.457GlnThr: 3.457 ± 2.313
5.053GlnVal: 5.053 ± 0.615
0.266GlnTrp: 0.266 ± 0.373
1.596GlnTyr: 1.596 ± 0.544
0.0GlnXaa: 0.0 ± 0.0
Arg
3.457ArgAla: 3.457 ± 0.917
0.266ArgCys: 0.266 ± 0.16
1.862ArgAsp: 1.862 ± 0.849
2.394ArgGlu: 2.394 ± 0.538
2.394ArgPhe: 2.394 ± 0.843
3.191ArgGly: 3.191 ± 1.268
0.532ArgHis: 0.532 ± 0.488
2.128ArgIle: 2.128 ± 0.747
2.128ArgLys: 2.128 ± 0.404
2.394ArgLeu: 2.394 ± 0.767
0.532ArgMet: 0.532 ± 0.309
3.457ArgAsn: 3.457 ± 0.905
1.064ArgPro: 1.064 ± 0.405
3.457ArgGln: 3.457 ± 1.109
2.66ArgArg: 2.66 ± 1.099
1.33ArgSer: 1.33 ± 0.519
3.723ArgThr: 3.723 ± 1.417
4.521ArgVal: 4.521 ± 1.542
0.0ArgTrp: 0.0 ± 0.0
1.596ArgTyr: 1.596 ± 0.368
0.0ArgXaa: 0.0 ± 0.0
Ser
2.394SerAla: 2.394 ± 1.615
0.798SerCys: 0.798 ± 0.275
1.862SerAsp: 1.862 ± 0.849
3.191SerGlu: 3.191 ± 0.343
2.394SerPhe: 2.394 ± 1.045
2.926SerGly: 2.926 ± 1.183
0.266SerHis: 0.266 ± 0.16
4.521SerIle: 4.521 ± 1.575
4.255SerLys: 4.255 ± 1.01
4.255SerLeu: 4.255 ± 1.01
1.064SerMet: 1.064 ± 0.877
4.255SerAsn: 4.255 ± 1.174
1.862SerPro: 1.862 ± 0.315
1.862SerGln: 1.862 ± 0.537
2.394SerArg: 2.394 ± 0.614
3.457SerSer: 3.457 ± 1.228
6.117SerThr: 6.117 ± 0.641
8.245SerVal: 8.245 ± 3.089
0.798SerTrp: 0.798 ± 0.272
3.457SerTyr: 3.457 ± 1.143
0.0SerXaa: 0.0 ± 0.0
Thr
5.851ThrAla: 5.851 ± 2.117
0.532ThrCys: 0.532 ± 0.309
2.926ThrAsp: 2.926 ± 0.334
3.723ThrGlu: 3.723 ± 0.577
3.191ThrPhe: 3.191 ± 1.111
6.383ThrGly: 6.383 ± 0.906
1.064ThrHis: 1.064 ± 0.405
4.255ThrIle: 4.255 ± 1.717
5.585ThrLys: 5.585 ± 0.608
5.319ThrLeu: 5.319 ± 1.294
1.596ThrMet: 1.596 ± 0.629
4.255ThrAsn: 4.255 ± 1.406
3.457ThrPro: 3.457 ± 1.543
2.66ThrGln: 2.66 ± 0.911
2.66ThrArg: 2.66 ± 1.012
3.989ThrSer: 3.989 ± 1.757
7.181ThrThr: 7.181 ± 3.256
5.585ThrVal: 5.585 ± 1.422
1.064ThrTrp: 1.064 ± 0.642
1.862ThrTyr: 1.862 ± 1.123
0.0ThrXaa: 0.0 ± 0.0
Val
5.851ValAla: 5.851 ± 1.295
0.798ValCys: 0.798 ± 0.272
6.117ValAsp: 6.117 ± 1.398
7.181ValGlu: 7.181 ± 1.695
3.457ValPhe: 3.457 ± 0.825
5.319ValGly: 5.319 ± 0.962
1.064ValHis: 1.064 ± 0.199
4.521ValIle: 4.521 ± 0.654
4.255ValLys: 4.255 ± 0.798
7.979ValLeu: 7.979 ± 1.129
1.33ValMet: 1.33 ± 0.962
3.457ValAsn: 3.457 ± 1.701
2.926ValPro: 2.926 ± 1.0
2.394ValGln: 2.394 ± 0.726
2.394ValArg: 2.394 ± 0.731
4.521ValSer: 4.521 ± 0.49
6.649ValThr: 6.649 ± 2.745
4.255ValVal: 4.255 ± 0.738
1.862ValTrp: 1.862 ± 0.298
2.128ValTyr: 2.128 ± 0.483
0.0ValXaa: 0.0 ± 0.0
Trp
0.266TrpAla: 0.266 ± 0.519
0.532TrpCys: 0.532 ± 0.202
0.532TrpAsp: 0.532 ± 0.202
1.064TrpGlu: 1.064 ± 0.642
0.266TrpPhe: 0.266 ± 0.16
0.266TrpGly: 0.266 ± 0.16
0.266TrpHis: 0.266 ± 0.16
1.33TrpIle: 1.33 ± 0.236
0.532TrpLys: 0.532 ± 0.309
1.064TrpLeu: 1.064 ± 0.199
1.064TrpMet: 1.064 ± 0.398
1.33TrpAsn: 1.33 ± 0.525
0.0TrpPro: 0.0 ± 0.0
0.266TrpGln: 0.266 ± 0.16
0.532TrpArg: 0.532 ± 0.403
1.064TrpSer: 1.064 ± 0.398
1.33TrpThr: 1.33 ± 0.452
1.33TrpVal: 1.33 ± 0.525
0.266TrpTrp: 0.266 ± 0.373
1.33TrpTyr: 1.33 ± 0.532
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.128TyrAla: 2.128 ± 0.718
0.266TyrCys: 0.266 ± 0.16
2.66TyrAsp: 2.66 ± 1.029
1.862TyrGlu: 1.862 ± 0.663
0.798TyrPhe: 0.798 ± 0.272
2.128TyrGly: 2.128 ± 0.642
0.798TyrHis: 0.798 ± 0.272
3.457TyrIle: 3.457 ± 1.067
3.723TyrLys: 3.723 ± 1.573
1.596TyrLeu: 1.596 ± 0.256
0.798TyrMet: 0.798 ± 0.481
2.128TyrAsn: 2.128 ± 0.665
1.596TyrPro: 1.596 ± 0.368
1.064TyrGln: 1.064 ± 0.398
1.862TyrArg: 1.862 ± 0.472
1.33TyrSer: 1.33 ± 0.831
1.596TyrThr: 1.596 ± 0.35
3.191TyrVal: 3.191 ± 0.725
0.266TyrTrp: 0.266 ± 0.16
1.862TyrTyr: 1.862 ± 0.76
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3761 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski