Amino acid dipepetide frequency for Drosophila subobscura Nora virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
2.923AlaAla: 2.923 ± 0.637
0.797AlaCys: 0.797 ± 0.501
2.657AlaAsp: 2.657 ± 1.349
5.049AlaGlu: 5.049 ± 0.362
2.923AlaPhe: 2.923 ± 0.691
2.392AlaGly: 2.392 ± 0.875
0.797AlaHis: 0.797 ± 0.468
7.707AlaIle: 7.707 ± 1.358
4.783AlaLys: 4.783 ± 1.508
4.252AlaLeu: 4.252 ± 1.746
0.797AlaMet: 0.797 ± 0.269
2.657AlaAsn: 2.657 ± 0.943
2.923AlaPro: 2.923 ± 1.312
1.86AlaGln: 1.86 ± 0.915
0.797AlaArg: 0.797 ± 0.501
2.126AlaSer: 2.126 ± 0.547
2.657AlaThr: 2.657 ± 0.546
2.923AlaVal: 2.923 ± 0.985
1.063AlaTrp: 1.063 ± 0.432
3.189AlaTyr: 3.189 ± 0.937
0.0AlaXaa: 0.0 ± 0.0
Cys
0.266CysAla: 0.266 ± 0.167
0.266CysCys: 0.266 ± 0.277
0.797CysAsp: 0.797 ± 0.501
0.797CysGlu: 0.797 ± 0.501
0.266CysPhe: 0.266 ± 0.167
0.797CysGly: 0.797 ± 0.501
0.266CysHis: 0.266 ± 0.167
0.266CysIle: 0.266 ± 0.167
0.531CysLys: 0.531 ± 0.334
1.594CysLeu: 1.594 ± 0.696
0.797CysMet: 0.797 ± 0.269
0.531CysAsn: 0.531 ± 0.216
0.531CysPro: 0.531 ± 0.216
0.531CysGln: 0.531 ± 0.334
0.266CysArg: 0.266 ± 0.167
0.266CysSer: 0.266 ± 0.167
0.797CysThr: 0.797 ± 0.269
0.797CysVal: 0.797 ± 0.269
0.266CysTrp: 0.266 ± 0.167
0.266CysTyr: 0.266 ± 0.167
0.0CysXaa: 0.0 ± 0.0
Asp
1.329AspAla: 1.329 ± 0.674
0.797AspCys: 0.797 ± 0.269
3.455AspAsp: 3.455 ± 0.786
3.455AspGlu: 3.455 ± 0.362
2.657AspPhe: 2.657 ± 0.997
2.923AspGly: 2.923 ± 1.452
1.329AspHis: 1.329 ± 0.836
3.72AspIle: 3.72 ± 0.177
4.783AspLys: 4.783 ± 2.266
3.986AspLeu: 3.986 ± 0.452
0.797AspMet: 0.797 ± 0.468
3.72AspAsn: 3.72 ± 0.849
1.594AspPro: 1.594 ± 0.538
2.923AspGln: 2.923 ± 1.484
1.594AspArg: 1.594 ± 0.483
3.72AspSer: 3.72 ± 0.817
4.518AspThr: 4.518 ± 0.776
3.455AspVal: 3.455 ± 0.362
1.329AspTrp: 1.329 ± 0.534
1.594AspTyr: 1.594 ± 0.675
0.0AspXaa: 0.0 ± 0.0
Glu
2.126GluAla: 2.126 ± 0.633
0.266GluCys: 0.266 ± 0.167
3.455GluAsp: 3.455 ± 0.821
6.909GluGlu: 6.909 ± 0.208
2.657GluPhe: 2.657 ± 0.546
2.923GluGly: 2.923 ± 0.834
0.531GluHis: 0.531 ± 0.327
5.846GluIle: 5.846 ± 1.791
3.72GluLys: 3.72 ± 2.0
7.175GluLeu: 7.175 ± 0.407
1.86GluMet: 1.86 ± 0.516
3.72GluAsn: 3.72 ± 0.954
1.329GluPro: 1.329 ± 0.407
6.112GluGln: 6.112 ± 0.302
2.392GluArg: 2.392 ± 1.81
3.986GluSer: 3.986 ± 0.576
4.518GluThr: 4.518 ± 1.308
6.112GluVal: 6.112 ± 1.263
1.063GluTrp: 1.063 ± 0.412
1.86GluTyr: 1.86 ± 0.856
0.0GluXaa: 0.0 ± 0.0
Phe
1.594PheAla: 1.594 ± 0.651
0.266PheCys: 0.266 ± 0.167
2.923PheAsp: 2.923 ± 0.634
1.86PheGlu: 1.86 ± 0.916
0.797PhePhe: 0.797 ± 0.341
1.86PheGly: 1.86 ± 0.885
1.594PheHis: 1.594 ± 0.696
4.252PheIle: 4.252 ± 0.48
2.923PheLys: 2.923 ± 0.515
3.455PheLeu: 3.455 ± 0.706
0.531PheMet: 0.531 ± 0.476
1.063PheAsn: 1.063 ± 0.247
2.126PhePro: 2.126 ± 0.72
1.86PheGln: 1.86 ± 0.651
1.594PheArg: 1.594 ± 0.345
3.72PheSer: 3.72 ± 1.367
2.126PheThr: 2.126 ± 1.019
2.923PheVal: 2.923 ± 0.759
0.0PheTrp: 0.0 ± 0.0
1.594PheTyr: 1.594 ± 0.483
0.0PheXaa: 0.0 ± 0.0
Gly
2.126GlyAla: 2.126 ± 0.402
0.266GlyCys: 0.266 ± 0.167
3.189GlyAsp: 3.189 ± 0.755
5.049GlyGlu: 5.049 ± 1.768
1.329GlyPhe: 1.329 ± 0.25
2.923GlyGly: 2.923 ± 1.943
1.594GlyHis: 1.594 ± 0.696
4.518GlyIle: 4.518 ± 0.84
2.923GlyLys: 2.923 ± 1.234
4.783GlyLeu: 4.783 ± 1.202
2.392GlyMet: 2.392 ± 0.988
2.392GlyAsn: 2.392 ± 0.496
2.126GlyPro: 2.126 ± 0.464
1.86GlyGln: 1.86 ± 0.394
2.657GlyArg: 2.657 ± 0.916
2.657GlySer: 2.657 ± 0.916
4.252GlyThr: 4.252 ± 1.113
3.455GlyVal: 3.455 ± 0.445
0.266GlyTrp: 0.266 ± 0.167
1.594GlyTyr: 1.594 ± 0.33
0.0GlyXaa: 0.0 ± 0.0
His
1.594HisAla: 1.594 ± 0.538
0.797HisCys: 0.797 ± 0.501
0.531HisAsp: 0.531 ± 0.334
0.797HisGlu: 0.797 ± 0.269
0.531HisPhe: 0.531 ± 0.334
0.0HisGly: 0.0 ± 0.0
0.266HisHis: 0.266 ± 0.167
1.86HisIle: 1.86 ± 0.664
1.86HisLys: 1.86 ± 0.73
1.329HisLeu: 1.329 ± 0.458
0.266HisMet: 0.266 ± 0.167
0.266HisAsn: 0.266 ± 0.277
1.063HisPro: 1.063 ± 0.432
0.266HisGln: 0.266 ± 0.167
0.531HisArg: 0.531 ± 0.334
0.797HisSer: 0.797 ± 0.269
1.86HisThr: 1.86 ± 0.572
0.531HisVal: 0.531 ± 0.216
0.797HisTrp: 0.797 ± 0.341
0.797HisTyr: 0.797 ± 0.341
0.0HisXaa: 0.0 ± 0.0
Ile
3.986IleAla: 3.986 ± 1.105
0.797IleCys: 0.797 ± 0.501
5.315IleAsp: 5.315 ± 0.334
5.581IleGlu: 5.581 ± 0.97
1.594IlePhe: 1.594 ± 0.417
5.049IleGly: 5.049 ± 0.854
0.531IleHis: 0.531 ± 0.476
4.252IleIle: 4.252 ± 0.563
3.72IleLys: 3.72 ± 0.954
6.112IleLeu: 6.112 ± 1.647
0.797IleMet: 0.797 ± 0.366
5.846IleAsn: 5.846 ± 1.475
3.455IlePro: 3.455 ± 1.807
3.72IleGln: 3.72 ± 1.635
5.846IleArg: 5.846 ± 1.313
4.783IleSer: 4.783 ± 1.38
6.112IleThr: 6.112 ± 0.991
4.783IleVal: 4.783 ± 1.116
0.266IleTrp: 0.266 ± 0.167
2.392IleTyr: 2.392 ± 0.769
0.0IleXaa: 0.0 ± 0.0
Lys
2.923LysAla: 2.923 ± 1.114
1.063LysCys: 1.063 ± 0.669
3.986LysAsp: 3.986 ± 1.459
6.378LysGlu: 6.378 ± 2.181
2.392LysPhe: 2.392 ± 0.828
2.657LysGly: 2.657 ± 0.952
1.594LysHis: 1.594 ± 0.696
6.909LysIle: 6.909 ± 1.803
5.049LysLys: 5.049 ± 2.913
6.378LysLeu: 6.378 ± 2.026
2.126LysMet: 2.126 ± 0.824
3.189LysAsn: 3.189 ± 1.519
4.252LysPro: 4.252 ± 3.196
4.252LysGln: 4.252 ± 0.991
2.126LysArg: 2.126 ± 0.964
6.378LysSer: 6.378 ± 1.279
5.315LysThr: 5.315 ± 1.476
5.315LysVal: 5.315 ± 0.471
1.594LysTrp: 1.594 ± 0.539
2.392LysTyr: 2.392 ± 0.514
0.0LysXaa: 0.0 ± 0.0
Leu
6.378LeuAla: 6.378 ± 1.235
0.797LeuCys: 0.797 ± 0.501
5.581LeuAsp: 5.581 ± 1.386
5.049LeuGlu: 5.049 ± 2.151
2.657LeuPhe: 2.657 ± 0.267
3.189LeuGly: 3.189 ± 0.306
1.329LeuHis: 1.329 ± 0.836
6.112LeuIle: 6.112 ± 0.753
9.301LeuLys: 9.301 ± 1.918
3.986LeuLeu: 3.986 ± 1.432
1.594LeuMet: 1.594 ± 0.538
4.783LeuAsn: 4.783 ± 0.959
5.581LeuPro: 5.581 ± 2.161
3.72LeuGln: 3.72 ± 0.852
3.189LeuArg: 3.189 ± 0.659
8.238LeuSer: 8.238 ± 0.662
3.72LeuThr: 3.72 ± 0.36
5.315LeuVal: 5.315 ± 1.191
1.063LeuTrp: 1.063 ± 0.669
2.126LeuTyr: 2.126 ± 0.547
0.0LeuXaa: 0.0 ± 0.0
Met
0.797MetAla: 0.797 ± 0.498
0.266MetCys: 0.266 ± 0.167
0.797MetAsp: 0.797 ± 0.501
1.329MetGlu: 1.329 ± 0.638
0.531MetPhe: 0.531 ± 0.327
0.531MetGly: 0.531 ± 0.542
1.329MetHis: 1.329 ± 0.458
1.329MetIle: 1.329 ± 0.836
1.329MetLys: 1.329 ± 0.25
2.392MetLeu: 2.392 ± 0.522
0.266MetMet: 0.266 ± 0.277
2.392MetAsn: 2.392 ± 0.703
1.063MetPro: 1.063 ± 0.432
2.126MetGln: 2.126 ± 0.866
1.329MetArg: 1.329 ± 0.674
1.063MetSer: 1.063 ± 0.952
1.594MetThr: 1.594 ± 0.696
2.657MetVal: 2.657 ± 0.772
0.266MetTrp: 0.266 ± 0.167
0.266MetTyr: 0.266 ± 0.167
0.0MetXaa: 0.0 ± 0.0
Asn
2.923AsnAla: 2.923 ± 0.722
0.797AsnCys: 0.797 ± 0.269
1.594AsnAsp: 1.594 ± 0.997
3.72AsnGlu: 3.72 ± 1.05
2.923AsnPhe: 2.923 ± 0.179
2.392AsnGly: 2.392 ± 0.928
0.797AsnHis: 0.797 ± 0.341
4.518AsnIle: 4.518 ± 0.598
6.378AsnLys: 6.378 ± 2.25
5.049AsnLeu: 5.049 ± 0.578
1.594AsnMet: 1.594 ± 0.33
2.657AsnAsn: 2.657 ± 1.108
3.189AsnPro: 3.189 ± 1.192
1.86AsnGln: 1.86 ± 0.394
2.392AsnArg: 2.392 ± 0.565
4.518AsnSer: 4.518 ± 2.462
3.455AsnThr: 3.455 ± 2.205
4.783AsnVal: 4.783 ± 2.14
0.266AsnTrp: 0.266 ± 0.167
1.329AsnTyr: 1.329 ± 0.458
0.0AsnXaa: 0.0 ± 0.0
Pro
3.72ProAla: 3.72 ± 1.642
0.266ProCys: 0.266 ± 0.167
1.329ProAsp: 1.329 ± 0.674
1.86ProGlu: 1.86 ± 0.664
1.86ProPhe: 1.86 ± 0.826
2.126ProGly: 2.126 ± 0.786
0.531ProHis: 0.531 ± 0.216
3.189ProIle: 3.189 ± 0.95
2.657ProLys: 2.657 ± 0.908
3.455ProLeu: 3.455 ± 1.139
1.063ProMet: 1.063 ± 0.433
2.392ProAsn: 2.392 ± 1.282
1.594ProPro: 1.594 ± 0.651
3.189ProGln: 3.189 ± 0.383
1.594ProArg: 1.594 ± 1.017
2.657ProSer: 2.657 ± 1.174
2.923ProThr: 2.923 ± 1.562
3.986ProVal: 3.986 ± 1.772
1.594ProTrp: 1.594 ± 0.648
2.392ProTyr: 2.392 ± 0.514
0.0ProXaa: 0.0 ± 0.0
Gln
3.72GlnAla: 3.72 ± 1.017
0.266GlnCys: 0.266 ± 0.167
1.063GlnAsp: 1.063 ± 0.433
1.86GlnGlu: 1.86 ± 0.516
3.189GlnPhe: 3.189 ± 0.306
1.86GlnGly: 1.86 ± 1.17
0.797GlnHis: 0.797 ± 0.332
3.455GlnIle: 3.455 ± 1.468
3.986GlnLys: 3.986 ± 1.15
5.315GlnLeu: 5.315 ± 0.334
1.063GlnMet: 1.063 ± 0.467
3.455GlnAsn: 3.455 ± 1.831
1.594GlnPro: 1.594 ± 0.538
4.252GlnGln: 4.252 ± 1.602
1.86GlnArg: 1.86 ± 0.394
2.657GlnSer: 2.657 ± 1.108
4.252GlnThr: 4.252 ± 1.759
2.126GlnVal: 2.126 ± 0.781
0.531GlnTrp: 0.531 ± 0.334
1.594GlnTyr: 1.594 ± 0.417
0.0GlnXaa: 0.0 ± 0.0
Arg
3.455ArgAla: 3.455 ± 1.128
0.797ArgCys: 0.797 ± 0.501
2.126ArgAsp: 2.126 ± 0.402
2.923ArgGlu: 2.923 ± 1.554
2.657ArgPhe: 2.657 ± 0.695
3.189ArgGly: 3.189 ± 0.993
0.531ArgHis: 0.531 ± 0.555
1.86ArgIle: 1.86 ± 0.893
2.126ArgLys: 2.126 ± 0.495
1.86ArgLeu: 1.86 ± 0.651
1.063ArgMet: 1.063 ± 0.392
3.189ArgAsn: 3.189 ± 0.833
2.126ArgPro: 2.126 ± 0.961
1.86ArgGln: 1.86 ± 0.73
1.86ArgArg: 1.86 ± 0.328
2.126ArgSer: 2.126 ± 0.982
2.923ArgThr: 2.923 ± 0.935
3.455ArgVal: 3.455 ± 0.809
0.266ArgTrp: 0.266 ± 0.167
0.797ArgTyr: 0.797 ± 0.332
0.0ArgXaa: 0.0 ± 0.0
Ser
4.252SerAla: 4.252 ± 3.025
0.0SerCys: 0.0 ± 0.0
2.923SerAsp: 2.923 ± 0.818
4.518SerGlu: 4.518 ± 0.598
2.126SerPhe: 2.126 ± 0.949
4.783SerGly: 4.783 ± 0.302
0.797SerHis: 0.797 ± 0.468
5.049SerIle: 5.049 ± 0.457
4.252SerLys: 4.252 ± 0.48
7.175SerLeu: 7.175 ± 1.344
1.063SerMet: 1.063 ± 0.247
3.189SerAsn: 3.189 ± 1.145
1.594SerPro: 1.594 ± 0.675
1.329SerGln: 1.329 ± 0.487
3.72SerArg: 3.72 ± 0.632
4.783SerSer: 4.783 ± 1.842
5.049SerThr: 5.049 ± 1.141
5.315SerVal: 5.315 ± 2.739
1.063SerTrp: 1.063 ± 0.247
2.126SerTyr: 2.126 ± 0.464
0.0SerXaa: 0.0 ± 0.0
Thr
2.923ThrAla: 2.923 ± 0.935
0.797ThrCys: 0.797 ± 0.468
2.657ThrAsp: 2.657 ± 1.053
2.923ThrGlu: 2.923 ± 0.388
3.455ThrPhe: 3.455 ± 1.374
5.846ThrGly: 5.846 ± 1.583
0.266ThrHis: 0.266 ± 0.277
4.518ThrIle: 4.518 ± 1.701
7.972ThrLys: 7.972 ± 1.922
5.315ThrLeu: 5.315 ± 0.334
2.657ThrMet: 2.657 ± 0.916
3.72ThrAsn: 3.72 ± 1.968
3.455ThrPro: 3.455 ± 1.416
2.392ThrGln: 2.392 ± 0.979
3.189ThrArg: 3.189 ± 0.9
3.455ThrSer: 3.455 ± 0.907
5.049ThrThr: 5.049 ± 2.868
5.315ThrVal: 5.315 ± 1.507
1.063ThrTrp: 1.063 ± 0.669
2.657ThrTyr: 2.657 ± 1.053
0.0ThrXaa: 0.0 ± 0.0
Val
4.783ValAla: 4.783 ± 0.974
0.797ValCys: 0.797 ± 0.269
6.112ValAsp: 6.112 ± 0.966
6.909ValGlu: 6.909 ± 0.772
2.126ValPhe: 2.126 ± 0.786
3.986ValGly: 3.986 ± 0.904
1.063ValHis: 1.063 ± 0.528
2.392ValIle: 2.392 ± 1.082
5.049ValLys: 5.049 ± 0.695
7.175ValLeu: 7.175 ± 0.723
1.063ValMet: 1.063 ± 0.642
3.72ValAsn: 3.72 ± 0.694
3.455ValPro: 3.455 ± 1.637
3.189ValGln: 3.189 ± 3.132
1.594ValArg: 1.594 ± 1.003
5.315ValSer: 5.315 ± 1.298
5.049ValThr: 5.049 ± 1.028
3.986ValVal: 3.986 ± 0.531
1.594ValTrp: 1.594 ± 0.681
1.329ValTyr: 1.329 ± 0.458
0.0ValXaa: 0.0 ± 0.0
Trp
0.266TrpAla: 0.266 ± 0.167
0.266TrpCys: 0.266 ± 0.277
0.797TrpAsp: 0.797 ± 0.269
0.266TrpGlu: 0.266 ± 0.167
0.797TrpPhe: 0.797 ± 0.501
0.266TrpGly: 0.266 ± 0.167
0.531TrpHis: 0.531 ± 0.327
1.329TrpIle: 1.329 ± 0.54
0.531TrpLys: 0.531 ± 0.334
1.063TrpLeu: 1.063 ± 0.392
1.063TrpMet: 1.063 ± 0.247
1.063TrpAsn: 1.063 ± 0.412
0.0TrpPro: 0.0 ± 0.0
0.531TrpGln: 0.531 ± 0.334
0.531TrpArg: 0.531 ± 0.476
1.063TrpSer: 1.063 ± 0.392
1.329TrpThr: 1.329 ± 0.458
1.86TrpVal: 1.86 ± 0.73
0.266TrpTrp: 0.266 ± 0.4
1.063TrpTyr: 1.063 ± 0.711
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.189TyrAla: 3.189 ± 0.232
0.531TyrCys: 0.531 ± 0.334
2.392TyrAsp: 2.392 ± 0.875
1.329TyrGlu: 1.329 ± 0.638
1.594TyrPhe: 1.594 ± 0.696
2.657TyrGly: 2.657 ± 1.029
0.266TyrHis: 0.266 ± 0.167
2.126TyrIle: 2.126 ± 0.72
2.392TyrLys: 2.392 ± 1.064
1.86TyrLeu: 1.86 ± 0.328
0.531TyrMet: 0.531 ± 0.2
3.189TyrAsn: 3.189 ± 0.95
1.329TyrPro: 1.329 ± 1.664
1.063TyrGln: 1.063 ± 0.432
2.126TyrArg: 2.126 ± 0.633
0.797TyrSer: 0.797 ± 1.327
2.126TyrThr: 2.126 ± 0.547
1.594TyrVal: 1.594 ± 0.538
0.266TyrTrp: 0.266 ± 0.167
1.594TyrTyr: 1.594 ± 0.345
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 4 proteins (3764 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski