Amino acid dipepetide frequency for Drosophila melanogaster sigma virus (isolate Drosophila/USA/AP30/2005) (DMelSV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.776AlaAla: 3.776 ± 1.385
0.755AlaCys: 0.755 ± 0.286
3.525AlaAsp: 3.525 ± 1.277
2.769AlaGlu: 2.769 ± 0.53
1.259AlaPhe: 1.259 ± 0.543
2.769AlaGly: 2.769 ± 1.212
1.511AlaHis: 1.511 ± 0.375
2.518AlaIle: 2.518 ± 1.409
1.259AlaLys: 1.259 ± 0.528
5.539AlaLeu: 5.539 ± 0.95
1.007AlaMet: 1.007 ± 0.4
1.259AlaAsn: 1.259 ± 0.376
1.511AlaPro: 1.511 ± 0.519
2.014AlaGln: 2.014 ± 0.906
2.014AlaArg: 2.014 ± 0.87
2.518AlaSer: 2.518 ± 0.373
3.273AlaThr: 3.273 ± 0.587
2.769AlaVal: 2.769 ± 0.309
0.504AlaTrp: 0.504 ± 0.294
2.769AlaTyr: 2.769 ± 1.209
0.0AlaXaa: 0.0 ± 0.0
Cys
0.252CysAla: 0.252 ± 0.32
0.0CysCys: 0.0 ± 0.0
0.504CysAsp: 0.504 ± 0.276
0.0CysGlu: 0.0 ± 0.0
0.504CysPhe: 0.504 ± 0.279
1.007CysGly: 1.007 ± 0.358
0.252CysHis: 0.252 ± 0.334
0.0CysIle: 0.0 ± 0.0
1.007CysLys: 1.007 ± 0.505
1.511CysLeu: 1.511 ± 0.495
0.0CysMet: 0.0 ± 0.0
0.252CysAsn: 0.252 ± 0.143
1.007CysPro: 1.007 ± 0.926
0.252CysGln: 0.252 ± 0.334
1.007CysArg: 1.007 ± 0.62
3.525CysSer: 3.525 ± 0.691
1.511CysThr: 1.511 ± 0.859
1.259CysVal: 1.259 ± 0.716
0.252CysTrp: 0.252 ± 0.143
0.252CysTyr: 0.252 ± 0.143
0.0CysXaa: 0.0 ± 0.0
Asp
3.021AspAla: 3.021 ± 0.34
1.007AspCys: 1.007 ± 0.278
1.511AspAsp: 1.511 ± 0.375
4.28AspGlu: 4.28 ± 0.944
3.021AspPhe: 3.021 ± 0.917
3.273AspGly: 3.273 ± 0.704
1.762AspHis: 1.762 ± 0.271
4.028AspIle: 4.028 ± 0.859
1.511AspLys: 1.511 ± 0.447
6.546AspLeu: 6.546 ± 0.772
1.762AspMet: 1.762 ± 0.484
2.014AspAsn: 2.014 ± 0.648
4.028AspPro: 4.028 ± 1.141
2.518AspGln: 2.518 ± 0.808
2.769AspArg: 2.769 ± 0.635
2.518AspSer: 2.518 ± 0.611
3.273AspThr: 3.273 ± 1.263
3.273AspVal: 3.273 ± 0.885
0.252AspTrp: 0.252 ± 0.427
2.266AspTyr: 2.266 ± 0.741
0.0AspXaa: 0.0 ± 0.0
Glu
2.266GluAla: 2.266 ± 0.755
0.755GluCys: 0.755 ± 0.596
3.776GluAsp: 3.776 ± 0.73
3.021GluGlu: 3.021 ± 0.782
2.266GluPhe: 2.266 ± 1.208
5.035GluGly: 5.035 ± 1.341
0.755GluHis: 0.755 ± 0.722
5.539GluIle: 5.539 ± 0.648
3.021GluLys: 3.021 ± 0.738
4.783GluLeu: 4.783 ± 0.612
1.511GluMet: 1.511 ± 0.608
2.266GluAsn: 2.266 ± 0.933
1.762GluPro: 1.762 ± 0.839
2.014GluGln: 2.014 ± 0.471
2.518GluArg: 2.518 ± 0.61
3.776GluSer: 3.776 ± 0.553
3.273GluThr: 3.273 ± 1.173
3.525GluVal: 3.525 ± 0.884
1.511GluTrp: 1.511 ± 0.602
2.769GluTyr: 2.769 ± 0.723
0.0GluXaa: 0.0 ± 0.0
Phe
1.511PheAla: 1.511 ± 0.654
0.504PheCys: 0.504 ± 0.421
1.511PheAsp: 1.511 ± 0.447
1.259PheGlu: 1.259 ± 0.339
0.755PhePhe: 0.755 ± 0.422
2.769PheGly: 2.769 ± 0.972
0.504PheHis: 0.504 ± 0.447
2.266PheIle: 2.266 ± 0.296
3.021PheLys: 3.021 ± 0.661
4.28PheLeu: 4.28 ± 1.602
1.007PheMet: 1.007 ± 0.327
1.007PheAsn: 1.007 ± 0.572
4.28PhePro: 4.28 ± 0.86
1.511PheGln: 1.511 ± 0.843
2.518PheArg: 2.518 ± 1.431
3.525PheSer: 3.525 ± 0.926
2.518PheThr: 2.518 ± 1.417
4.028PheVal: 4.028 ± 1.046
0.755PheTrp: 0.755 ± 0.286
0.755PheTyr: 0.755 ± 0.286
0.0PheXaa: 0.0 ± 0.0
Gly
2.266GlyAla: 2.266 ± 0.972
0.755GlyCys: 0.755 ± 0.562
3.525GlyAsp: 3.525 ± 1.76
4.532GlyGlu: 4.532 ± 1.907
2.266GlyPhe: 2.266 ± 0.4
3.273GlyGly: 3.273 ± 0.649
1.259GlyHis: 1.259 ± 0.604
5.035GlyIle: 5.035 ± 1.22
2.518GlyLys: 2.518 ± 0.677
7.301GlyLeu: 7.301 ± 1.676
1.762GlyMet: 1.762 ± 0.772
2.014GlyAsn: 2.014 ± 0.56
2.266GlyPro: 2.266 ± 0.644
3.021GlyGln: 3.021 ± 0.626
2.014GlyArg: 2.014 ± 1.239
4.28GlySer: 4.28 ± 0.938
3.273GlyThr: 3.273 ± 0.83
3.273GlyVal: 3.273 ± 0.582
1.762GlyTrp: 1.762 ± 0.484
3.273GlyTyr: 3.273 ± 1.252
0.0GlyXaa: 0.0 ± 0.0
His
0.755HisAla: 0.755 ± 0.375
0.252HisCys: 0.252 ± 0.32
1.259HisAsp: 1.259 ± 0.524
1.259HisGlu: 1.259 ± 0.485
0.504HisPhe: 0.504 ± 0.279
0.755HisGly: 0.755 ± 0.321
0.252HisHis: 0.252 ± 0.328
1.762HisIle: 1.762 ± 0.523
1.511HisLys: 1.511 ± 0.679
2.769HisLeu: 2.769 ± 0.624
0.252HisMet: 0.252 ± 0.32
0.755HisAsn: 0.755 ± 0.308
3.273HisPro: 3.273 ± 0.865
2.014HisGln: 2.014 ± 0.556
2.014HisArg: 2.014 ± 0.658
2.014HisSer: 2.014 ± 0.496
1.762HisThr: 1.762 ± 0.506
2.266HisVal: 2.266 ± 1.033
0.504HisTrp: 0.504 ± 0.286
2.014HisTyr: 2.014 ± 0.746
0.0HisXaa: 0.0 ± 0.0
Ile
3.273IleAla: 3.273 ± 0.76
1.007IleCys: 1.007 ± 0.358
3.273IleAsp: 3.273 ± 0.864
3.525IleGlu: 3.525 ± 0.733
1.762IlePhe: 1.762 ± 0.632
5.035IleGly: 5.035 ± 0.898
2.769IleHis: 2.769 ± 0.598
2.014IleIle: 2.014 ± 0.382
4.532IleLys: 4.532 ± 0.792
7.553IleLeu: 7.553 ± 1.485
1.259IleMet: 1.259 ± 0.823
4.028IleAsn: 4.028 ± 0.759
4.028IlePro: 4.028 ± 1.031
3.273IleGln: 3.273 ± 1.076
4.28IleArg: 4.28 ± 1.549
6.042IleSer: 6.042 ± 1.737
3.776IleThr: 3.776 ± 0.879
3.776IleVal: 3.776 ± 1.141
0.755IleTrp: 0.755 ± 0.429
1.511IleTyr: 1.511 ± 0.482
0.0IleXaa: 0.0 ± 0.0
Lys
2.014LysAla: 2.014 ± 0.407
1.259LysCys: 1.259 ± 0.604
2.266LysAsp: 2.266 ± 0.571
3.021LysGlu: 3.021 ± 1.025
0.755LysPhe: 0.755 ± 0.422
3.021LysGly: 3.021 ± 0.675
1.259LysHis: 1.259 ± 0.637
3.776LysIle: 3.776 ± 1.725
1.259LysLys: 1.259 ± 0.582
4.028LysLeu: 4.028 ± 0.911
1.259LysMet: 1.259 ± 0.405
1.762LysAsn: 1.762 ± 0.297
3.021LysPro: 3.021 ± 0.661
1.259LysGln: 1.259 ± 0.339
2.518LysArg: 2.518 ± 0.77
4.532LysSer: 4.532 ± 1.135
3.273LysThr: 3.273 ± 1.364
3.776LysVal: 3.776 ± 1.103
1.762LysTrp: 1.762 ± 0.786
1.762LysTyr: 1.762 ± 0.843
0.0LysXaa: 0.0 ± 0.0
Leu
7.301LeuAla: 7.301 ± 0.802
1.259LeuCys: 1.259 ± 0.915
4.783LeuAsp: 4.783 ± 0.86
5.287LeuGlu: 5.287 ± 0.717
3.525LeuPhe: 3.525 ± 0.418
4.783LeuGly: 4.783 ± 0.518
3.525LeuHis: 3.525 ± 0.435
7.805LeuIle: 7.805 ± 1.7
4.532LeuLys: 4.532 ± 0.542
6.546LeuLeu: 6.546 ± 1.236
4.028LeuMet: 4.028 ± 1.078
4.783LeuAsn: 4.783 ± 0.896
3.776LeuPro: 3.776 ± 0.919
2.518LeuGln: 2.518 ± 0.617
6.546LeuArg: 6.546 ± 1.68
7.805LeuSer: 7.805 ± 1.835
9.063LeuThr: 9.063 ± 1.447
5.287LeuVal: 5.287 ± 1.423
0.252LeuTrp: 0.252 ± 0.143
4.532LeuTyr: 4.532 ± 1.482
0.0LeuXaa: 0.0 ± 0.0
Met
2.518MetAla: 2.518 ± 0.373
0.504MetCys: 0.504 ± 0.279
1.762MetAsp: 1.762 ± 0.636
1.511MetGlu: 1.511 ± 0.392
1.007MetPhe: 1.007 ± 0.413
2.014MetGly: 2.014 ± 0.556
0.252MetHis: 0.252 ± 0.143
2.518MetIle: 2.518 ± 1.166
0.755MetLys: 0.755 ± 0.286
1.762MetLeu: 1.762 ± 0.813
1.259MetMet: 1.259 ± 0.669
2.769MetAsn: 2.769 ± 1.051
0.504MetPro: 0.504 ± 0.276
0.755MetGln: 0.755 ± 0.321
0.504MetArg: 0.504 ± 0.286
1.259MetSer: 1.259 ± 0.355
2.518MetThr: 2.518 ± 0.672
1.259MetVal: 1.259 ± 0.404
0.755MetTrp: 0.755 ± 0.784
1.762MetTyr: 1.762 ± 0.573
0.0MetXaa: 0.0 ± 0.0
Asn
1.762AsnAla: 1.762 ± 0.806
1.259AsnCys: 1.259 ± 0.501
1.007AsnAsp: 1.007 ± 0.572
2.014AsnGlu: 2.014 ± 0.862
3.273AsnPhe: 3.273 ± 0.929
2.266AsnGly: 2.266 ± 0.624
1.762AsnHis: 1.762 ± 0.323
2.769AsnIle: 2.769 ± 1.046
2.769AsnLys: 2.769 ± 1.107
5.791AsnLeu: 5.791 ± 1.094
0.755AsnMet: 0.755 ± 0.378
1.762AsnAsn: 1.762 ± 0.484
3.776AsnPro: 3.776 ± 0.446
1.762AsnGln: 1.762 ± 0.673
2.518AsnArg: 2.518 ± 0.935
4.028AsnSer: 4.028 ± 1.113
1.762AsnThr: 1.762 ± 0.553
2.014AsnVal: 2.014 ± 0.78
0.504AsnTrp: 0.504 ± 0.277
2.769AsnTyr: 2.769 ± 0.597
0.0AsnXaa: 0.0 ± 0.0
Pro
2.266ProAla: 2.266 ± 0.585
0.252ProCys: 0.252 ± 0.334
3.525ProAsp: 3.525 ± 0.932
4.28ProGlu: 4.28 ± 1.79
2.014ProPhe: 2.014 ± 1.124
4.28ProGly: 4.28 ± 1.416
1.259ProHis: 1.259 ± 0.716
2.769ProIle: 2.769 ± 1.056
1.762ProLys: 1.762 ± 0.533
6.798ProLeu: 6.798 ± 0.987
1.511ProMet: 1.511 ± 0.586
1.762ProAsn: 1.762 ± 0.878
3.776ProPro: 3.776 ± 1.143
1.762ProGln: 1.762 ± 0.805
1.762ProArg: 1.762 ± 0.854
6.042ProSer: 6.042 ± 1.187
3.021ProThr: 3.021 ± 1.494
3.525ProVal: 3.525 ± 0.647
0.504ProTrp: 0.504 ± 0.276
1.259ProTyr: 1.259 ± 0.298
0.0ProXaa: 0.0 ± 0.0
Gln
0.755GlnAla: 0.755 ± 0.429
0.504GlnCys: 0.504 ± 0.279
3.273GlnAsp: 3.273 ± 0.825
3.525GlnGlu: 3.525 ± 1.212
1.259GlnPhe: 1.259 ± 0.513
2.014GlnGly: 2.014 ± 0.591
0.504GlnHis: 0.504 ± 0.294
1.511GlnIle: 1.511 ± 0.251
1.762GlnLys: 1.762 ± 0.632
4.028GlnLeu: 4.028 ± 1.121
1.259GlnMet: 1.259 ± 0.71
1.762GlnAsn: 1.762 ± 0.786
1.007GlnPro: 1.007 ± 0.88
0.755GlnGln: 0.755 ± 0.308
2.014GlnArg: 2.014 ± 0.553
3.776GlnSer: 3.776 ± 0.595
2.518GlnThr: 2.518 ± 0.739
3.525GlnVal: 3.525 ± 0.952
0.0GlnTrp: 0.0 ± 0.0
0.504GlnTyr: 0.504 ± 0.543
0.0GlnXaa: 0.0 ± 0.0
Arg
3.776ArgAla: 3.776 ± 1.223
0.252ArgCys: 0.252 ± 0.143
3.273ArgAsp: 3.273 ± 0.698
2.518ArgGlu: 2.518 ± 0.843
3.776ArgPhe: 3.776 ± 0.805
3.273ArgGly: 3.273 ± 0.337
1.007ArgHis: 1.007 ± 0.39
4.028ArgIle: 4.028 ± 1.415
3.273ArgLys: 3.273 ± 1.179
4.028ArgLeu: 4.028 ± 0.641
1.259ArgMet: 1.259 ± 0.716
3.273ArgAsn: 3.273 ± 1.287
1.762ArgPro: 1.762 ± 0.871
2.014ArgGln: 2.014 ± 0.605
2.266ArgArg: 2.266 ± 0.457
4.28ArgSer: 4.28 ± 1.208
4.028ArgThr: 4.028 ± 1.279
2.769ArgVal: 2.769 ± 0.774
1.259ArgTrp: 1.259 ± 0.716
2.266ArgTyr: 2.266 ± 0.87
0.0ArgXaa: 0.0 ± 0.0
Ser
2.769SerAla: 2.769 ± 0.748
1.511SerCys: 1.511 ± 0.859
4.028SerAsp: 4.028 ± 0.72
3.525SerGlu: 3.525 ± 0.874
4.028SerPhe: 4.028 ± 1.007
4.532SerGly: 4.532 ± 1.48
3.776SerHis: 3.776 ± 0.495
6.546SerIle: 6.546 ± 0.733
3.525SerLys: 3.525 ± 0.857
8.056SerLeu: 8.056 ± 0.784
2.266SerMet: 2.266 ± 0.705
4.28SerAsn: 4.28 ± 0.699
5.035SerPro: 5.035 ± 1.124
2.014SerGln: 2.014 ± 0.377
4.783SerArg: 4.783 ± 1.575
5.287SerSer: 5.287 ± 1.27
4.783SerThr: 4.783 ± 1.769
5.791SerVal: 5.791 ± 1.499
2.518SerTrp: 2.518 ± 0.913
3.021SerTyr: 3.021 ± 0.991
0.0SerXaa: 0.0 ± 0.0
Thr
2.518ThrAla: 2.518 ± 0.523
1.259ThrCys: 1.259 ± 0.376
4.28ThrAsp: 4.28 ± 1.275
3.273ThrGlu: 3.273 ± 0.672
1.762ThrPhe: 1.762 ± 0.484
2.769ThrGly: 2.769 ± 1.04
1.511ThrHis: 1.511 ± 0.482
4.783ThrIle: 4.783 ± 1.343
4.532ThrLys: 4.532 ± 0.753
5.287ThrLeu: 5.287 ± 1.159
0.755ThrMet: 0.755 ± 0.322
3.776ThrAsn: 3.776 ± 0.485
3.525ThrPro: 3.525 ± 1.301
2.769ThrGln: 2.769 ± 0.809
5.539ThrArg: 5.539 ± 0.606
6.042ThrSer: 6.042 ± 1.386
6.042ThrThr: 6.042 ± 1.383
4.532ThrVal: 4.532 ± 0.565
1.259ThrTrp: 1.259 ± 0.405
1.511ThrTyr: 1.511 ± 0.426
0.0ThrXaa: 0.0 ± 0.0
Val
2.014ValAla: 2.014 ± 0.49
1.259ValCys: 1.259 ± 0.501
3.776ValAsp: 3.776 ± 1.248
2.769ValGlu: 2.769 ± 0.927
3.776ValPhe: 3.776 ± 0.686
2.518ValGly: 2.518 ± 0.44
1.511ValHis: 1.511 ± 0.566
4.783ValIle: 4.783 ± 1.0
3.525ValLys: 3.525 ± 1.035
5.035ValLeu: 5.035 ± 1.746
2.266ValMet: 2.266 ± 0.715
3.273ValAsn: 3.273 ± 0.896
3.021ValPro: 3.021 ± 0.652
1.762ValGln: 1.762 ± 0.297
5.035ValArg: 5.035 ± 1.063
5.287ValSer: 5.287 ± 1.605
6.042ValThr: 6.042 ± 0.484
3.776ValVal: 3.776 ± 0.806
1.007ValTrp: 1.007 ± 0.376
2.014ValTyr: 2.014 ± 0.986
0.0ValXaa: 0.0 ± 0.0
Trp
0.252TrpAla: 0.252 ± 0.143
0.0TrpCys: 0.0 ± 0.0
0.755TrpAsp: 0.755 ± 0.517
1.259TrpGlu: 1.259 ± 0.441
1.007TrpPhe: 1.007 ± 0.413
1.259TrpGly: 1.259 ± 0.464
0.504TrpHis: 0.504 ± 0.376
1.007TrpIle: 1.007 ± 0.572
0.504TrpLys: 0.504 ± 0.286
1.259TrpLeu: 1.259 ± 0.704
1.007TrpMet: 1.007 ± 0.552
1.511TrpAsn: 1.511 ± 0.571
0.755TrpPro: 0.755 ± 0.429
0.755TrpGln: 0.755 ± 0.286
0.0TrpArg: 0.0 ± 0.0
2.518TrpSer: 2.518 ± 0.737
0.252TrpThr: 0.252 ± 0.334
1.007TrpVal: 1.007 ± 0.278
0.0TrpTrp: 0.0 ± 0.0
0.504TrpTyr: 0.504 ± 0.276
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.252TyrAla: 0.252 ± 0.143
0.0TyrCys: 0.0 ± 0.0
3.021TyrAsp: 3.021 ± 0.551
2.518TyrGlu: 2.518 ± 1.017
1.762TyrPhe: 1.762 ± 0.323
3.021TyrGly: 3.021 ± 0.737
2.014TyrHis: 2.014 ± 0.564
2.014TyrIle: 2.014 ± 0.496
1.007TyrLys: 1.007 ± 0.278
5.035TyrLeu: 5.035 ± 0.721
1.511TyrMet: 1.511 ± 0.392
2.266TyrAsn: 2.266 ± 0.953
2.014TyrPro: 2.014 ± 1.113
1.511TyrGln: 1.511 ± 0.773
1.762TyrArg: 1.762 ± 0.484
3.021TyrSer: 3.021 ± 0.86
1.762TyrThr: 1.762 ± 0.476
3.021TyrVal: 3.021 ± 1.018
0.0TyrTrp: 0.0 ± 0.0
1.762TyrTyr: 1.762 ± 0.664
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 6 proteins (3973 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski