Amino acid dipepetide frequency for Drosophila x virus (isolate Chung/1996) (DXV)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.74AlaAla: 5.74 ± 1.207
0.883AlaCys: 0.883 ± 0.757
3.974AlaAsp: 3.974 ± 0.595
1.766AlaGlu: 1.766 ± 0.758
0.442AlaPhe: 0.442 ± 0.323
1.766AlaGly: 1.766 ± 0.366
0.0AlaHis: 0.0 ± 0.0
2.649AlaIle: 2.649 ± 0.806
3.532AlaLys: 3.532 ± 0.998
5.298AlaLeu: 5.298 ± 1.041
2.208AlaMet: 2.208 ± 0.536
3.974AlaAsn: 3.974 ± 0.68
2.208AlaPro: 2.208 ± 1.023
3.532AlaGln: 3.532 ± 0.93
1.766AlaArg: 1.766 ± 0.366
5.74AlaSer: 5.74 ± 1.207
4.857AlaThr: 4.857 ± 1.031
3.091AlaVal: 3.091 ± 1.105
0.442AlaTrp: 0.442 ± 0.337
3.974AlaTyr: 3.974 ± 1.846
0.0AlaXaa: 0.0 ± 0.0
Cys
0.0CysAla: 0.0 ± 0.0
0.0CysCys: 0.0 ± 0.0
0.442CysAsp: 0.442 ± 0.323
0.0CysGlu: 0.0 ± 0.0
0.0CysPhe: 0.0 ± 0.0
0.442CysGly: 0.442 ± 0.782
0.883CysHis: 0.883 ± 1.564
0.442CysIle: 0.442 ± 0.337
0.442CysLys: 0.442 ± 0.782
0.0CysLeu: 0.0 ± 0.0
0.442CysMet: 0.442 ± 0.575
0.0CysAsn: 0.0 ± 0.0
0.442CysPro: 0.442 ± 0.337
0.0CysGln: 0.0 ± 0.0
0.442CysArg: 0.442 ± 0.782
0.883CysSer: 0.883 ± 1.564
0.442CysThr: 0.442 ± 0.337
0.0CysVal: 0.0 ± 0.0
0.0CysTrp: 0.0 ± 0.0
0.442CysTyr: 0.442 ± 0.323
0.0CysXaa: 0.0 ± 0.0
Asp
0.883AspAla: 0.883 ± 0.647
0.442AspCys: 0.442 ± 0.782
3.974AspAsp: 3.974 ± 0.859
3.532AspGlu: 3.532 ± 1.007
1.325AspPhe: 1.325 ± 0.403
1.766AspGly: 1.766 ± 1.074
0.442AspHis: 0.442 ± 0.337
7.064AspIle: 7.064 ± 1.216
5.298AspLys: 5.298 ± 0.939
3.091AspLeu: 3.091 ± 1.105
1.766AspMet: 1.766 ± 0.366
1.325AspAsn: 1.325 ± 0.436
3.091AspPro: 3.091 ± 1.151
1.766AspGln: 1.766 ± 0.366
2.208AspArg: 2.208 ± 0.536
7.064AspSer: 7.064 ± 1.928
1.766AspThr: 1.766 ± 0.602
3.974AspVal: 3.974 ± 1.846
1.325AspTrp: 1.325 ± 0.436
1.766AspTyr: 1.766 ± 1.293
0.0AspXaa: 0.0 ± 0.0
Glu
4.415GluAla: 4.415 ± 2.179
0.0GluCys: 0.0 ± 0.0
2.208GluAsp: 2.208 ± 0.417
3.091GluGlu: 3.091 ± 1.105
0.883GluPhe: 0.883 ± 0.183
3.091GluGly: 3.091 ± 0.703
2.649GluHis: 2.649 ± 2.292
1.325GluIle: 1.325 ± 1.012
2.649GluLys: 2.649 ± 1.026
7.947GluLeu: 7.947 ± 1.311
1.325GluMet: 1.325 ± 0.436
3.091GluAsn: 3.091 ± 0.692
3.091GluPro: 3.091 ± 1.464
2.208GluGln: 2.208 ± 1.023
3.091GluArg: 3.091 ± 0.236
2.649GluSer: 2.649 ± 0.806
2.208GluThr: 2.208 ± 0.417
4.415GluVal: 4.415 ± 1.155
1.325GluTrp: 1.325 ± 0.599
3.974GluTyr: 3.974 ± 1.846
0.0GluXaa: 0.0 ± 0.0
Phe
0.883PheAla: 0.883 ± 0.647
0.0PheCys: 0.0 ± 0.0
1.325PheAsp: 1.325 ± 0.97
0.0PheGlu: 0.0 ± 0.0
0.0PhePhe: 0.0 ± 0.0
1.766PheGly: 1.766 ± 0.602
1.766PheHis: 1.766 ± 1.074
0.883PheIle: 0.883 ± 0.183
1.325PheLys: 1.325 ± 1.012
1.766PheLeu: 1.766 ± 0.707
0.442PheMet: 0.442 ± 0.323
2.208PheAsn: 2.208 ± 0.536
0.883PhePro: 0.883 ± 0.183
1.325PheGln: 1.325 ± 0.436
0.883PheArg: 0.883 ± 0.675
3.532PheSer: 3.532 ± 1.007
1.766PheThr: 1.766 ± 0.366
1.325PheVal: 1.325 ± 0.599
0.883PheTrp: 0.883 ± 0.764
0.0PheTyr: 0.0 ± 0.0
0.0PheXaa: 0.0 ± 0.0
Gly
2.649GlyAla: 2.649 ± 0.806
0.0GlyCys: 0.0 ± 0.0
3.532GlyAsp: 3.532 ± 1.559
4.857GlyGlu: 4.857 ± 0.705
2.208GlyPhe: 2.208 ± 0.749
2.208GlyGly: 2.208 ± 0.749
1.325GlyHis: 1.325 ± 0.436
3.974GlyIle: 3.974 ± 1.208
3.532GlyLys: 3.532 ± 0.998
4.415GlyLeu: 4.415 ± 0.37
0.883GlyMet: 0.883 ± 0.183
3.974GlyAsn: 3.974 ± 2.204
2.208GlyPro: 2.208 ± 0.749
3.974GlyGln: 3.974 ± 0.68
6.181GlyArg: 6.181 ± 2.549
4.857GlySer: 4.857 ± 0.143
4.857GlyThr: 4.857 ± 1.722
4.857GlyVal: 4.857 ± 0.592
0.883GlyTrp: 0.883 ± 0.675
1.766GlyTyr: 1.766 ± 0.758
0.0GlyXaa: 0.0 ± 0.0
His
0.442HisAla: 0.442 ± 0.323
0.0HisCys: 0.0 ± 0.0
0.883HisAsp: 0.883 ± 0.647
3.532HisGlu: 3.532 ± 1.914
0.0HisPhe: 0.0 ± 0.0
0.442HisGly: 0.442 ± 0.782
0.442HisHis: 0.442 ± 0.337
0.883HisIle: 0.883 ± 0.183
1.325HisLys: 1.325 ± 0.436
1.325HisLeu: 1.325 ± 0.872
0.0HisMet: 0.0 ± 0.0
0.883HisAsn: 0.883 ± 0.764
0.442HisPro: 0.442 ± 0.782
0.883HisGln: 0.883 ± 0.764
1.766HisArg: 1.766 ± 2.283
1.325HisSer: 1.325 ± 0.403
0.442HisThr: 0.442 ± 0.337
1.766HisVal: 1.766 ± 0.707
1.325HisTrp: 1.325 ± 0.403
1.766HisTyr: 1.766 ± 0.707
0.0HisXaa: 0.0 ± 0.0
Ile
4.857IleAla: 4.857 ± 1.08
0.883IleCys: 0.883 ± 1.564
1.766IleAsp: 1.766 ± 1.085
3.091IleGlu: 3.091 ± 1.19
0.442IlePhe: 0.442 ± 0.337
3.091IleGly: 3.091 ± 0.236
0.883IleHis: 0.883 ± 0.764
2.649IleIle: 2.649 ± 0.47
3.532IleLys: 3.532 ± 0.93
3.532IleLeu: 3.532 ± 0.93
2.208IleMet: 2.208 ± 1.023
4.857IleAsn: 4.857 ± 1.329
5.74IlePro: 5.74 ± 1.238
1.325IleGln: 1.325 ± 0.403
2.649IleArg: 2.649 ± 1.356
7.064IleSer: 7.064 ± 1.857
4.415IleThr: 4.415 ± 0.37
3.091IleVal: 3.091 ± 0.236
1.325IleTrp: 1.325 ± 0.436
3.974IleTyr: 3.974 ± 1.208
0.0IleXaa: 0.0 ± 0.0
Lys
2.649LysAla: 2.649 ± 1.356
0.0LysCys: 0.0 ± 0.0
3.532LysAsp: 3.532 ± 1.192
6.623LysGlu: 6.623 ± 2.317
1.325LysPhe: 1.325 ± 0.436
3.532LysGly: 3.532 ± 1.516
0.0LysHis: 0.0 ± 0.0
2.208LysIle: 2.208 ± 1.023
5.74LysLys: 5.74 ± 2.96
5.74LysLeu: 5.74 ± 1.73
0.442LysMet: 0.442 ± 0.323
3.974LysAsn: 3.974 ± 0.68
3.532LysPro: 3.532 ± 1.629
3.532LysGln: 3.532 ± 1.017
3.532LysArg: 3.532 ± 1.636
5.298LysSer: 5.298 ± 0.442
3.532LysThr: 3.532 ± 2.763
3.091LysVal: 3.091 ± 0.738
0.442LysTrp: 0.442 ± 0.337
1.325LysTyr: 1.325 ± 0.436
0.0LysXaa: 0.0 ± 0.0
Leu
6.181LeuAla: 6.181 ± 1.384
0.442LeuCys: 0.442 ± 0.782
6.181LeuAsp: 6.181 ± 1.407
3.974LeuGlu: 3.974 ± 1.208
2.649LeuPhe: 2.649 ± 0.549
5.74LeuGly: 5.74 ± 1.238
0.442LeuHis: 0.442 ± 0.337
4.857LeuIle: 4.857 ± 1.031
5.298LeuLys: 5.298 ± 1.224
9.713LeuLeu: 9.713 ± 3.366
0.883LeuMet: 0.883 ± 0.675
5.74LeuAsn: 5.74 ± 1.256
5.74LeuPro: 5.74 ± 1.207
1.766LeuGln: 1.766 ± 0.366
3.974LeuArg: 3.974 ± 1.863
7.506LeuSer: 7.506 ± 1.216
7.064LeuThr: 7.064 ± 0.762
5.298LeuVal: 5.298 ± 1.312
0.442LeuTrp: 0.442 ± 0.323
3.532LeuTyr: 3.532 ± 2.095
0.0LeuXaa: 0.0 ± 0.0
Met
1.325MetAla: 1.325 ± 0.436
0.0MetCys: 0.0 ± 0.0
0.883MetAsp: 0.883 ± 0.675
2.208MetGlu: 2.208 ± 0.578
0.442MetPhe: 0.442 ± 0.323
2.208MetGly: 2.208 ± 0.578
1.325MetHis: 1.325 ± 0.436
1.325MetIle: 1.325 ± 0.403
0.442MetLys: 0.442 ± 0.323
2.208MetLeu: 2.208 ± 0.771
1.766MetMet: 1.766 ± 0.366
3.091MetAsn: 3.091 ± 1.105
1.325MetPro: 1.325 ± 0.403
0.442MetGln: 0.442 ± 0.323
0.883MetArg: 0.883 ± 0.183
1.325MetSer: 1.325 ± 0.403
3.091MetThr: 3.091 ± 0.738
1.325MetVal: 1.325 ± 0.436
0.442MetTrp: 0.442 ± 0.323
0.883MetTyr: 0.883 ± 0.183
0.0MetXaa: 0.0 ± 0.0
Asn
3.532AsnAla: 3.532 ± 0.93
0.442AsnCys: 0.442 ± 0.337
2.208AsnAsp: 2.208 ± 0.578
3.091AsnGlu: 3.091 ± 0.738
1.766AsnPhe: 1.766 ± 0.366
2.208AsnGly: 2.208 ± 0.749
1.325AsnHis: 1.325 ± 0.872
3.532AsnIle: 3.532 ± 1.007
3.091AsnLys: 3.091 ± 0.738
4.857AsnLeu: 4.857 ± 1.22
2.208AsnMet: 2.208 ± 0.53
3.091AsnAsn: 3.091 ± 1.105
6.181AsnPro: 6.181 ± 0.745
2.649AsnGln: 2.649 ± 0.806
2.208AsnArg: 2.208 ± 0.578
4.415AsnSer: 4.415 ± 1.071
3.974AsnThr: 3.974 ± 1.228
4.857AsnVal: 4.857 ± 1.181
0.442AsnTrp: 0.442 ± 0.323
2.649AsnTyr: 2.649 ± 0.806
0.0AsnXaa: 0.0 ± 0.0
Pro
3.091ProAla: 3.091 ± 2.086
0.883ProCys: 0.883 ± 0.183
3.091ProAsp: 3.091 ± 0.703
2.649ProGlu: 2.649 ± 0.47
2.208ProPhe: 2.208 ± 0.536
3.091ProGly: 3.091 ± 1.151
0.883ProHis: 0.883 ± 0.647
5.298ProIle: 5.298 ± 1.554
3.091ProLys: 3.091 ± 1.464
3.974ProLeu: 3.974 ± 0.859
0.883ProMet: 0.883 ± 0.183
3.091ProAsn: 3.091 ± 1.105
4.857ProPro: 4.857 ± 1.732
3.091ProGln: 3.091 ± 1.19
3.974ProArg: 3.974 ± 0.68
5.74ProSer: 5.74 ± 1.207
3.532ProThr: 3.532 ± 0.732
4.415ProVal: 4.415 ± 1.032
0.883ProTrp: 0.883 ± 0.675
2.649ProTyr: 2.649 ± 0.872
0.0ProXaa: 0.0 ± 0.0
Gln
2.649GlnAla: 2.649 ± 0.872
0.442GlnCys: 0.442 ± 0.782
0.883GlnAsp: 0.883 ± 0.183
2.649GlnGlu: 2.649 ± 1.94
1.325GlnPhe: 1.325 ± 0.403
6.181GlnGly: 6.181 ± 0.745
0.442GlnHis: 0.442 ± 0.782
2.208GlnIle: 2.208 ± 0.536
3.091GlnLys: 3.091 ± 0.703
3.532GlnLeu: 3.532 ± 0.928
0.442GlnMet: 0.442 ± 0.323
1.325GlnAsn: 1.325 ± 0.436
3.974GlnPro: 3.974 ± 0.067
0.883GlnGln: 0.883 ± 0.675
2.649GlnArg: 2.649 ± 0.441
0.883GlnSer: 0.883 ± 0.183
3.091GlnThr: 3.091 ± 1.105
2.649GlnVal: 2.649 ± 1.424
0.442GlnTrp: 0.442 ± 0.337
0.442GlnTyr: 0.442 ± 0.323
0.0GlnXaa: 0.0 ± 0.0
Arg
0.883ArgAla: 0.883 ± 0.183
0.883ArgCys: 0.883 ± 1.564
2.208ArgAsp: 2.208 ± 0.771
4.857ArgGlu: 4.857 ± 1.616
0.883ArgPhe: 0.883 ± 0.675
3.091ArgGly: 3.091 ± 1.151
1.325ArgHis: 1.325 ± 0.403
4.857ArgIle: 4.857 ± 1.732
3.974ArgLys: 3.974 ± 1.798
4.415ArgLeu: 4.415 ± 1.538
2.208ArgMet: 2.208 ± 1.089
2.649ArgAsn: 2.649 ± 0.441
2.208ArgPro: 2.208 ± 0.771
1.766ArgGln: 1.766 ± 0.758
6.623ArgArg: 6.623 ± 3.062
4.415ArgSer: 4.415 ± 2.662
2.649ArgThr: 2.649 ± 0.47
5.298ArgVal: 5.298 ± 2.398
0.0ArgTrp: 0.0 ± 0.0
0.883ArgTyr: 0.883 ± 0.757
0.0ArgXaa: 0.0 ± 0.0
Ser
5.298SerAla: 5.298 ± 1.611
0.0SerCys: 0.0 ± 0.0
4.857SerAsp: 4.857 ± 1.08
3.091SerGlu: 3.091 ± 0.703
3.091SerPhe: 3.091 ± 2.086
7.506SerGly: 7.506 ± 2.153
2.649SerHis: 2.649 ± 1.6
4.857SerIle: 4.857 ± 0.795
4.415SerLys: 4.415 ± 1.957
8.83SerLeu: 8.83 ± 1.889
3.091SerMet: 3.091 ± 0.738
7.064SerAsn: 7.064 ± 0.628
5.298SerPro: 5.298 ± 1.224
3.532SerGln: 3.532 ± 1.414
5.298SerArg: 5.298 ± 3.284
7.947SerSer: 7.947 ± 1.823
3.974SerThr: 3.974 ± 0.68
3.974SerVal: 3.974 ± 1.208
0.442SerTrp: 0.442 ± 0.782
3.974SerTyr: 3.974 ± 0.595
0.0SerXaa: 0.0 ± 0.0
Thr
3.532ThrAla: 3.532 ± 1.414
0.442ThrCys: 0.442 ± 0.323
1.766ThrAsp: 1.766 ± 0.596
1.325ThrGlu: 1.325 ± 0.403
1.766ThrPhe: 1.766 ± 0.758
6.181ThrGly: 6.181 ± 1.281
1.325ThrHis: 1.325 ± 0.436
5.74ThrIle: 5.74 ± 0.321
3.974ThrLys: 3.974 ± 3.094
7.064ThrLeu: 7.064 ± 1.86
0.883ThrMet: 0.883 ± 0.183
3.532ThrAsn: 3.532 ± 0.93
3.974ThrPro: 3.974 ± 1.314
2.208ThrGln: 2.208 ± 0.417
4.415ThrArg: 4.415 ± 2.867
5.74ThrSer: 5.74 ± 0.321
3.532ThrThr: 3.532 ± 0.998
2.649ThrVal: 2.649 ± 0.806
1.766ThrTrp: 1.766 ± 0.602
2.208ThrTyr: 2.208 ± 0.536
0.0ThrXaa: 0.0 ± 0.0
Val
4.857ValAla: 4.857 ± 1.947
0.0ValCys: 0.0 ± 0.0
4.857ValAsp: 4.857 ± 1.22
2.649ValGlu: 2.649 ± 2.025
1.766ValPhe: 1.766 ± 0.366
5.74ValGly: 5.74 ± 1.238
0.442ValHis: 0.442 ± 0.337
2.208ValIle: 2.208 ± 0.536
3.974ValLys: 3.974 ± 0.906
4.857ValLeu: 4.857 ± 1.441
0.883ValMet: 0.883 ± 0.353
1.766ValAsn: 1.766 ± 0.707
3.532ValPro: 3.532 ± 1.192
1.766ValGln: 1.766 ± 1.528
3.091ValArg: 3.091 ± 1.458
7.064ValSer: 7.064 ± 1.471
4.415ValThr: 4.415 ± 1.155
1.766ValVal: 1.766 ± 0.707
0.883ValTrp: 0.883 ± 0.183
3.532ValTyr: 3.532 ± 1.007
0.0ValXaa: 0.0 ± 0.0
Trp
0.883TrpAla: 0.883 ± 0.675
0.0TrpCys: 0.0 ± 0.0
1.766TrpAsp: 1.766 ± 0.602
0.883TrpGlu: 0.883 ± 0.757
0.442TrpPhe: 0.442 ± 0.323
0.0TrpGly: 0.0 ± 0.0
0.442TrpHis: 0.442 ± 0.782
0.0TrpIle: 0.0 ± 0.0
0.0TrpLys: 0.0 ± 0.0
2.208TrpLeu: 2.208 ± 0.578
0.442TrpMet: 0.442 ± 0.323
0.442TrpAsn: 0.442 ± 0.337
0.442TrpPro: 0.442 ± 0.337
0.442TrpGln: 0.442 ± 0.323
0.0TrpArg: 0.0 ± 0.0
2.649TrpSer: 2.649 ± 0.441
1.766TrpThr: 1.766 ± 0.366
0.442TrpVal: 0.442 ± 0.323
0.0TrpTrp: 0.0 ± 0.0
0.883TrpTyr: 0.883 ± 0.675
0.0TrpXaa: 0.0 ± 0.0
Tyr
3.091TyrAla: 3.091 ± 1.19
0.0TyrCys: 0.0 ± 0.0
3.532TyrAsp: 3.532 ± 0.93
1.766TyrGlu: 1.766 ± 0.366
0.0TyrPhe: 0.0 ± 0.0
3.091TyrGly: 3.091 ± 0.236
0.883TyrHis: 0.883 ± 0.183
4.415TyrIle: 4.415 ± 1.505
1.325TyrLys: 1.325 ± 1.012
2.649TyrLeu: 2.649 ± 0.872
3.091TyrMet: 3.091 ± 0.738
2.649TyrAsn: 2.649 ± 0.549
2.208TyrPro: 2.208 ± 1.089
3.091TyrGln: 3.091 ± 0.236
0.442TyrArg: 0.442 ± 0.337
3.091TyrSer: 3.091 ± 0.738
2.649TyrThr: 2.649 ± 0.872
2.208TyrVal: 2.208 ± 0.578
0.442TyrTrp: 0.442 ± 0.323
0.883TyrTyr: 0.883 ± 0.183
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (2266 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski