Amino acid dipepetide frequency for Diabrotica virgifera virgifera virus 3

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
4.348AlaAla: 4.348 ± 0.814
0.767AlaCys: 0.767 ± 0.087
1.79AlaAsp: 1.79 ± 0.039
3.581AlaGlu: 3.581 ± 0.411
4.092AlaPhe: 4.092 ± 0.789
2.558AlaGly: 2.558 ± 0.126
1.535AlaHis: 1.535 ± 0.316
4.859AlaIle: 4.859 ± 1.083
5.371AlaLys: 5.371 ± 0.862
4.092AlaLeu: 4.092 ± 0.299
1.535AlaMet: 1.535 ± 0.806
4.092AlaAsn: 4.092 ± 1.768
1.279AlaPro: 1.279 ± 1.287
2.302AlaGln: 2.302 ± 0.75
2.813AlaArg: 2.813 ± 0.008
4.859AlaSer: 4.859 ± 0.104
4.604AlaThr: 4.604 ± 2.478
4.604AlaVal: 4.604 ± 1.499
0.767AlaTrp: 0.767 ± 0.087
1.79AlaTyr: 1.79 ± 0.039
0.0AlaXaa: 0.0 ± 0.0
Cys
0.256CysAla: 0.256 ± 0.134
0.256CysCys: 0.256 ± 0.134
1.023CysAsp: 1.023 ± 0.442
0.512CysGlu: 0.512 ± 0.269
0.767CysPhe: 0.767 ± 0.087
1.023CysGly: 1.023 ± 0.048
0.0CysHis: 0.0 ± 0.0
0.767CysIle: 0.767 ± 0.403
1.023CysLys: 1.023 ± 0.537
1.023CysLeu: 1.023 ± 0.537
1.023CysMet: 1.023 ± 0.537
0.512CysAsn: 0.512 ± 0.269
0.767CysPro: 0.767 ± 0.087
0.512CysGln: 0.512 ± 0.269
0.512CysArg: 0.512 ± 0.269
1.023CysSer: 1.023 ± 0.442
0.767CysThr: 0.767 ± 0.403
1.279CysVal: 1.279 ± 0.308
0.0CysTrp: 0.0 ± 0.0
0.512CysTyr: 0.512 ± 0.221
0.0CysXaa: 0.0 ± 0.0
Asp
4.348AspAla: 4.348 ± 0.165
1.79AspCys: 1.79 ± 0.039
2.302AspAsp: 2.302 ± 1.208
2.046AspGlu: 2.046 ± 0.884
4.092AspPhe: 4.092 ± 0.68
2.558AspGly: 2.558 ± 1.105
1.279AspHis: 1.279 ± 0.671
4.348AspIle: 4.348 ± 0.325
3.069AspLys: 3.069 ± 1.611
6.138AspLeu: 6.138 ± 1.183
1.79AspMet: 1.79 ± 0.94
3.581AspAsn: 3.581 ± 0.411
2.558AspPro: 2.558 ± 0.126
0.767AspGln: 0.767 ± 0.403
2.046AspArg: 2.046 ± 0.585
4.092AspSer: 4.092 ± 2.257
5.115AspThr: 5.115 ± 0.727
3.325AspVal: 3.325 ± 1.192
1.023AspTrp: 1.023 ± 0.048
2.302AspTyr: 2.302 ± 0.719
0.0AspXaa: 0.0 ± 0.0
Glu
3.069GluAla: 3.069 ± 0.347
0.767GluCys: 0.767 ± 0.087
3.325GluAsp: 3.325 ± 0.277
4.604GluGlu: 4.604 ± 1.438
2.813GluPhe: 2.813 ± 0.987
3.069GluGly: 3.069 ± 0.347
0.512GluHis: 0.512 ± 0.221
2.558GluIle: 2.558 ± 0.364
3.581GluLys: 3.581 ± 1.39
4.604GluLeu: 4.604 ± 0.948
1.023GluMet: 1.023 ± 0.048
3.325GluAsn: 3.325 ± 0.702
2.046GluPro: 2.046 ± 1.373
3.069GluGln: 3.069 ± 0.143
2.813GluArg: 2.813 ± 0.498
3.325GluSer: 3.325 ± 0.767
3.069GluThr: 3.069 ± 1.122
3.836GluVal: 3.836 ± 0.546
0.0GluTrp: 0.0 ± 0.0
2.302GluTyr: 2.302 ± 0.26
0.0GluXaa: 0.0 ± 0.0
Phe
3.581PheAla: 3.581 ± 0.901
0.0PheCys: 0.0 ± 0.0
4.348PheAsp: 4.348 ± 0.165
3.069PheGlu: 3.069 ± 1.611
1.79PhePhe: 1.79 ± 0.039
2.046PheGly: 2.046 ± 0.394
1.023PheHis: 1.023 ± 0.537
2.302PheIle: 2.302 ± 0.719
3.581PheLys: 3.581 ± 0.078
1.79PheLeu: 1.79 ± 0.039
0.512PheMet: 0.512 ± 0.269
3.836PheAsn: 3.836 ± 0.433
1.023PhePro: 1.023 ± 0.537
1.023PheGln: 1.023 ± 0.442
0.256PheArg: 0.256 ± 0.134
3.836PheSer: 3.836 ± 0.546
3.581PheThr: 3.581 ± 1.39
3.581PheVal: 3.581 ± 0.411
0.512PheTrp: 0.512 ± 0.269
0.767PheTyr: 0.767 ± 0.403
0.0PheXaa: 0.0 ± 0.0
Gly
3.069GlyAla: 3.069 ± 0.836
0.256GlyCys: 0.256 ± 0.134
3.069GlyAsp: 3.069 ± 0.347
2.813GlyGlu: 2.813 ± 0.498
2.046GlyPhe: 2.046 ± 0.095
1.79GlyGly: 1.79 ± 0.039
1.79GlyHis: 1.79 ± 0.94
4.604GlyIle: 4.604 ± 0.031
2.558GlyLys: 2.558 ± 0.364
4.604GlyLeu: 4.604 ± 1.01
1.279GlyMet: 1.279 ± 0.308
3.325GlyAsn: 3.325 ± 2.171
1.79GlyPro: 1.79 ± 1.508
0.767GlyGln: 0.767 ± 0.087
1.279GlyArg: 1.279 ± 0.797
2.813GlySer: 2.813 ± 1.46
3.325GlyThr: 3.325 ± 2.66
4.604GlyVal: 4.604 ± 1.01
0.256GlyTrp: 0.256 ± 0.355
2.046GlyTyr: 2.046 ± 0.585
0.0GlyXaa: 0.0 ± 0.0
His
1.023HisAla: 1.023 ± 0.048
0.0HisCys: 0.0 ± 0.0
0.512HisAsp: 0.512 ± 0.269
0.512HisGlu: 0.512 ± 0.221
2.046HisPhe: 2.046 ± 0.585
1.79HisGly: 1.79 ± 0.039
0.256HisHis: 0.256 ± 0.134
2.046HisIle: 2.046 ± 1.074
1.535HisLys: 1.535 ± 0.806
2.046HisLeu: 2.046 ± 0.095
0.512HisMet: 0.512 ± 0.269
0.256HisAsn: 0.256 ± 0.134
0.767HisPro: 0.767 ± 0.087
0.767HisGln: 0.767 ± 0.403
1.535HisArg: 1.535 ± 0.316
2.046HisSer: 2.046 ± 0.095
0.767HisThr: 0.767 ± 0.403
1.023HisVal: 1.023 ± 0.048
0.767HisTrp: 0.767 ± 0.403
1.79HisTyr: 1.79 ± 0.039
0.0HisXaa: 0.0 ± 0.0
Ile
4.604IleAla: 4.604 ± 0.948
1.023IleCys: 1.023 ± 0.537
5.115IleAsp: 5.115 ± 0.238
4.092IleGlu: 4.092 ± 0.299
1.279IlePhe: 1.279 ± 0.182
4.604IleGly: 4.604 ± 1.927
0.256IleHis: 0.256 ± 0.134
6.905IleIle: 6.905 ± 1.178
6.138IleLys: 6.138 ± 1.754
4.859IleLeu: 4.859 ± 1.083
0.767IleMet: 0.767 ± 0.576
5.115IleAsn: 5.115 ± 1.217
2.813IlePro: 2.813 ± 0.008
2.558IleGln: 2.558 ± 0.126
1.279IleArg: 1.279 ± 0.308
4.348IleSer: 4.348 ± 0.325
5.627IleThr: 5.627 ± 0.017
5.115IleVal: 5.115 ± 0.727
1.023IleTrp: 1.023 ± 0.048
4.092IleTyr: 4.092 ± 1.169
0.0IleXaa: 0.0 ± 0.0
Lys
5.627LysAla: 5.627 ± 0.473
1.023LysCys: 1.023 ± 0.537
3.325LysAsp: 3.325 ± 0.277
5.115LysGlu: 5.115 ± 1.217
2.558LysPhe: 2.558 ± 1.343
2.813LysGly: 2.813 ± 0.498
2.302LysHis: 2.302 ± 0.719
5.627LysIle: 5.627 ± 1.975
4.604LysLys: 4.604 ± 1.927
6.138LysLeu: 6.138 ± 1.264
2.813LysMet: 2.813 ± 0.818
4.348LysAsn: 4.348 ± 0.325
2.558LysPro: 2.558 ± 0.364
3.581LysGln: 3.581 ± 0.411
1.279LysArg: 1.279 ± 0.671
5.882LysSer: 5.882 ± 0.151
3.836LysThr: 3.836 ± 0.433
6.65LysVal: 6.65 ± 2.023
0.512LysTrp: 0.512 ± 0.221
3.325LysTyr: 3.325 ± 1.746
0.0LysXaa: 0.0 ± 0.0
Leu
5.115LeuAla: 5.115 ± 0.741
0.767LeuCys: 0.767 ± 0.403
7.417LeuAsp: 7.417 ± 1.001
5.115LeuGlu: 5.115 ± 0.727
2.813LeuPhe: 2.813 ± 1.477
3.325LeuGly: 3.325 ± 1.192
1.79LeuHis: 1.79 ± 0.039
5.115LeuIle: 5.115 ± 0.727
5.115LeuLys: 5.115 ± 0.727
5.627LeuLeu: 5.627 ± 0.962
2.046LeuMet: 2.046 ± 1.074
4.859LeuAsn: 4.859 ± 2.062
4.604LeuPro: 4.604 ± 0.52
4.092LeuGln: 4.092 ± 1.278
2.813LeuArg: 2.813 ± 1.477
4.859LeuSer: 4.859 ± 0.875
6.394LeuThr: 6.394 ± 0.07
4.859LeuVal: 4.859 ± 0.875
0.256LeuTrp: 0.256 ± 0.134
2.813LeuTyr: 2.813 ± 0.498
0.0LeuXaa: 0.0 ± 0.0
Met
2.302MetAla: 2.302 ± 0.229
0.256MetCys: 0.256 ± 0.134
1.535MetAsp: 1.535 ± 0.806
1.79MetGlu: 1.79 ± 0.45
0.767MetPhe: 0.767 ± 0.087
0.767MetGly: 0.767 ± 0.087
0.256MetHis: 0.256 ± 0.134
1.279MetIle: 1.279 ± 0.308
3.325MetLys: 3.325 ± 1.746
2.813MetLeu: 2.813 ± 0.987
0.512MetMet: 0.512 ± 0.221
1.023MetAsn: 1.023 ± 0.048
1.023MetPro: 1.023 ± 0.048
1.279MetGln: 1.279 ± 0.308
0.512MetArg: 0.512 ± 0.221
1.535MetSer: 1.535 ± 0.316
1.535MetThr: 1.535 ± 0.316
3.581MetVal: 3.581 ± 1.057
0.767MetTrp: 0.767 ± 0.087
0.512MetTyr: 0.512 ± 0.269
0.256MetXaa: 0.256 ± 0.355
Asn
3.581AsnAla: 3.581 ± 0.568
0.256AsnCys: 0.256 ± 0.134
2.813AsnAsp: 2.813 ± 0.498
2.302AsnGlu: 2.302 ± 0.26
1.535AsnPhe: 1.535 ± 0.316
2.302AsnGly: 2.302 ± 0.75
1.023AsnHis: 1.023 ± 0.537
6.65AsnIle: 6.65 ± 1.044
6.138AsnLys: 6.138 ± 1.754
5.371AsnLeu: 5.371 ± 0.607
3.069AsnMet: 3.069 ± 1.055
3.325AsnAsn: 3.325 ± 1.256
2.813AsnPro: 2.813 ± 1.46
2.558AsnGln: 2.558 ± 1.105
1.79AsnArg: 1.79 ± 0.45
4.092AsnSer: 4.092 ± 0.19
3.325AsnThr: 3.325 ± 2.171
5.371AsnVal: 5.371 ± 1.586
0.256AsnTrp: 0.256 ± 0.134
3.836AsnTyr: 3.836 ± 0.433
0.0AsnXaa: 0.0 ± 0.0
Pro
1.79ProAla: 1.79 ± 0.529
0.512ProCys: 0.512 ± 0.269
2.558ProAsp: 2.558 ± 0.126
1.279ProGlu: 1.279 ± 0.182
2.046ProPhe: 2.046 ± 1.074
2.558ProGly: 2.558 ± 2.084
1.535ProHis: 1.535 ± 0.173
2.813ProIle: 2.813 ± 0.498
4.348ProLys: 4.348 ± 1.634
2.813ProLeu: 2.813 ± 1.46
1.279ProMet: 1.279 ± 0.671
2.302ProAsn: 2.302 ± 0.26
1.535ProPro: 1.535 ± 1.152
2.558ProGln: 2.558 ± 1.105
0.512ProArg: 0.512 ± 0.269
5.115ProSer: 5.115 ± 2.21
1.79ProThr: 1.79 ± 0.529
2.046ProVal: 2.046 ± 0.394
0.0ProTrp: 0.0 ± 0.0
2.046ProTyr: 2.046 ± 0.394
0.0ProXaa: 0.0 ± 0.0
Gln
1.79GlnAla: 1.79 ± 1.018
1.023GlnCys: 1.023 ± 0.442
2.046GlnAsp: 2.046 ± 0.884
2.302GlnGlu: 2.302 ± 0.26
2.558GlnPhe: 2.558 ± 0.853
1.535GlnGly: 1.535 ± 0.173
1.535GlnHis: 1.535 ± 0.806
3.325GlnIle: 3.325 ± 0.277
3.581GlnLys: 3.581 ± 0.568
4.348GlnLeu: 4.348 ± 0.325
1.023GlnMet: 1.023 ± 0.048
2.813GlnAsn: 2.813 ± 0.971
1.79GlnPro: 1.79 ± 0.529
1.023GlnGln: 1.023 ± 0.442
1.535GlnArg: 1.535 ± 0.173
1.79GlnSer: 1.79 ± 0.039
2.558GlnThr: 2.558 ± 0.364
1.279GlnVal: 1.279 ± 0.182
0.512GlnTrp: 0.512 ± 0.221
1.279GlnTyr: 1.279 ± 0.308
0.0GlnXaa: 0.0 ± 0.0
Arg
2.558ArgAla: 2.558 ± 1.105
0.256ArgCys: 0.256 ± 0.134
1.79ArgAsp: 1.79 ± 0.45
1.023ArgGlu: 1.023 ± 0.048
1.279ArgPhe: 1.279 ± 0.182
1.535ArgGly: 1.535 ± 1.152
0.256ArgHis: 0.256 ± 0.355
1.79ArgIle: 1.79 ± 0.039
2.558ArgLys: 2.558 ± 0.853
3.069ArgLeu: 3.069 ± 1.122
2.046ArgMet: 2.046 ± 0.585
1.79ArgAsn: 1.79 ± 0.039
1.023ArgPro: 1.023 ± 0.537
1.79ArgGln: 1.79 ± 0.94
1.023ArgArg: 1.023 ± 0.048
1.79ArgSer: 1.79 ± 0.039
2.813ArgThr: 2.813 ± 0.987
2.302ArgVal: 2.302 ± 0.229
0.256ArgTrp: 0.256 ± 0.134
1.79ArgTyr: 1.79 ± 0.529
0.0ArgXaa: 0.0 ± 0.0
Ser
3.069SerAla: 3.069 ± 0.836
1.279SerCys: 1.279 ± 0.308
6.138SerAsp: 6.138 ± 0.204
2.558SerGlu: 2.558 ± 0.364
2.046SerPhe: 2.046 ± 1.373
3.325SerGly: 3.325 ± 0.702
0.767SerHis: 0.767 ± 0.403
5.882SerIle: 5.882 ± 1.13
4.604SerLys: 4.604 ± 0.459
5.371SerLeu: 5.371 ± 0.117
1.023SerMet: 1.023 ± 0.931
6.65SerAsn: 6.65 ± 0.915
3.069SerPro: 3.069 ± 0.347
2.813SerGln: 2.813 ± 0.481
3.069SerArg: 3.069 ± 0.143
5.371SerSer: 5.371 ± 1.586
6.905SerThr: 6.905 ± 2.738
5.115SerVal: 5.115 ± 1.231
0.256SerTrp: 0.256 ± 0.134
3.581SerTyr: 3.581 ± 0.568
0.0SerXaa: 0.0 ± 0.0
Thr
5.115ThrAla: 5.115 ± 1.231
1.023ThrCys: 1.023 ± 0.048
2.813ThrAsp: 2.813 ± 1.46
3.325ThrGlu: 3.325 ± 0.213
3.581ThrPhe: 3.581 ± 0.078
4.348ThrGly: 4.348 ± 1.634
1.023ThrHis: 1.023 ± 0.048
4.348ThrIle: 4.348 ± 0.165
3.581ThrLys: 3.581 ± 0.411
4.604ThrLeu: 4.604 ± 0.948
1.279ThrMet: 1.279 ± 0.797
3.836ThrAsn: 3.836 ± 0.056
3.069ThrPro: 3.069 ± 0.347
2.558ThrGln: 2.558 ± 0.364
3.069ThrArg: 3.069 ± 0.143
5.882ThrSer: 5.882 ± 1.807
5.882ThrThr: 5.882 ± 1.317
6.138ThrVal: 6.138 ± 0.204
0.512ThrTrp: 0.512 ± 0.71
2.558ThrTyr: 2.558 ± 0.364
0.0ThrXaa: 0.0 ± 0.0
Val
3.325ValAla: 3.325 ± 0.277
1.023ValCys: 1.023 ± 0.048
4.092ValAsp: 4.092 ± 0.68
4.348ValGlu: 4.348 ± 0.165
3.069ValPhe: 3.069 ± 0.143
3.069ValGly: 3.069 ± 1.326
2.813ValHis: 2.813 ± 0.498
3.836ValIle: 3.836 ± 0.433
5.371ValLys: 5.371 ± 1.351
5.627ValLeu: 5.627 ± 0.473
1.79ValMet: 1.79 ± 0.529
4.604ValAsn: 4.604 ± 0.459
3.836ValPro: 3.836 ± 0.923
3.325ValGln: 3.325 ± 0.213
2.558ValArg: 2.558 ± 0.615
5.627ValSer: 5.627 ± 1.941
4.859ValThr: 4.859 ± 0.875
6.65ValVal: 6.65 ± 0.915
0.0ValTrp: 0.0 ± 0.0
5.115ValTyr: 5.115 ± 1.231
0.0ValXaa: 0.0 ± 0.0
Trp
0.256TrpAla: 0.256 ± 0.134
0.512TrpCys: 0.512 ± 0.269
0.512TrpAsp: 0.512 ± 0.269
0.0TrpGlu: 0.0 ± 0.0
0.256TrpPhe: 0.256 ± 0.134
0.0TrpGly: 0.0 ± 0.0
0.256TrpHis: 0.256 ± 0.134
0.512TrpIle: 0.512 ± 0.221
1.023TrpLys: 1.023 ± 0.048
0.256TrpLeu: 0.256 ± 0.355
0.512TrpMet: 0.512 ± 0.221
1.279TrpAsn: 1.279 ± 0.182
0.0TrpPro: 0.0 ± 0.0
0.0TrpGln: 0.0 ± 0.0
0.256TrpArg: 0.256 ± 0.134
1.023TrpSer: 1.023 ± 0.442
0.512TrpThr: 0.512 ± 0.221
0.256TrpVal: 0.256 ± 0.355
0.0TrpTrp: 0.0 ± 0.0
0.512TrpTyr: 0.512 ± 0.221
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.558TyrAla: 2.558 ± 0.126
0.767TyrCys: 0.767 ± 0.403
2.046TyrAsp: 2.046 ± 1.074
3.325TyrGlu: 3.325 ± 0.277
1.023TyrPhe: 1.023 ± 0.537
3.069TyrGly: 3.069 ± 1.326
1.79TyrHis: 1.79 ± 0.529
2.046TyrIle: 2.046 ± 0.585
3.069TyrLys: 3.069 ± 0.632
4.348TyrLeu: 4.348 ± 1.304
1.535TyrMet: 1.535 ± 0.663
1.535TyrAsn: 1.535 ± 0.663
3.069TyrPro: 3.069 ± 0.836
2.302TyrGln: 2.302 ± 0.229
1.79TyrArg: 1.79 ± 0.039
3.581TyrSer: 3.581 ± 0.901
1.279TyrThr: 1.279 ± 0.182
3.325TyrVal: 3.325 ± 0.277
0.256TyrTrp: 0.256 ± 0.355
3.069TyrTyr: 3.069 ± 0.347
0.256TyrXaa: 0.256 ± 0.355
Xaa
0.256XaaAla: 0.256 ± 0.355
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.256XaaLeu: 0.256 ± 0.355
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (3911 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski