Amino acid dipeptide frequency for Photobacterium gaetbulicola Gung47

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
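As a minimal sketch of what such a calculation could look like (not necessarily how Proteome-pI computes it), the snippet below counts overlapping residue pairs within each protein and reports them in per mille. The function name, the toy sequences and the use of one-letter codes are assumptions made for illustration only.

```python
from collections import Counter


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Overlapping pairs inside one protein; a pair never spans two proteins.
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
toy_proteome = ["MAAL", "MAL"]
freqs = dipeptide_per_mille(toy_proteome)
```

Because pairs are ordered, AlaLeu and LeuAla are counted separately, which is why the table is not symmetric.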

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and with equal probability in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue, for better visibility.
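The arithmetic behind the expected value can be sketched as follows. The helper names are hypothetical, and treating the uniform expectation as the exact red/blue threshold is an assumption; the page does not state how the colouring is decided. The two example values are taken from the table (LeuAla and CysCys).

```python
STANDARD = 20   # standard amino acids
WITH_XAA = 21   # plus Xaa for non-standard or unknown residues


def uniform_expectation_per_mille(alphabet_size):
    """Frequency of each dipeptide, in per mille, if all pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(observed_per_mille, expected_per_mille):
    """Label a dipeptide relative to the expectation (assumed colouring rule)."""
    if observed_per_mille > expected_per_mille:
        return "over"    # would be marked red
    if observed_per_mille < expected_per_mille:
        return "under"   # would be marked blue
    return "as expected"


expected = uniform_expectation_per_mille(STANDARD)   # 1000 / 400
leu_ala = classify(10.764, expected)                 # LeuAla from the table
cys_cys = classify(0.199, expected)                  # CysCys from the table
```

With 21 symbols the expectation drops slightly, to 1000/441 per mille, because the same total is spread over 441 possible pairs.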

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.
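A short sketch of the unit conversion, using the AlaAla value from the table. The estimate of the absolute count is only approximate and rests on an assumption: that the total number of dipeptides equals the number of amino acids minus the number of proteins (each protein of length n contributing n - 1 overlapping pairs), using the totals reported under the table.

```python
def per_mille_to_fraction(value):
    """Convert a per mille value to a plain fraction."""
    return value * 1e-3


ala_ala_fraction = per_mille_to_fraction(8.779)   # AlaAla from the table
ala_ala_percent = ala_ala_fraction * 100.0

# Assumption: total pairs = residues - proteins (overlapping pairs per protein).
total_pairs = 1665140 - 4974
ala_ala_count_estimate = ala_ala_fraction * total_pairs
```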

For more information see sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 8.779 ± 0.105
AlaCys: 1.034 ± 0.03
AlaAsp: 4.957 ± 0.059
AlaGlu: 6.004 ± 0.082
AlaPhe: 3.627 ± 0.052
AlaGly: 6.763 ± 0.071
AlaHis: 1.653 ± 0.03
AlaIle: 6.043 ± 0.069
AlaLys: 4.651 ± 0.062
AlaLeu: 9.953 ± 0.095
AlaMet: 3.172 ± 0.053
AlaAsn: 3.545 ± 0.048
AlaPro: 3.16 ± 0.049
AlaGln: 3.705 ± 0.05
AlaArg: 3.939 ± 0.057
AlaSer: 5.587 ± 0.061
AlaThr: 4.525 ± 0.051
AlaVal: 6.458 ± 0.071
AlaTrp: 1.022 ± 0.025
AlaTyr: 2.489 ± 0.035
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.811 ± 0.022
CysCys: 0.199 ± 0.012
CysAsp: 0.654 ± 0.021
CysGlu: 0.658 ± 0.022
CysPhe: 0.459 ± 0.018
CysGly: 0.952 ± 0.025
CysHis: 0.45 ± 0.022
CysIle: 0.588 ± 0.016
CysLys: 0.424 ± 0.016
CysLeu: 1.02 ± 0.027
CysMet: 0.267 ± 0.012
CysAsn: 0.381 ± 0.015
CysPro: 0.473 ± 0.018
CysGln: 0.601 ± 0.021
CysArg: 0.587 ± 0.02
CysSer: 0.779 ± 0.023
CysThr: 0.48 ± 0.018
CysVal: 0.658 ± 0.021
CysTrp: 0.16 ± 0.01
CysTyr: 0.41 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.4 ± 0.061
AspCys: 0.606 ± 0.017
AspAsp: 3.024 ± 0.048
AspGlu: 3.746 ± 0.056
AspPhe: 2.461 ± 0.042
AspGly: 3.997 ± 0.062
AspHis: 1.248 ± 0.029
AspIle: 3.98 ± 0.051
AspLys: 3.081 ± 0.046
AspLeu: 4.865 ± 0.053
AspMet: 1.485 ± 0.028
AspAsn: 2.586 ± 0.04
AspPro: 2.186 ± 0.037
AspGln: 1.993 ± 0.036
AspArg: 2.403 ± 0.039
AspSer: 3.278 ± 0.053
AspThr: 2.706 ± 0.043
AspVal: 3.794 ± 0.048
AspTrp: 0.878 ± 0.028
AspTyr: 2.077 ± 0.04
AspXaa: 0.0 ± 0.0
Glu
GluAla: 5.587 ± 0.07
GluCys: 0.55 ± 0.016
GluAsp: 2.761 ± 0.04
GluGlu: 3.692 ± 0.055
GluPhe: 2.197 ± 0.034
GluGly: 3.768 ± 0.055
GluHis: 1.62 ± 0.035
GluIle: 3.495 ± 0.046
GluLys: 3.426 ± 0.053
GluLeu: 6.977 ± 0.086
GluMet: 1.804 ± 0.034
GluAsn: 2.357 ± 0.043
GluPro: 2.275 ± 0.038
GluGln: 4.394 ± 0.059
GluArg: 3.518 ± 0.059
GluSer: 3.331 ± 0.054
GluThr: 2.973 ± 0.044
GluVal: 4.244 ± 0.059
GluTrp: 0.829 ± 0.024
GluTyr: 1.791 ± 0.037
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.726 ± 0.056
PheCys: 0.545 ± 0.02
PheAsp: 2.809 ± 0.045
PheGlu: 2.412 ± 0.036
PhePhe: 1.775 ± 0.036
PheGly: 3.298 ± 0.053
PheHis: 0.836 ± 0.022
PheIle: 2.719 ± 0.05
PheLys: 1.761 ± 0.031
PheLeu: 3.336 ± 0.049
PheMet: 1.092 ± 0.026
PheAsn: 1.959 ± 0.039
PhePro: 1.412 ± 0.028
PheGln: 1.166 ± 0.024
PheArg: 1.656 ± 0.034
PheSer: 3.262 ± 0.048
PheThr: 2.325 ± 0.041
PheVal: 2.81 ± 0.05
PheTrp: 0.543 ± 0.018
PheTyr: 1.313 ± 0.033
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 5.542 ± 0.068
GlyCys: 1.02 ± 0.025
GlyAsp: 3.725 ± 0.054
GlyGlu: 4.487 ± 0.054
GlyPhe: 3.374 ± 0.047
GlyGly: 5.141 ± 0.068
GlyHis: 1.712 ± 0.039
GlyIle: 5.01 ± 0.067
GlyLys: 3.925 ± 0.054
GlyLeu: 7.3 ± 0.077
GlyMet: 2.299 ± 0.044
GlyAsn: 2.776 ± 0.043
GlyPro: 1.817 ± 0.037
GlyGln: 3.121 ± 0.039
GlyArg: 3.307 ± 0.048
GlySer: 4.197 ± 0.048
GlyThr: 3.554 ± 0.046
GlyVal: 5.281 ± 0.062
GlyTrp: 1.143 ± 0.029
GlyTyr: 2.85 ± 0.045
GlyXaa: 0.0 ± 0.0
His
HisAla: 1.73 ± 0.035
HisCys: 0.351 ± 0.015
HisAsp: 1.217 ± 0.031
HisGlu: 1.129 ± 0.03
HisPhe: 1.122 ± 0.028
HisGly: 1.686 ± 0.033
HisHis: 0.832 ± 0.029
HisIle: 1.388 ± 0.033
HisLys: 0.954 ± 0.022
HisLeu: 2.236 ± 0.042
HisMet: 0.548 ± 0.017
HisAsn: 0.945 ± 0.026
HisPro: 1.204 ± 0.029
HisGln: 1.334 ± 0.035
HisArg: 1.229 ± 0.03
HisSer: 1.471 ± 0.034
HisThr: 1.118 ± 0.026
HisVal: 1.274 ± 0.026
HisTrp: 0.387 ± 0.016
HisTyr: 0.93 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.456 ± 0.07
IleCys: 0.714 ± 0.023
IleAsp: 4.004 ± 0.047
IleGlu: 4.287 ± 0.053
IlePhe: 2.166 ± 0.046
IleGly: 4.789 ± 0.061
IleHis: 1.313 ± 0.031
IleIle: 3.398 ± 0.055
IleLys: 2.916 ± 0.046
IleLeu: 4.919 ± 0.065
IleMet: 1.419 ± 0.03
IleAsn: 2.856 ± 0.04
IlePro: 2.72 ± 0.039
IleGln: 2.034 ± 0.038
IleArg: 2.838 ± 0.039
IleSer: 4.278 ± 0.052
IleThr: 3.642 ± 0.055
IleVal: 3.952 ± 0.049
IleTrp: 0.631 ± 0.02
IleTyr: 1.683 ± 0.03
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.71 ± 0.06
LysCys: 0.315 ± 0.014
LysAsp: 2.49 ± 0.042
LysGlu: 2.993 ± 0.044
LysPhe: 1.378 ± 0.031
LysGly: 3.213 ± 0.047
LysHis: 1.201 ± 0.027
LysIle: 2.584 ± 0.043
LysLys: 2.52 ± 0.042
LysLeu: 5.015 ± 0.054
LysMet: 1.376 ± 0.033
LysAsn: 1.79 ± 0.035
LysPro: 2.388 ± 0.036
LysGln: 2.669 ± 0.044
LysArg: 2.5 ± 0.041
LysSer: 2.658 ± 0.041
LysThr: 2.575 ± 0.042
LysVal: 3.542 ± 0.049
LysTrp: 0.554 ± 0.017
LysTyr: 1.385 ± 0.032
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 10.764 ± 0.085
LeuCys: 1.142 ± 0.029
LeuAsp: 5.769 ± 0.069
LeuGlu: 6.024 ± 0.067
LeuPhe: 4.213 ± 0.065
LeuGly: 7.359 ± 0.081
LeuHis: 2.033 ± 0.042
LeuIle: 5.912 ± 0.086
LeuLys: 4.873 ± 0.049
LeuLeu: 10.688 ± 0.125
LeuMet: 2.796 ± 0.049
LeuAsn: 4.161 ± 0.047
LeuPro: 5.058 ± 0.064
LeuGln: 3.641 ± 0.055
LeuArg: 4.552 ± 0.06
LeuSer: 7.707 ± 0.078
LeuThr: 5.924 ± 0.066
LeuVal: 7.111 ± 0.078
LeuTrp: 1.116 ± 0.029
LeuTyr: 2.641 ± 0.046
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.079 ± 0.049
MetCys: 0.219 ± 0.013
MetAsp: 1.366 ± 0.029
MetGlu: 1.348 ± 0.028
MetPhe: 1.008 ± 0.026
MetGly: 1.987 ± 0.037
MetHis: 0.519 ± 0.017
MetIle: 1.653 ± 0.039
MetLys: 1.489 ± 0.032
MetLeu: 2.99 ± 0.047
MetMet: 0.943 ± 0.024
MetAsn: 1.097 ± 0.025
MetPro: 1.372 ± 0.032
MetGln: 1.089 ± 0.025
MetArg: 1.216 ± 0.025
MetSer: 1.883 ± 0.03
MetThr: 1.778 ± 0.036
MetVal: 2.137 ± 0.041
MetTrp: 0.259 ± 0.015
MetTyr: 0.621 ± 0.021
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.289 ± 0.048
AsnCys: 0.392 ± 0.016
AsnAsp: 2.225 ± 0.043
AsnGlu: 2.234 ± 0.039
AsnPhe: 1.527 ± 0.036
AsnGly: 3.024 ± 0.043
AsnHis: 1.001 ± 0.028
AsnIle: 2.671 ± 0.044
AsnLys: 2.0 ± 0.036
AsnLeu: 3.687 ± 0.05
AsnMet: 1.016 ± 0.024
AsnAsn: 1.848 ± 0.042
AsnPro: 2.113 ± 0.044
AsnGln: 2.075 ± 0.038
AsnArg: 2.036 ± 0.039
AsnSer: 2.26 ± 0.038
AsnThr: 2.183 ± 0.038
AsnVal: 2.513 ± 0.042
AsnTrp: 0.634 ± 0.021
AsnTyr: 1.366 ± 0.034
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 3.78 ± 0.054
ProCys: 0.384 ± 0.018
ProAsp: 2.49 ± 0.044
ProGlu: 3.427 ± 0.044
ProPhe: 1.802 ± 0.036
ProGly: 2.591 ± 0.043
ProHis: 0.918 ± 0.028
ProIle: 2.284 ± 0.037
ProLys: 1.795 ± 0.034
ProLeu: 4.427 ± 0.061
ProMet: 1.173 ± 0.028
ProAsn: 1.608 ± 0.035
ProPro: 1.334 ± 0.031
ProGln: 1.864 ± 0.034
ProArg: 1.523 ± 0.031
ProSer: 2.558 ± 0.041
ProThr: 2.192 ± 0.037
ProVal: 3.521 ± 0.046
ProTrp: 0.538 ± 0.015
ProTyr: 1.343 ± 0.031
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.696 ± 0.059
GlnCys: 0.455 ± 0.018
GlnAsp: 2.052 ± 0.031
GlnGlu: 2.373 ± 0.04
GlnPhe: 1.779 ± 0.032
GlnGly: 3.072 ± 0.046
GlnHis: 1.24 ± 0.029
GlnIle: 2.445 ± 0.038
GlnLys: 1.725 ± 0.034
GlnLeu: 5.576 ± 0.078
GlnMet: 1.14 ± 0.027
GlnAsn: 1.416 ± 0.026
GlnPro: 2.127 ± 0.039
GlnGln: 3.427 ± 0.071
GlnArg: 2.534 ± 0.041
GlnSer: 2.7 ± 0.043
GlnThr: 2.137 ± 0.039
GlnVal: 3.119 ± 0.044
GlnTrp: 0.701 ± 0.021
GlnTyr: 1.501 ± 0.028
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.568 ± 0.057
ArgCys: 0.547 ± 0.018
ArgAsp: 2.551 ± 0.045
ArgGlu: 3.158 ± 0.056
ArgPhe: 2.229 ± 0.033
ArgGly: 2.809 ± 0.048
ArgHis: 1.3 ± 0.031
ArgIle: 2.875 ± 0.04
ArgLys: 2.274 ± 0.037
ArgLeu: 5.21 ± 0.064
ArgMet: 1.244 ± 0.024
ArgAsn: 1.911 ± 0.031
ArgPro: 1.796 ± 0.032
ArgGln: 2.702 ± 0.043
ArgArg: 2.617 ± 0.055
ArgSer: 2.627 ± 0.038
ArgThr: 2.151 ± 0.04
ArgVal: 3.14 ± 0.046
ArgTrp: 0.716 ± 0.018
ArgTyr: 1.871 ± 0.034
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.293 ± 0.055
SerCys: 0.691 ± 0.024
SerAsp: 3.415 ± 0.049
SerGlu: 3.826 ± 0.055
SerPhe: 2.708 ± 0.05
SerGly: 4.994 ± 0.056
SerHis: 1.574 ± 0.031
SerIle: 3.678 ± 0.053
SerLys: 2.787 ± 0.038
SerLeu: 6.97 ± 0.07
SerMet: 1.709 ± 0.032
SerAsn: 2.448 ± 0.042
SerPro: 2.648 ± 0.044
SerGln: 3.072 ± 0.052
SerArg: 3.022 ± 0.048
SerSer: 4.162 ± 0.067
SerThr: 3.149 ± 0.042
SerVal: 4.434 ± 0.054
SerTrp: 0.849 ± 0.023
SerTyr: 2.129 ± 0.041
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 4.803 ± 0.058
ThrCys: 0.495 ± 0.016
ThrAsp: 2.805 ± 0.036
ThrGlu: 3.056 ± 0.039
ThrPhe: 2.104 ± 0.037
ThrGly: 4.18 ± 0.062
ThrHis: 1.161 ± 0.025
ThrIle: 3.229 ± 0.044
ThrLys: 2.095 ± 0.036
ThrLeu: 6.112 ± 0.066
ThrMet: 1.313 ± 0.026
ThrAsn: 1.919 ± 0.041
ThrPro: 2.669 ± 0.042
ThrGln: 2.172 ± 0.036
ThrArg: 2.262 ± 0.041
ThrSer: 3.136 ± 0.048
ThrThr: 2.87 ± 0.046
ThrVal: 4.065 ± 0.049
ThrTrp: 0.563 ± 0.018
ThrTyr: 1.455 ± 0.035
ThrXaa: 0.0 ± 0.0
Val
ValAla: 6.776 ± 0.077
ValCys: 0.781 ± 0.021
ValAsp: 4.126 ± 0.053
ValGlu: 4.649 ± 0.063
ValPhe: 2.76 ± 0.045
ValGly: 4.883 ± 0.06
ValHis: 1.239 ± 0.03
ValIle: 4.758 ± 0.058
ValLys: 3.412 ± 0.05
ValLeu: 6.882 ± 0.067
ValMet: 2.2 ± 0.038
ValAsn: 2.851 ± 0.05
ValPro: 2.799 ± 0.04
ValGln: 2.08 ± 0.035
ValArg: 2.937 ± 0.046
ValSer: 4.833 ± 0.063
ValThr: 4.138 ± 0.053
ValVal: 5.607 ± 0.068
ValTrp: 0.769 ± 0.021
ValTyr: 1.84 ± 0.038
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.907 ± 0.028
TrpCys: 0.153 ± 0.01
TrpAsp: 0.636 ± 0.02
TrpGlu: 0.587 ± 0.022
TrpPhe: 0.609 ± 0.023
TrpGly: 0.83 ± 0.024
TrpHis: 0.389 ± 0.016
TrpIle: 0.651 ± 0.019
TrpLys: 0.512 ± 0.018
TrpLeu: 1.888 ± 0.039
TrpMet: 0.35 ± 0.013
TrpAsn: 0.474 ± 0.017
TrpPro: 0.511 ± 0.018
TrpGln: 1.034 ± 0.028
TrpArg: 0.719 ± 0.019
TrpSer: 0.734 ± 0.023
TrpThr: 0.527 ± 0.019
TrpVal: 0.869 ± 0.022
TrpTrp: 0.24 ± 0.013
TrpTyr: 0.416 ± 0.016
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.367 ± 0.037
TyrCys: 0.419 ± 0.015
TyrAsp: 1.715 ± 0.041
TyrGlu: 1.501 ± 0.029
TyrPhe: 1.458 ± 0.033
TyrGly: 2.248 ± 0.043
TyrHis: 0.846 ± 0.025
TyrIle: 1.654 ± 0.03
TyrLys: 1.237 ± 0.033
TyrLeu: 3.438 ± 0.051
TyrMet: 0.705 ± 0.019
TyrAsn: 1.167 ± 0.032
TyrPro: 1.489 ± 0.031
TyrGln: 2.043 ± 0.041
TyrArg: 1.902 ± 0.036
TyrSer: 2.115 ± 0.042
TyrThr: 1.516 ± 0.031
TyrVal: 1.791 ± 0.034
TyrTrp: 0.502 ± 0.018
TyrTyr: 1.071 ± 0.034
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4974 proteins (1665140 amino acids)

Note: The error was estimated by bootstrapping (100×) at the protein level.
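A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the spread of the resampled values is taken as the error. This is an illustration under stated assumptions, not the database's actual code; in particular, reporting the standard deviation across replicates, the function names, the fixed seed and the toy sequences are all assumptions.

```python
import random
from collections import Counter
from statistics import stdev


def pair_counts(seq):
    """Count overlapping residue pairs in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(proteins, pair, replicates=100, seed=0):
    """Spread (standard deviation, in per mille) of one dipeptide's frequency
    over resamples drawn at the protein level."""
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        # Resample whole proteins with replacement, keeping the proteome size.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return stdev(estimates)


# Hypothetical toy sequences (one-letter codes), not taken from this proteome.
toy_proteome = ["MAAL", "MAL", "AAAA", "MKL"]
aa_error = bootstrap_error(toy_proteome, "AA")
```

Resampling proteins rather than individual residues keeps each sequence intact, so correlations between neighbouring positions within a protein are preserved in every replicate.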

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski