Amino acid dipeptide frequency for Martelella mediterranea

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
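As a rough illustration of what such a calculation might look like, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It assumes one-letter sequence codes, treats any non-standard letter as X (Xaa), and never lets a pair span two proteins; the function name dipeptide_per_mille is hypothetical and this is not necessarily how the values on this page were produced.

```python
from collections import Counter

STANDARD = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard amino acids, one-letter codes


def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa).
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping windows (i, i+1); pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input: "AAC" contributes AA and AC, "AC" contributes AC.
freqs = dipeptide_per_mille(["AAC", "AC"])
```

With this toy input, AC accounts for two of the three counted pairs and AA for one, and the frequencies sum to 1000‰.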

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
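The arithmetic behind this expectation can be sketched as follows. The helper names are hypothetical, and the simple comparison against the uniform expectation is an assumption about how "more than expected" is decided; the two example values are taken from the table on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per mille frequency if all ordered pairs were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"


expected_20 = uniform_expectation_per_mille(20)  # 1000 / 400 = 2.5
expected_21 = uniform_expectation_per_mille(21)  # 1000 / 441, with Xaa included

ala_ala = classify(13.61, expected_20)  # AlaAla from the table
cys_cys = classify(0.137, expected_20)  # CysCys from the table
```

Under this assumption AlaAla (13.61‰) falls above the 2.5‰ expectation and CysCys (0.137‰) falls below it.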

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions; for example, 13.61‰ corresponds to a fraction of 0.01361 (1.361%).

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies (‰, value ± error), grouped by first residue; second residue in the order:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 13.61 ± 0.124
AlaCys: 0.99 ± 0.027
AlaAsp: 6.622 ± 0.07
AlaGlu: 7.682 ± 0.095
AlaPhe: 4.756 ± 0.068
AlaGly: 9.48 ± 0.095
AlaHis: 2.026 ± 0.043
AlaIle: 6.47 ± 0.064
AlaLys: 3.972 ± 0.061
AlaLeu: 12.241 ± 0.116
AlaMet: 3.446 ± 0.054
AlaAsn: 2.957 ± 0.051
AlaPro: 4.367 ± 0.068
AlaGln: 3.232 ± 0.057
AlaArg: 7.266 ± 0.098
AlaSer: 6.311 ± 0.08
AlaThr: 5.193 ± 0.062
AlaVal: 7.995 ± 0.095
AlaTrp: 1.256 ± 0.033
AlaTyr: 2.606 ± 0.045
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.868 ± 0.026
CysCys: 0.137 ± 0.011
CysAsp: 0.547 ± 0.024
CysGlu: 0.479 ± 0.023
CysPhe: 0.353 ± 0.015
CysGly: 0.936 ± 0.032
CysHis: 0.239 ± 0.013
CysIle: 0.389 ± 0.014
CysLys: 0.232 ± 0.011
CysLeu: 0.8 ± 0.028
CysMet: 0.17 ± 0.011
CysAsn: 0.228 ± 0.014
CysPro: 0.42 ± 0.019
CysGln: 0.231 ± 0.013
CysArg: 0.548 ± 0.021
CysSer: 0.491 ± 0.021
CysThr: 0.384 ± 0.02
CysVal: 0.583 ± 0.022
CysTrp: 0.11 ± 0.009
CysTyr: 0.206 ± 0.013
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.447 ± 0.084
AspCys: 0.514 ± 0.02
AspAsp: 3.786 ± 0.064
AspGlu: 4.017 ± 0.067
AspPhe: 2.529 ± 0.045
AspGly: 5.347 ± 0.067
AspHis: 1.36 ± 0.035
AspIle: 3.702 ± 0.056
AspLys: 2.026 ± 0.04
AspLeu: 5.692 ± 0.061
AspMet: 1.705 ± 0.04
AspAsn: 1.762 ± 0.037
AspPro: 3.223 ± 0.055
AspGln: 1.957 ± 0.035
AspArg: 3.982 ± 0.063
AspSer: 2.285 ± 0.037
AspThr: 2.871 ± 0.049
AspVal: 4.082 ± 0.065
AspTrp: 0.952 ± 0.026
AspTyr: 1.701 ± 0.037
AspXaa: 0.0 ± 0.0
Glu
GluAla: 7.718 ± 0.091
GluCys: 0.4 ± 0.019
GluAsp: 3.531 ± 0.061
GluGlu: 3.798 ± 0.067
GluPhe: 1.801 ± 0.034
GluGly: 4.467 ± 0.062
GluHis: 1.282 ± 0.037
GluIle: 3.969 ± 0.055
GluLys: 3.277 ± 0.06
GluLeu: 5.091 ± 0.072
GluMet: 1.751 ± 0.035
GluAsn: 2.649 ± 0.045
GluPro: 2.723 ± 0.054
GluGln: 2.22 ± 0.044
GluArg: 4.833 ± 0.073
GluSer: 2.641 ± 0.046
GluThr: 4.246 ± 0.067
GluVal: 3.724 ± 0.053
GluTrp: 0.761 ± 0.025
GluTyr: 1.195 ± 0.032
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.288 ± 0.064
PheCys: 0.413 ± 0.016
PheAsp: 2.893 ± 0.051
PheGlu: 2.52 ± 0.052
PhePhe: 1.74 ± 0.043
PheGly: 3.746 ± 0.061
PheHis: 0.76 ± 0.024
PheIle: 2.182 ± 0.046
PheLys: 1.233 ± 0.031
PheLeu: 3.67 ± 0.06
PheMet: 0.99 ± 0.029
PheAsn: 1.324 ± 0.035
PhePro: 1.69 ± 0.036
PheGln: 1.171 ± 0.031
PheArg: 2.232 ± 0.043
PheSer: 3.066 ± 0.046
PheThr: 2.06 ± 0.046
PheVal: 2.844 ± 0.051
PheTrp: 0.596 ± 0.024
PheTyr: 1.095 ± 0.033
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 7.927 ± 0.086
GlyCys: 0.803 ± 0.027
GlyAsp: 4.481 ± 0.071
GlyGlu: 4.983 ± 0.067
GlyPhe: 3.728 ± 0.05
GlyGly: 6.757 ± 0.148
GlyHis: 1.89 ± 0.038
GlyIle: 4.834 ± 0.066
GlyLys: 3.708 ± 0.061
GlyLeu: 8.55 ± 0.097
GlyMet: 2.363 ± 0.043
GlyAsn: 2.474 ± 0.049
GlyPro: 3.114 ± 0.043
GlyGln: 2.736 ± 0.05
GlyArg: 5.332 ± 0.071
GlySer: 4.578 ± 0.071
GlyThr: 4.262 ± 0.068
GlyVal: 5.771 ± 0.073
GlyTrp: 1.285 ± 0.033
GlyTyr: 2.359 ± 0.046
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.088 ± 0.042
HisCys: 0.245 ± 0.013
HisAsp: 1.329 ± 0.035
HisGlu: 1.194 ± 0.032
HisPhe: 0.99 ± 0.026
HisGly: 1.743 ± 0.04
HisHis: 0.548 ± 0.027
HisIle: 1.091 ± 0.03
HisLys: 0.627 ± 0.022
HisLeu: 1.886 ± 0.039
HisMet: 0.569 ± 0.021
HisAsn: 0.567 ± 0.019
HisPro: 1.211 ± 0.036
HisGln: 0.644 ± 0.026
HisArg: 1.276 ± 0.032
HisSer: 1.027 ± 0.03
HisThr: 0.837 ± 0.029
HisVal: 1.376 ± 0.03
HisTrp: 0.28 ± 0.016
HisTyr: 0.589 ± 0.02
HisXaa: 0.0 ± 0.0
Ile
IleAla: 7.509 ± 0.071
IleCys: 0.572 ± 0.022
IleAsp: 3.994 ± 0.061
IleGlu: 3.85 ± 0.053
IlePhe: 2.113 ± 0.05
IleGly: 5.218 ± 0.071
IleHis: 0.922 ± 0.026
IleIle: 2.939 ± 0.06
IleLys: 1.696 ± 0.036
IleLeu: 4.885 ± 0.074
IleMet: 1.303 ± 0.035
IleAsn: 1.749 ± 0.041
IlePro: 2.447 ± 0.045
IleGln: 1.25 ± 0.032
IleArg: 3.349 ± 0.052
IleSer: 3.718 ± 0.058
IleThr: 2.976 ± 0.057
IleVal: 4.366 ± 0.062
IleTrp: 0.694 ± 0.025
IleTyr: 1.35 ± 0.034
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.917 ± 0.07
LysCys: 0.202 ± 0.014
LysAsp: 2.031 ± 0.046
LysGlu: 1.955 ± 0.046
LysPhe: 0.964 ± 0.029
LysGly: 2.913 ± 0.057
LysHis: 0.702 ± 0.021
LysIle: 2.161 ± 0.045
LysLys: 1.776 ± 0.046
LysLeu: 3.644 ± 0.058
LysMet: 0.964 ± 0.027
LysAsn: 1.334 ± 0.033
LysPro: 2.328 ± 0.049
LysGln: 1.296 ± 0.03
LysArg: 2.786 ± 0.055
LysSer: 2.309 ± 0.047
LysThr: 2.618 ± 0.051
LysVal: 2.633 ± 0.048
LysTrp: 0.43 ± 0.02
LysTyr: 0.705 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 11.354 ± 0.111
LeuCys: 0.809 ± 0.025
LeuAsp: 5.649 ± 0.069
LeuGlu: 5.694 ± 0.078
LeuPhe: 3.925 ± 0.064
LeuGly: 7.259 ± 0.091
LeuHis: 1.679 ± 0.041
LeuIle: 5.385 ± 0.07
LeuLys: 4.522 ± 0.056
LeuLeu: 8.605 ± 0.106
LeuMet: 2.578 ± 0.047
LeuAsn: 3.022 ± 0.053
LeuPro: 5.117 ± 0.071
LeuGln: 2.82 ± 0.049
LeuArg: 5.554 ± 0.077
LeuSer: 7.545 ± 0.093
LeuThr: 5.608 ± 0.077
LeuVal: 6.584 ± 0.073
LeuTrp: 1.011 ± 0.033
LeuTyr: 2.273 ± 0.039
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.199 ± 0.06
MetCys: 0.157 ± 0.011
MetAsp: 1.304 ± 0.033
MetGlu: 1.304 ± 0.036
MetPhe: 0.809 ± 0.025
MetGly: 1.806 ± 0.037
MetHis: 0.485 ± 0.019
MetIle: 1.708 ± 0.037
MetLys: 1.257 ± 0.032
MetLeu: 2.62 ± 0.049
MetMet: 0.834 ± 0.03
MetAsn: 1.035 ± 0.027
MetPro: 1.538 ± 0.038
MetGln: 0.944 ± 0.028
MetArg: 1.922 ± 0.039
MetSer: 1.873 ± 0.039
MetThr: 2.028 ± 0.036
MetVal: 1.877 ± 0.037
MetTrp: 0.219 ± 0.013
MetTyr: 0.346 ± 0.017
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 3.621 ± 0.055
AsnCys: 0.245 ± 0.014
AsnAsp: 1.848 ± 0.04
AsnGlu: 1.668 ± 0.042
AsnPhe: 1.195 ± 0.035
AsnGly: 2.962 ± 0.053
AsnHis: 0.625 ± 0.021
AsnIle: 1.734 ± 0.039
AsnLys: 0.943 ± 0.029
AsnLeu: 2.785 ± 0.043
AsnMet: 0.851 ± 0.024
AsnAsn: 0.907 ± 0.029
AsnPro: 2.002 ± 0.036
AsnGln: 1.009 ± 0.032
AsnArg: 2.127 ± 0.044
AsnSer: 1.423 ± 0.039
AsnThr: 1.532 ± 0.037
AsnVal: 2.12 ± 0.041
AsnTrp: 0.53 ± 0.022
AsnTyr: 0.8 ± 0.026
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 5.025 ± 0.068
ProCys: 0.298 ± 0.015
ProAsp: 3.699 ± 0.056
ProGlu: 4.248 ± 0.065
ProPhe: 1.989 ± 0.044
ProGly: 3.918 ± 0.057
ProHis: 1.01 ± 0.03
ProIle: 2.11 ± 0.037
ProLys: 1.711 ± 0.038
ProLeu: 4.405 ± 0.066
ProMet: 1.132 ± 0.032
ProAsn: 1.315 ± 0.031
ProPro: 1.996 ± 0.047
ProGln: 1.591 ± 0.039
ProArg: 2.288 ± 0.041
ProSer: 2.685 ± 0.051
ProThr: 2.077 ± 0.045
ProVal: 4.226 ± 0.056
ProTrp: 0.599 ± 0.023
ProTyr: 1.289 ± 0.037
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 3.714 ± 0.062
GlnCys: 0.197 ± 0.011
GlnAsp: 1.511 ± 0.038
GlnGlu: 1.59 ± 0.038
GlnPhe: 1.202 ± 0.032
GlnGly: 2.038 ± 0.043
GlnHis: 0.627 ± 0.021
GlnIle: 1.961 ± 0.037
GlnLys: 1.521 ± 0.039
GlnLeu: 2.885 ± 0.051
GlnMet: 0.986 ± 0.027
GlnAsn: 1.123 ± 0.033
GlnPro: 1.643 ± 0.036
GlnGln: 1.267 ± 0.035
GlnArg: 2.181 ± 0.042
GlnSer: 2.014 ± 0.042
GlnThr: 1.926 ± 0.039
GlnVal: 2.056 ± 0.039
GlnTrp: 0.402 ± 0.018
GlnTyr: 0.72 ± 0.024
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 6.477 ± 0.086
ArgCys: 0.468 ± 0.021
ArgAsp: 3.747 ± 0.056
ArgGlu: 3.971 ± 0.058
ArgPhe: 2.977 ± 0.055
ArgGly: 4.051 ± 0.064
ArgHis: 1.559 ± 0.038
ArgIle: 3.907 ± 0.058
ArgLys: 2.724 ± 0.045
ArgLeu: 6.921 ± 0.09
ArgMet: 1.826 ± 0.039
ArgAsn: 2.004 ± 0.037
ArgPro: 2.95 ± 0.054
ArgGln: 2.65 ± 0.049
ArgArg: 4.657 ± 0.07
ArgSer: 3.665 ± 0.061
ArgThr: 2.964 ± 0.049
ArgVal: 3.944 ± 0.061
ArgTrp: 0.837 ± 0.029
ArgTyr: 1.817 ± 0.042
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 6.678 ± 0.078
SerCys: 0.421 ± 0.018
SerAsp: 3.713 ± 0.053
SerGlu: 3.843 ± 0.065
SerPhe: 2.6 ± 0.048
SerGly: 6.239 ± 0.091
SerHis: 1.204 ± 0.033
SerIle: 3.1 ± 0.056
SerLys: 2.005 ± 0.037
SerLeu: 5.672 ± 0.076
SerMet: 1.536 ± 0.036
SerAsn: 1.577 ± 0.037
SerPro: 2.557 ± 0.048
SerGln: 1.806 ± 0.042
SerArg: 3.575 ± 0.058
SerSer: 3.318 ± 0.065
SerThr: 2.739 ± 0.047
SerVal: 4.532 ± 0.061
SerTrp: 0.771 ± 0.023
SerTyr: 1.428 ± 0.032
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.729 ± 0.084
ThrCys: 0.423 ± 0.02
ThrAsp: 2.966 ± 0.051
ThrGlu: 2.853 ± 0.051
ThrPhe: 2.158 ± 0.037
ThrGly: 5.144 ± 0.07
ThrHis: 1.007 ± 0.029
ThrIle: 3.161 ± 0.047
ThrLys: 1.663 ± 0.04
ThrLeu: 5.735 ± 0.079
ThrMet: 1.348 ± 0.03
ThrAsn: 1.462 ± 0.035
ThrPro: 3.013 ± 0.048
ThrGln: 1.373 ± 0.035
ThrArg: 3.063 ± 0.051
ThrSer: 3.058 ± 0.057
ThrThr: 2.783 ± 0.059
ThrVal: 4.405 ± 0.067
ThrTrp: 0.604 ± 0.023
ThrTyr: 1.326 ± 0.034
ThrXaa: 0.0 ± 0.0
Val
ValAla: 7.754 ± 0.083
ValCys: 0.652 ± 0.025
ValAsp: 4.053 ± 0.061
ValGlu: 4.46 ± 0.067
ValPhe: 3.071 ± 0.051
ValGly: 4.905 ± 0.06
ValHis: 1.33 ± 0.035
ValIle: 4.277 ± 0.068
ValLys: 2.502 ± 0.051
ValLeu: 6.948 ± 0.097
ValMet: 2.008 ± 0.048
ValAsn: 2.16 ± 0.042
ValPro: 3.372 ± 0.052
ValGln: 1.877 ± 0.036
ValArg: 4.28 ± 0.048
ValSer: 5.125 ± 0.065
ValThr: 4.227 ± 0.061
ValVal: 5.215 ± 0.081
ValTrp: 0.863 ± 0.023
ValTyr: 1.629 ± 0.035
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.071 ± 0.03
TrpCys: 0.119 ± 0.009
TrpAsp: 0.613 ± 0.023
TrpGlu: 0.556 ± 0.021
TrpPhe: 0.566 ± 0.022
TrpGly: 0.784 ± 0.026
TrpHis: 0.329 ± 0.018
TrpIle: 0.688 ± 0.025
TrpLys: 0.546 ± 0.021
TrpLeu: 1.562 ± 0.035
TrpMet: 0.376 ± 0.018
TrpAsn: 0.499 ± 0.022
TrpPro: 0.674 ± 0.026
TrpGln: 0.622 ± 0.023
TrpArg: 1.014 ± 0.026
TrpSer: 0.84 ± 0.025
TrpThr: 0.688 ± 0.026
TrpVal: 0.729 ± 0.024
TrpTrp: 0.235 ± 0.014
TrpTyr: 0.3 ± 0.015
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.483 ± 0.046
TyrCys: 0.286 ± 0.017
TyrAsp: 1.622 ± 0.036
TyrGlu: 1.412 ± 0.032
TyrPhe: 1.049 ± 0.029
TyrGly: 2.223 ± 0.044
TyrHis: 0.532 ± 0.02
TyrIle: 1.132 ± 0.032
TyrLys: 0.725 ± 0.023
TyrLeu: 2.396 ± 0.042
TyrMet: 0.529 ± 0.02
TyrAsn: 0.777 ± 0.029
TyrPro: 1.226 ± 0.03
TyrGln: 0.869 ± 0.028
TyrArg: 1.851 ± 0.039
TyrSer: 1.346 ± 0.034
TyrThr: 1.256 ± 0.037
TyrVal: 1.647 ± 0.033
TyrTrp: 0.375 ± 0.019
TyrTyr: 0.694 ± 0.025
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 4157 proteins (1267225 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
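One way such a protein-level bootstrap could be set up is sketched below. Only the number of replicates (100) and the resampling unit (whole proteins) come from the note above; reporting the standard deviation of the replicate estimates as the error, the overlapping-pair counting, the fixed seed and the name bootstrap_error are all assumptions made for illustration.

```python
import random
import statistics


def bootstrap_error(proteins, dipeptide, n_rounds=100, seed=0):
    """Resample whole proteins with replacement and return (mean, std dev) in per mille."""
    rng = random.Random(seed)  # fixed seed only so the sketch is reproducible
    # Per protein: (occurrences of the dipeptide, number of overlapping pairs).
    per_protein = []
    for seq in proteins:
        pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
        per_protein.append((pairs.count(dipeptide), len(pairs)))
    estimates = []
    for _ in range(n_rounds):
        # Proteins, not individual residues, are the resampling unit.
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy proteome in one-letter codes; real input would be the full set of proteins.
mean_aa, err_aa = bootstrap_error(["AAAC", "ACAA", "CCAA", "AACA"], "AA")
```

Resampling whole proteins rather than individual dipeptides keeps pairs from the same sequence together, so the error reflects variation between proteins.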

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski