Amino acid dipeptide frequency for Rhodobacter blasticus DSM 2131

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
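As a rough illustration of what such a calculation can look like, the sketch below counts overlapping pairs of adjacent residues and normalises them to per mille. It is only a minimal sketch: the function name dipeptide_permille, the one-letter sequence encoding, the toy sequences, and the choice to normalise by the total number of dipeptides are assumptions, not the procedure documented for this database.

```python
from collections import Counter

def dipeptide_permille(sequences):
    """Return overlapping dipeptide frequencies in per mille.

    Assumption: sequences are one-letter amino acid strings, and
    frequencies are normalised by the total number of dipeptides.
    """
    counts = Counter()
    for seq in sequences:
        # Overlapping windows of length 2; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Made-up toy sequences, not taken from the proteome.
toy = ["AAGL", "AALA"]
freqs = dipeptide_permille(toy)
for pair in sorted(freqs):
    print(pair, round(freqs[pair], 3))
```

Counting per protein, rather than over one concatenated string, avoids creating artificial dipeptides at protein boundaries.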

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly and uniformly in proteins, each amino acid dipeptide would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
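The arithmetic behind the uniform baseline and the per mille unit can be checked with a short sketch. The helper names classify and permille_to_fraction are hypothetical, and comparing directly against the uniform baseline is an assumption about how the colouring is decided; the two example values are taken from the table.

```python
N_STANDARD = 20  # standard amino acids, giving 20 * 20 = 400 dipeptides

# Uniform expectation: 1/400 = 0.25 % = 2.5 per mille.
uniform_permille = 1000.0 / N_STANDARD ** 2

def permille_to_fraction(value_permille):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value_permille * 1e-3

def classify(observed_permille, expected_permille=uniform_permille):
    """Label a dipeptide relative to the expected frequency.

    Assumption: a simple comparison with the uniform baseline.
    """
    if observed_permille > expected_permille:
        return "over"
    if observed_permille < expected_permille:
        return "under"
    return "as expected"

# Two values from the table, in per mille.
examples = {"AlaAla": 21.363, "CysCys": 0.093}
for name, value in examples.items():
    print(name, permille_to_fraction(value), classify(value))
```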

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 21.363 ± 0.265
AlaCys: 1.188 ± 0.037
AlaAsp: 7.405 ± 0.102
AlaGlu: 8.957 ± 0.108
AlaPhe: 4.484 ± 0.072
AlaGly: 12.448 ± 0.136
AlaHis: 2.474 ± 0.051
AlaIle: 5.463 ± 0.082
AlaLys: 3.633 ± 0.073
AlaLeu: 16.238 ± 0.185
AlaMet: 3.951 ± 0.055
AlaAsn: 2.512 ± 0.05
AlaPro: 7.037 ± 0.117
AlaGln: 4.435 ± 0.072
AlaArg: 10.851 ± 0.112
AlaSer: 5.244 ± 0.081
AlaThr: 6.508 ± 0.085
AlaVal: 9.547 ± 0.121
AlaTrp: 1.568 ± 0.037
AlaTyr: 2.538 ± 0.058
AlaXaa: 0.001 ± 0.001
Cys
CysAla: 0.997 ± 0.034
CysCys: 0.093 ± 0.01
CysAsp: 0.596 ± 0.024
CysGlu: 0.367 ± 0.019
CysPhe: 0.324 ± 0.019
CysGly: 0.98 ± 0.03
CysHis: 0.253 ± 0.015
CysIle: 0.379 ± 0.017
CysLys: 0.199 ± 0.014
CysLeu: 0.904 ± 0.032
CysMet: 0.142 ± 0.012
CysAsn: 0.191 ± 0.015
CysPro: 0.562 ± 0.025
CysGln: 0.19 ± 0.012
CysArg: 0.611 ± 0.026
CysSer: 0.398 ± 0.02
CysThr: 0.452 ± 0.023
CysVal: 0.577 ± 0.021
CysTrp: 0.118 ± 0.011
CysTyr: 0.198 ± 0.015
CysXaa: 0.0 ± 0.0
Asp
AspAla: 6.778 ± 0.082
AspCys: 0.506 ± 0.022
AspAsp: 2.911 ± 0.075
AspGlu: 3.054 ± 0.054
AspPhe: 2.158 ± 0.048
AspGly: 5.265 ± 0.084
AspHis: 1.331 ± 0.04
AspIle: 2.526 ± 0.053
AspLys: 1.425 ± 0.041
AspLeu: 6.823 ± 0.084
AspMet: 1.431 ± 0.037
AspAsn: 1.009 ± 0.037
AspPro: 3.888 ± 0.06
AspGln: 1.822 ± 0.046
AspArg: 4.812 ± 0.065
AspSer: 2.124 ± 0.056
AspThr: 2.565 ± 0.064
AspVal: 3.702 ± 0.065
AspTrp: 1.341 ± 0.038
AspTyr: 1.449 ± 0.039
AspXaa: 0.0 ± 0.0
Glu
GluAla: 8.471 ± 0.118
GluCys: 0.281 ± 0.017
GluAsp: 2.959 ± 0.058
GluGlu: 2.904 ± 0.065
GluPhe: 1.624 ± 0.04
GluGly: 4.968 ± 0.071
GluHis: 0.896 ± 0.03
GluIle: 3.132 ± 0.063
GluLys: 1.87 ± 0.048
GluLeu: 4.404 ± 0.059
GluMet: 1.59 ± 0.036
GluAsn: 1.296 ± 0.038
GluPro: 2.502 ± 0.051
GluGln: 1.549 ± 0.044
GluArg: 3.727 ± 0.069
GluSer: 1.915 ± 0.039
GluThr: 3.632 ± 0.06
GluVal: 4.401 ± 0.067
GluTrp: 0.622 ± 0.026
GluTyr: 0.89 ± 0.03
GluXaa: 0.0 ± 0.0
Phe
PheAla: 4.691 ± 0.075
PheCys: 0.391 ± 0.019
PheAsp: 2.566 ± 0.048
PheGlu: 1.734 ± 0.042
PhePhe: 1.228 ± 0.039
PheGly: 3.712 ± 0.065
PheHis: 0.761 ± 0.026
PheIle: 1.405 ± 0.037
PheLys: 0.805 ± 0.029
PheLeu: 3.683 ± 0.066
PheMet: 0.669 ± 0.026
PheAsn: 0.872 ± 0.026
PhePro: 1.659 ± 0.042
PheGln: 0.936 ± 0.029
PheArg: 2.481 ± 0.049
PheSer: 1.87 ± 0.044
PheThr: 1.985 ± 0.04
PheVal: 2.512 ± 0.048
PheTrp: 0.588 ± 0.026
PheTyr: 0.829 ± 0.026
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 10.948 ± 0.115
GlyCys: 0.921 ± 0.031
GlyAsp: 4.572 ± 0.091
GlyGlu: 4.461 ± 0.068
GlyPhe: 3.733 ± 0.066
GlyGly: 8.044 ± 0.117
GlyHis: 1.979 ± 0.044
GlyIle: 4.31 ± 0.073
GlyLys: 3.086 ± 0.063
GlyLeu: 10.356 ± 0.134
GlyMet: 2.631 ± 0.051
GlyAsn: 2.069 ± 0.078
GlyPro: 4.449 ± 0.068
GlyGln: 3.317 ± 0.059
GlyArg: 6.649 ± 0.075
GlySer: 4.073 ± 0.064
GlyThr: 4.91 ± 0.068
GlyVal: 6.469 ± 0.086
GlyTrp: 1.718 ± 0.042
GlyTyr: 2.208 ± 0.051
GlyXaa: 0.0 ± 0.0
His
HisAla: 2.317 ± 0.052
HisCys: 0.225 ± 0.016
HisAsp: 1.273 ± 0.038
HisGlu: 0.943 ± 0.031
HisPhe: 0.754 ± 0.025
HisGly: 1.956 ± 0.044
HisHis: 0.521 ± 0.024
HisIle: 0.821 ± 0.029
HisLys: 0.407 ± 0.019
HisLeu: 2.272 ± 0.051
HisMet: 0.5 ± 0.024
HisAsn: 0.389 ± 0.018
HisPro: 1.449 ± 0.039
HisGln: 0.529 ± 0.021
HisArg: 1.481 ± 0.036
HisSer: 0.856 ± 0.029
HisThr: 0.735 ± 0.026
HisVal: 1.516 ± 0.037
HisTrp: 0.364 ± 0.018
HisTyr: 0.499 ± 0.024
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.716 ± 0.09
IleCys: 0.519 ± 0.023
IleAsp: 3.079 ± 0.058
IleGlu: 2.908 ± 0.057
IlePhe: 1.48 ± 0.042
IleGly: 4.531 ± 0.074
IleHis: 0.846 ± 0.029
IleIle: 1.725 ± 0.046
IleLys: 1.108 ± 0.035
IleLeu: 4.636 ± 0.082
IleMet: 0.852 ± 0.028
IleAsn: 1.094 ± 0.033
IlePro: 2.26 ± 0.047
IleGln: 0.946 ± 0.032
IleArg: 3.267 ± 0.054
IleSer: 2.467 ± 0.048
IleThr: 2.689 ± 0.049
IleVal: 3.383 ± 0.068
IleTrp: 0.658 ± 0.024
IleTyr: 0.977 ± 0.034
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.212 ± 0.082
LysCys: 0.151 ± 0.012
LysAsp: 1.603 ± 0.042
LysGlu: 1.302 ± 0.043
LysPhe: 0.735 ± 0.028
LysGly: 2.823 ± 0.066
LysHis: 0.465 ± 0.021
LysIle: 1.355 ± 0.037
LysLys: 1.065 ± 0.042
LysLeu: 2.669 ± 0.058
LysMet: 0.761 ± 0.03
LysAsn: 0.628 ± 0.03
LysPro: 1.806 ± 0.05
LysGln: 0.724 ± 0.025
LysArg: 1.825 ± 0.044
LysSer: 1.413 ± 0.039
LysThr: 1.702 ± 0.047
LysVal: 2.325 ± 0.055
LysTrp: 0.345 ± 0.019
LysTyr: 0.524 ± 0.025
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 15.983 ± 0.195
LeuCys: 0.976 ± 0.034
LeuAsp: 5.897 ± 0.072
LeuGlu: 4.872 ± 0.075
LeuPhe: 3.514 ± 0.066
LeuGly: 8.808 ± 0.119
LeuHis: 2.058 ± 0.047
LeuIle: 5.073 ± 0.082
LeuLys: 3.163 ± 0.066
LeuLeu: 9.587 ± 0.154
LeuMet: 2.679 ± 0.056
LeuAsn: 2.415 ± 0.044
LeuPro: 6.719 ± 0.088
LeuGln: 2.715 ± 0.058
LeuArg: 7.989 ± 0.102
LeuSer: 6.269 ± 0.089
LeuThr: 6.56 ± 0.09
LeuVal: 7.615 ± 0.094
LeuTrp: 1.554 ± 0.048
LeuTyr: 1.98 ± 0.047
LeuXaa: 0.0 ± 0.0
Met
MetAla: 3.756 ± 0.064
MetCys: 0.152 ± 0.013
MetAsp: 1.309 ± 0.037
MetGlu: 1.148 ± 0.036
MetPhe: 0.758 ± 0.026
MetGly: 2.262 ± 0.049
MetHis: 0.384 ± 0.02
MetIle: 1.344 ± 0.035
MetLys: 0.952 ± 0.031
MetLeu: 2.476 ± 0.051
MetMet: 0.684 ± 0.026
MetAsn: 0.724 ± 0.029
MetPro: 1.518 ± 0.04
MetGln: 0.944 ± 0.028
MetArg: 1.803 ± 0.04
MetSer: 1.337 ± 0.033
MetThr: 2.128 ± 0.046
MetVal: 1.962 ± 0.042
MetTrp: 0.215 ± 0.013
MetTyr: 0.314 ± 0.015
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.731 ± 0.053
AsnCys: 0.188 ± 0.013
AsnAsp: 1.343 ± 0.077
AsnGlu: 0.905 ± 0.03
AsnPhe: 0.822 ± 0.026
AsnGly: 2.145 ± 0.056
AsnHis: 0.452 ± 0.023
AsnIle: 1.057 ± 0.031
AsnLys: 0.505 ± 0.026
AsnLeu: 2.355 ± 0.049
AsnMet: 0.542 ± 0.022
AsnAsn: 0.491 ± 0.022
AsnPro: 1.819 ± 0.042
AsnGln: 0.512 ± 0.022
AsnArg: 1.651 ± 0.039
AsnSer: 0.941 ± 0.03
AsnThr: 1.103 ± 0.034
AsnVal: 1.621 ± 0.044
AsnTrp: 0.364 ± 0.02
AsnTyr: 0.5 ± 0.021
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 8.012 ± 0.106
ProCys: 0.409 ± 0.018
ProAsp: 4.131 ± 0.068
ProGlu: 4.256 ± 0.073
ProPhe: 2.027 ± 0.041
ProGly: 5.601 ± 0.077
ProHis: 1.095 ± 0.034
ProIle: 1.862 ± 0.048
ProLys: 1.556 ± 0.042
ProLeu: 5.314 ± 0.09
ProMet: 1.406 ± 0.036
ProAsn: 1.121 ± 0.037
ProPro: 3.065 ± 0.069
ProGln: 1.938 ± 0.046
ProArg: 3.372 ± 0.062
ProSer: 2.284 ± 0.048
ProThr: 2.34 ± 0.051
ProVal: 5.055 ± 0.067
ProTrp: 0.824 ± 0.026
ProTyr: 1.079 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 4.545 ± 0.074
GlnCys: 0.182 ± 0.014
GlnAsp: 1.6 ± 0.043
GlnGlu: 1.306 ± 0.04
GlnPhe: 0.924 ± 0.03
GlnGly: 2.901 ± 0.053
GlnHis: 0.541 ± 0.025
GlnIle: 1.789 ± 0.041
GlnLys: 0.922 ± 0.029
GlnLeu: 2.479 ± 0.054
GlnMet: 1.004 ± 0.031
GlnAsn: 0.735 ± 0.027
GlnPro: 1.712 ± 0.04
GlnGln: 0.805 ± 0.03
GlnArg: 2.057 ± 0.044
GlnSer: 1.398 ± 0.035
GlnThr: 1.666 ± 0.039
GlnVal: 2.523 ± 0.043
GlnTrp: 0.353 ± 0.017
GlnTyr: 0.46 ± 0.02
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 9.727 ± 0.105
ArgCys: 0.517 ± 0.023
ArgAsp: 4.135 ± 0.061
ArgGlu: 3.483 ± 0.061
ArgPhe: 2.756 ± 0.058
ArgGly: 5.155 ± 0.071
ArgHis: 1.641 ± 0.04
ArgIle: 3.925 ± 0.057
ArgLys: 2.211 ± 0.05
ArgLeu: 8.86 ± 0.117
ArgMet: 2.116 ± 0.049
ArgAsn: 1.577 ± 0.047
ArgPro: 4.185 ± 0.07
ArgGln: 2.364 ± 0.044
ArgArg: 5.712 ± 0.09
ArgSer: 3.249 ± 0.058
ArgThr: 2.882 ± 0.059
ArgVal: 5.104 ± 0.085
ArgTrp: 1.066 ± 0.031
ArgTyr: 1.391 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 5.603 ± 0.083
SerCys: 0.387 ± 0.018
SerAsp: 2.625 ± 0.048
SerGlu: 2.225 ± 0.049
SerPhe: 2.011 ± 0.045
SerGly: 5.107 ± 0.072
SerHis: 0.95 ± 0.031
SerIle: 2.0 ± 0.042
SerLys: 1.166 ± 0.04
SerLeu: 4.912 ± 0.063
SerMet: 1.094 ± 0.029
SerAsn: 1.098 ± 0.035
SerPro: 2.588 ± 0.052
SerGln: 1.28 ± 0.034
SerArg: 3.186 ± 0.054
SerSer: 2.123 ± 0.051
SerThr: 2.239 ± 0.057
SerVal: 3.492 ± 0.06
SerTrp: 0.616 ± 0.025
SerTyr: 1.146 ± 0.028
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 6.889 ± 0.087
ThrCys: 0.484 ± 0.022
ThrAsp: 2.942 ± 0.053
ThrGlu: 2.831 ± 0.057
ThrPhe: 1.947 ± 0.049
ThrGly: 5.736 ± 0.078
ThrHis: 1.075 ± 0.033
ThrIle: 2.417 ± 0.046
ThrLys: 1.212 ± 0.038
ThrLeu: 6.241 ± 0.086
ThrMet: 1.159 ± 0.032
ThrAsn: 1.107 ± 0.032
ThrPro: 3.641 ± 0.061
ThrGln: 1.314 ± 0.034
ThrArg: 3.573 ± 0.061
ThrSer: 2.239 ± 0.046
ThrThr: 2.847 ± 0.064
ThrVal: 4.306 ± 0.059
ThrTrp: 0.696 ± 0.029
ThrTyr: 1.104 ± 0.036
ThrXaa: 0.0 ± 0.0
Val
ValAla: 9.951 ± 0.123
ValCys: 0.622 ± 0.025
ValAsp: 3.781 ± 0.053
ValGlu: 4.307 ± 0.07
ValPhe: 2.809 ± 0.047
ValGly: 5.303 ± 0.079
ValHis: 1.268 ± 0.038
ValIle: 4.095 ± 0.069
ValLys: 2.187 ± 0.057
ValLeu: 8.214 ± 0.1
ValMet: 2.15 ± 0.051
ValAsn: 1.842 ± 0.047
ValPro: 4.03 ± 0.061
ValGln: 2.291 ± 0.041
ValArg: 4.397 ± 0.06
ValSer: 3.885 ± 0.071
ValThr: 4.972 ± 0.074
ValVal: 5.915 ± 0.089
ValTrp: 1.118 ± 0.032
ValTyr: 1.376 ± 0.034
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 1.681 ± 0.037
TrpCys: 0.14 ± 0.011
TrpAsp: 0.774 ± 0.028
TrpGlu: 0.612 ± 0.024
TrpPhe: 0.543 ± 0.021
TrpGly: 1.134 ± 0.034
TrpHis: 0.346 ± 0.018
TrpIle: 0.638 ± 0.025
TrpLys: 0.454 ± 0.022
TrpLeu: 1.843 ± 0.04
TrpMet: 0.394 ± 0.019
TrpAsn: 0.373 ± 0.017
TrpPro: 0.822 ± 0.029
TrpGln: 0.662 ± 0.027
TrpArg: 1.169 ± 0.038
TrpSer: 0.75 ± 0.027
TrpThr: 0.811 ± 0.027
TrpVal: 1.025 ± 0.033
TrpTrp: 0.265 ± 0.016
TrpTyr: 0.293 ± 0.018
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.473 ± 0.046
TyrCys: 0.203 ± 0.014
TyrAsp: 1.422 ± 0.035
TyrGlu: 1.058 ± 0.034
TyrPhe: 0.749 ± 0.03
TyrGly: 2.021 ± 0.049
TyrHis: 0.469 ± 0.021
TyrIle: 0.827 ± 0.028
TyrLys: 0.505 ± 0.021
TyrLeu: 2.16 ± 0.04
TyrMet: 0.417 ± 0.022
TyrAsn: 0.505 ± 0.025
TyrPro: 1.032 ± 0.028
TyrGln: 0.629 ± 0.025
TyrArg: 1.444 ± 0.039
TyrSer: 1.01 ± 0.032
TyrThr: 1.034 ± 0.033
TyrVal: 1.462 ± 0.041
TyrTrp: 0.335 ± 0.018
TyrTyr: 0.461 ± 0.023
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.001 ± 0.001
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0
Statistics based on 3451 proteins (1074197 amino acids)

Note: The error has been estimated by bootstrapping (100 replicates) at the protein level.
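A protein-level bootstrap of this kind could look roughly like the sketch below. The note only states that bootstrapping was done 100 times at the protein level; resampling whole proteins with replacement, reporting the standard deviation of the replicates as the error, and the names pair_counts and bootstrap_error are all assumptions made for illustration.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Estimate mean and spread (per mille) of one dipeptide frequency.

    Assumption: each replicate resamples whole proteins with replacement,
    and the error is the standard deviation across replicates.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)  # Counter gives 0 if absent
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Made-up toy proteome, not taken from the database.
toy = ["AAGL", "AALA", "GLAA", "LLGA"]
mean, err = bootstrap_error(toy, "AA")
print(f"AA: {mean:.3f} ± {err:.3f}")
```

Resampling proteins rather than individual dipeptides keeps pairs from the same protein together, so the estimated error reflects variation between proteins.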

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski