Amino acid dipeptide frequency for Flavobacterium sp. Leaf359

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
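A minimal sketch of how such a frequency could be computed from a set of protein sequences. It assumes one-letter amino acid codes, counts overlapping adjacent pairs within each protein, and maps every non-standard letter to X (Xaa); the function name and the toy sequences are hypothetical, and the database's actual counting procedure is not described on this page.

```python
from collections import Counter

# The 20 standard amino acids in one-letter code
STANDARD = set("ACDEFGHIKLMNPQRSTVWY")

def dipeptide_per_mille(proteins):
    """Return {dipeptide: frequency in per mille} over all adjacent pairs."""
    counts = Counter()
    for seq in proteins:
        # Assumption: anything outside the 20 standard residues becomes X (Xaa)
        seq = "".join(aa if aa in STANDARD else "X" for aa in seq.upper())
        # Overlapping adjacent pairs; pairs never span two proteins
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {dp: 1000.0 * n / total for dp, n in counts.items()}

# Toy input (hypothetical sequences, not taken from the proteome)
freqs = dipeptide_per_mille(["MKKLLA", "MKLU"])
```

With the toy input, eight pairs are counted in total, so a dipeptide seen twice (such as MK) gets 250‰.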

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, each of the 400 would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
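The comparison against the uniform expectation can be sketched as follows. This assumes that over- and underrepresentation are judged simply against 1/400 = 2.5‰; the exact rule used for the colouring is not stated on this page, and the function name is hypothetical. The three observed values are taken from the table.

```python
# Uniform expectation over the 400 standard dipeptides, in per mille
EXPECTED_PER_MILLE = 1000.0 / 400  # 2.5 per mille, i.e. 0.25%

def classify(freq_per_mille, expected=EXPECTED_PER_MILLE):
    """Label a dipeptide frequency relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"
    if freq_per_mille < expected:
        return "under"
    return "as expected"

# Observed values copied from the table (per mille)
observed = {"LeuLeu": 9.193, "ProVal": 2.582, "TrpCys": 0.085}

labels = {dp: classify(v) for dp, v in observed.items()}
# Per mille to plain fraction: multiply by 10^-3
fractions = {dp: v * 1e-3 for dp, v in observed.items()}
```

Here LeuLeu and ProVal come out as overrepresented and TrpCys as underrepresented. A uniform baseline ignores that the single amino acids themselves differ in abundance, so a product of single-residue frequencies would be a stricter reference.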

All values are given in per mille (‰) and therefore need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.544 ± 0.1
AlaCys: 0.544 ± 0.023
AlaAsp: 3.368 ± 0.074
AlaGlu: 4.194 ± 0.074
AlaPhe: 3.425 ± 0.063
AlaGly: 4.472 ± 0.09
AlaHis: 1.004 ± 0.028
AlaIle: 5.527 ± 0.078
AlaLys: 5.026 ± 0.099
AlaLeu: 6.348 ± 0.088
AlaMet: 1.532 ± 0.045
AlaAsn: 3.532 ± 0.08
AlaPro: 2.025 ± 0.053
AlaGln: 2.496 ± 0.048
AlaArg: 2.04 ± 0.052
AlaSer: 4.398 ± 0.077
AlaThr: 4.224 ± 0.118
AlaVal: 4.454 ± 0.073
AlaTrp: 0.6 ± 0.024
AlaTyr: 2.558 ± 0.051
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.461 ± 0.03
CysCys: 0.101 ± 0.01
CysAsp: 0.415 ± 0.022
CysGlu: 0.451 ± 0.026
CysPhe: 0.386 ± 0.018
CysGly: 0.65 ± 0.033
CysHis: 0.192 ± 0.014
CysIle: 0.56 ± 0.024
CysLys: 0.43 ± 0.022
CysLeu: 0.658 ± 0.026
CysMet: 0.121 ± 0.01
CysAsn: 0.429 ± 0.023
CysPro: 0.311 ± 0.02
CysGln: 0.266 ± 0.021
CysArg: 0.223 ± 0.015
CysSer: 0.58 ± 0.028
CysThr: 0.446 ± 0.028
CysVal: 0.41 ± 0.02
CysTrp: 0.061 ± 0.008
CysTyr: 0.295 ± 0.018
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.536 ± 0.062
AspCys: 0.42 ± 0.021
AspAsp: 2.671 ± 0.059
AspGlu: 3.563 ± 0.063
AspPhe: 3.699 ± 0.064
AspGly: 3.589 ± 0.071
AspHis: 0.744 ± 0.026
AspIle: 4.248 ± 0.064
AspLys: 4.228 ± 0.08
AspLeu: 4.761 ± 0.07
AspMet: 1.097 ± 0.033
AspAsn: 3.179 ± 0.06
AspPro: 1.559 ± 0.046
AspGln: 1.268 ± 0.036
AspArg: 1.884 ± 0.042
AspSer: 3.162 ± 0.065
AspThr: 2.675 ± 0.055
AspVal: 3.284 ± 0.054
AspTrp: 0.69 ± 0.025
AspTyr: 2.82 ± 0.056
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.487 ± 0.077
GluCys: 0.323 ± 0.02
GluAsp: 3.138 ± 0.052
GluGlu: 4.888 ± 0.093
GluPhe: 2.942 ± 0.061
GluGly: 3.469 ± 0.062
GluHis: 1.023 ± 0.029
GluIle: 5.795 ± 0.085
GluLys: 6.773 ± 0.102
GluLeu: 5.84 ± 0.088
GluMet: 1.681 ± 0.036
GluAsn: 4.949 ± 0.073
GluPro: 1.554 ± 0.043
GluGln: 2.24 ± 0.055
GluArg: 2.484 ± 0.054
GluSer: 3.359 ± 0.06
GluThr: 3.603 ± 0.062
GluVal: 3.97 ± 0.072
GluTrp: 0.574 ± 0.025
GluTyr: 2.486 ± 0.052
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.326 ± 0.065
PheCys: 0.497 ± 0.028
PheAsp: 3.235 ± 0.06
PheGlu: 3.432 ± 0.064
PhePhe: 2.982 ± 0.056
PheGly: 3.692 ± 0.064
PheHis: 0.836 ± 0.029
PheIle: 3.88 ± 0.082
PheLys: 3.479 ± 0.064
PheLeu: 4.961 ± 0.095
PheMet: 1.114 ± 0.033
PheAsn: 3.114 ± 0.055
PhePro: 1.798 ± 0.045
PheGln: 1.68 ± 0.045
PheArg: 1.798 ± 0.039
PheSer: 4.232 ± 0.074
PheThr: 3.241 ± 0.068
PheVal: 3.049 ± 0.054
PheTrp: 0.558 ± 0.02
PheTyr: 2.291 ± 0.047
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.078 ± 0.071
GlyCys: 0.611 ± 0.03
GlyAsp: 3.1 ± 0.051
GlyGlu: 3.409 ± 0.057
GlyPhe: 3.632 ± 0.064
GlyGly: 4.38 ± 0.107
GlyHis: 1.111 ± 0.033
GlyIle: 5.583 ± 0.087
GlyLys: 5.332 ± 0.084
GlyLeu: 5.588 ± 0.073
GlyMet: 1.558 ± 0.039
GlyAsn: 4.068 ± 0.076
GlyPro: 1.32 ± 0.042
GlyGln: 2.063 ± 0.053
GlyArg: 2.076 ± 0.046
GlySer: 4.065 ± 0.079
GlyThr: 4.513 ± 0.132
GlyVal: 3.849 ± 0.068
GlyTrp: 0.698 ± 0.031
GlyTyr: 2.918 ± 0.05
GlyXaa: 0.001 ± 0.001
His
HisAla: 0.892 ± 0.028
HisCys: 0.16 ± 0.013
HisAsp: 0.837 ± 0.031
HisGlu: 1.002 ± 0.035
HisPhe: 1.145 ± 0.04
HisGly: 0.99 ± 0.03
HisHis: 0.411 ± 0.021
HisIle: 1.326 ± 0.035
HisLys: 1.158 ± 0.033
HisLeu: 1.609 ± 0.036
HisMet: 0.298 ± 0.016
HisAsn: 0.956 ± 0.027
HisPro: 0.771 ± 0.031
HisGln: 0.648 ± 0.027
HisArg: 0.618 ± 0.026
HisSer: 1.022 ± 0.03
HisThr: 0.928 ± 0.029
HisVal: 0.797 ± 0.031
HisTrp: 0.163 ± 0.011
HisTyr: 0.804 ± 0.025
HisXaa: 0.0 ± 0.0
Ile
IleAla: 5.861 ± 0.092
IleCys: 0.636 ± 0.03
IleAsp: 4.566 ± 0.068
IleGlu: 5.317 ± 0.081
IlePhe: 3.589 ± 0.066
IleGly: 4.996 ± 0.077
IleHis: 1.316 ± 0.036
IleIle: 6.026 ± 0.098
IleLys: 5.732 ± 0.09
IleLeu: 6.924 ± 0.098
IleMet: 1.395 ± 0.038
IleAsn: 4.573 ± 0.069
IlePro: 3.2 ± 0.061
IleGln: 2.838 ± 0.056
IleArg: 2.821 ± 0.052
IleSer: 5.685 ± 0.073
IleThr: 4.901 ± 0.101
IleVal: 4.948 ± 0.075
IleTrp: 0.716 ± 0.025
IleTyr: 2.919 ± 0.051
IleXaa: 0.0 ± 0.0
Lys
LysAla: 5.368 ± 0.1
LysCys: 0.329 ± 0.019
LysAsp: 4.379 ± 0.079
LysGlu: 6.316 ± 0.102
LysPhe: 3.1 ± 0.059
LysGly: 4.462 ± 0.074
LysHis: 1.218 ± 0.033
LysIle: 6.918 ± 0.103
LysLys: 7.458 ± 0.108
LysLeu: 6.603 ± 0.101
LysMet: 2.093 ± 0.049
LysAsn: 5.704 ± 0.086
LysPro: 2.546 ± 0.054
LysGln: 2.705 ± 0.053
LysArg: 2.598 ± 0.059
LysSer: 4.572 ± 0.074
LysThr: 4.92 ± 0.071
LysVal: 4.463 ± 0.066
LysTrp: 0.762 ± 0.029
LysTyr: 3.013 ± 0.053
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 5.987 ± 0.084
LeuCys: 0.648 ± 0.023
LeuAsp: 4.828 ± 0.071
LeuGlu: 5.984 ± 0.084
LeuPhe: 5.143 ± 0.087
LeuGly: 5.577 ± 0.087
LeuHis: 1.563 ± 0.04
LeuIle: 6.351 ± 0.104
LeuLys: 7.631 ± 0.111
LeuLeu: 9.193 ± 0.134
LeuMet: 2.094 ± 0.052
LeuAsn: 5.302 ± 0.083
LeuPro: 3.574 ± 0.058
LeuGln: 3.299 ± 0.064
LeuArg: 3.048 ± 0.063
LeuSer: 6.696 ± 0.087
LeuThr: 5.109 ± 0.087
LeuVal: 5.128 ± 0.064
LeuTrp: 0.772 ± 0.027
LeuTyr: 3.309 ± 0.059
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.683 ± 0.046
MetCys: 0.109 ± 0.009
MetAsp: 1.136 ± 0.034
MetGlu: 1.501 ± 0.04
MetPhe: 0.842 ± 0.028
MetGly: 1.352 ± 0.038
MetHis: 0.385 ± 0.018
MetIle: 1.577 ± 0.042
MetLys: 2.368 ± 0.048
MetLeu: 1.989 ± 0.046
MetMet: 0.619 ± 0.027
MetAsn: 1.37 ± 0.039
MetPro: 0.869 ± 0.033
MetGln: 0.895 ± 0.027
MetArg: 0.819 ± 0.028
MetSer: 1.383 ± 0.036
MetThr: 1.221 ± 0.036
MetVal: 1.269 ± 0.034
MetTrp: 0.151 ± 0.013
MetTyr: 0.664 ± 0.026
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.071 ± 0.065
AsnCys: 0.489 ± 0.037
AsnAsp: 3.2 ± 0.057
AsnGlu: 3.669 ± 0.056
AsnPhe: 3.227 ± 0.061
AsnGly: 4.423 ± 0.104
AsnHis: 1.067 ± 0.03
AsnIle: 4.638 ± 0.068
AsnLys: 4.162 ± 0.071
AsnLeu: 5.402 ± 0.08
AsnMet: 1.342 ± 0.035
AsnAsn: 3.867 ± 0.081
AsnPro: 3.133 ± 0.063
AsnGln: 2.186 ± 0.053
AsnArg: 2.223 ± 0.049
AsnSer: 4.001 ± 0.078
AsnThr: 3.688 ± 0.083
AsnVal: 3.567 ± 0.069
AsnTrp: 0.741 ± 0.03
AsnTyr: 2.914 ± 0.061
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.358 ± 0.064
ProCys: 0.202 ± 0.014
ProAsp: 2.147 ± 0.044
ProGlu: 2.883 ± 0.058
ProPhe: 1.926 ± 0.047
ProGly: 2.075 ± 0.059
ProHis: 0.52 ± 0.023
ProIle: 2.562 ± 0.047
ProLys: 2.682 ± 0.064
ProLeu: 2.882 ± 0.057
ProMet: 0.728 ± 0.025
ProAsn: 2.243 ± 0.065
ProPro: 0.845 ± 0.03
ProGln: 1.178 ± 0.036
ProArg: 0.842 ± 0.029
ProSer: 2.085 ± 0.052
ProThr: 2.057 ± 0.074
ProVal: 2.582 ± 0.046
ProTrp: 0.286 ± 0.016
ProTyr: 1.449 ± 0.041
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.121 ± 0.046
GlnCys: 0.192 ± 0.019
GlnAsp: 1.624 ± 0.037
GlnGlu: 2.358 ± 0.046
GlnPhe: 1.744 ± 0.043
GlnGly: 1.801 ± 0.043
GlnHis: 0.565 ± 0.026
GlnIle: 2.617 ± 0.054
GlnLys: 3.26 ± 0.066
GlnLeu: 3.364 ± 0.053
GlnMet: 0.849 ± 0.027
GlnAsn: 2.484 ± 0.049
GlnPro: 1.135 ± 0.037
GlnGln: 1.487 ± 0.041
GlnArg: 1.155 ± 0.035
GlnSer: 2.003 ± 0.041
GlnThr: 1.987 ± 0.051
GlnVal: 1.958 ± 0.044
GlnTrp: 0.368 ± 0.02
GlnTyr: 1.428 ± 0.041
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.941 ± 0.046
ArgCys: 0.209 ± 0.014
ArgAsp: 1.647 ± 0.041
ArgGlu: 2.274 ± 0.051
ArgPhe: 1.909 ± 0.041
ArgGly: 1.732 ± 0.042
ArgHis: 0.583 ± 0.025
ArgIle: 3.009 ± 0.052
ArgLys: 2.929 ± 0.062
ArgLeu: 3.323 ± 0.068
ArgMet: 0.939 ± 0.032
ArgAsn: 2.306 ± 0.053
ArgPro: 1.092 ± 0.034
ArgGln: 1.193 ± 0.036
ArgArg: 1.281 ± 0.036
ArgSer: 1.843 ± 0.045
ArgThr: 1.829 ± 0.039
ArgVal: 2.002 ± 0.043
ArgTrp: 0.396 ± 0.018
ArgTyr: 1.506 ± 0.04
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 3.819 ± 0.069
SerCys: 0.697 ± 0.032
SerAsp: 3.447 ± 0.053
SerGlu: 4.034 ± 0.065
SerPhe: 3.807 ± 0.065
SerGly: 5.018 ± 0.101
SerHis: 1.104 ± 0.037
SerIle: 4.91 ± 0.071
SerLys: 4.83 ± 0.078
SerLeu: 6.097 ± 0.086
SerMet: 1.351 ± 0.037
SerAsn: 3.806 ± 0.081
SerPro: 2.22 ± 0.049
SerGln: 2.378 ± 0.053
SerArg: 2.232 ± 0.046
SerSer: 4.287 ± 0.077
SerThr: 3.421 ± 0.082
SerVal: 4.102 ± 0.068
SerTrp: 0.667 ± 0.028
SerTyr: 2.881 ± 0.061
SerXaa: 0.001 ± 0.001
Thr
ThrAla: 4.458 ± 0.132
ThrCys: 0.354 ± 0.022
ThrAsp: 3.197 ± 0.067
ThrGlu: 3.443 ± 0.05
ThrPhe: 3.113 ± 0.051
ThrGly: 4.451 ± 0.108
ThrHis: 0.942 ± 0.034
ThrIle: 5.065 ± 0.09
ThrLys: 3.828 ± 0.056
ThrLeu: 5.281 ± 0.087
ThrMet: 0.936 ± 0.03
ThrAsn: 3.26 ± 0.075
ThrPro: 2.629 ± 0.077
ThrGln: 1.98 ± 0.05
ThrArg: 1.687 ± 0.044
ThrSer: 3.805 ± 0.081
ThrThr: 3.843 ± 0.115
ThrVal: 4.319 ± 0.133
ThrTrp: 0.556 ± 0.028
ThrTyr: 2.397 ± 0.066
ThrXaa: 0.001 ± 0.001
Val
ValAla: 4.215 ± 0.077
ValCys: 0.512 ± 0.023
ValAsp: 3.143 ± 0.051
ValGlu: 3.73 ± 0.068
ValPhe: 3.427 ± 0.064
ValGly: 3.637 ± 0.063
ValHis: 0.948 ± 0.031
ValIle: 4.721 ± 0.077
ValLys: 4.415 ± 0.07
ValLeu: 5.745 ± 0.072
ValMet: 1.365 ± 0.04
ValAsn: 3.552 ± 0.062
ValPro: 2.256 ± 0.042
ValGln: 1.801 ± 0.044
ValArg: 2.119 ± 0.046
ValSer: 4.48 ± 0.069
ValThr: 3.903 ± 0.112
ValVal: 3.945 ± 0.07
ValTrp: 0.594 ± 0.024
ValTyr: 2.43 ± 0.05
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.622 ± 0.027
TrpCys: 0.085 ± 0.009
TrpAsp: 0.546 ± 0.024
TrpGlu: 0.633 ± 0.026
TrpPhe: 0.529 ± 0.023
TrpGly: 0.585 ± 0.025
TrpHis: 0.212 ± 0.015
TrpIle: 0.775 ± 0.027
TrpLys: 0.824 ± 0.03
TrpLeu: 0.913 ± 0.031
TrpMet: 0.281 ± 0.019
TrpAsn: 0.748 ± 0.031
TrpPro: 0.187 ± 0.014
TrpGln: 0.399 ± 0.022
TrpArg: 0.339 ± 0.017
TrpSer: 0.59 ± 0.024
TrpThr: 0.569 ± 0.028
TrpVal: 0.523 ± 0.023
TrpTrp: 0.129 ± 0.011
TrpTyr: 0.417 ± 0.019
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.486 ± 0.045
TyrCys: 0.33 ± 0.019
TyrAsp: 2.451 ± 0.056
TyrGlu: 2.499 ± 0.051
TyrPhe: 2.63 ± 0.055
TyrGly: 2.597 ± 0.053
TyrHis: 0.791 ± 0.031
TyrIle: 2.869 ± 0.062
TyrLys: 2.981 ± 0.055
TyrLeu: 3.755 ± 0.071
TyrMet: 0.806 ± 0.028
TyrAsn: 2.572 ± 0.059
TyrPro: 1.508 ± 0.037
TyrGln: 1.5 ± 0.04
TyrArg: 1.655 ± 0.042
TyrSer: 2.861 ± 0.063
TyrThr: 2.464 ± 0.067
TyrVal: 2.319 ± 0.048
TyrTrp: 0.423 ± 0.02
TyrTyr: 1.994 ± 0.054
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.001 ± 0.001
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.001 ± 0.001
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.001 ± 0.001
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.004 ± 0.003
Statistics based on 3309 proteins (1096483 amino acids)

Note: The error has been estimated by bootstrapping (×100) at the protein level.
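A protein-level bootstrap of this kind can be sketched as below. The page states only that 100 replicates were drawn at the protein level; taking the standard deviation of the replicate frequencies as the reported error is an assumption, as are the function names, the one-letter codes, and the toy sequences.

```python
import random
from collections import Counter
from statistics import stdev

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, dipeptide, n_boot=100, seed=0):
    """Spread (in per mille) of a dipeptide frequency across resampled proteomes."""
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_boot):
        # Resample whole proteins with replacement, not single residues
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    # Assumption: the error is the standard deviation over replicates
    return stdev(estimates)

# Toy input (hypothetical sequences, not taken from the proteome)
err = bootstrap_error(["MKKLLA", "MKLA", "AAKK", "LLLL"], "KK")
```

Resampling whole proteins keeps the pairs within each protein together, so the estimate reflects variation between proteins rather than between individual positions.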

The above dipeptide statistics (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski