Amino acid dipeptide frequency for Bifidobacterium minimum

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent amino acids occurs.
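As a minimal sketch of how such frequencies can be computed, the snippet below counts overlapping pairs of adjacent residues in each protein and reports them in per mille. The function name `dipeptide_frequencies`, the toy sequences, and the use of one-letter codes are illustrative assumptions; the exact counting procedure used by the database is not described on this page.

```python
from collections import Counter


def dipeptide_frequencies(sequences):
    """Return per-mille frequencies of overlapping dipeptides.

    Assumption: pairs are counted with a sliding window of length 2
    inside each protein and never span two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        for i in range(len(seq) - 1):
            counts[seq[i:i + 2]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Hypothetical toy "proteome" in one-letter code (not real data).
toy_proteome = ["MAAL", "MAG"]
freqs = dipeptide_frequencies(toy_proteome)
```

Here the pairs found are MA (twice), AA, AL and AG, so `freqs["MA"]` is 400 per mille of the five dipeptides counted.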

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins, with all 20 standard amino acids equally likely, each dipeptide would be present with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that occur more often than expected are marked in red in the table, and those that are underrepresented are marked in blue.
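The comparison against the uniform expectation can be sketched as follows. This assumes the red/blue marking uses the 1/400 baseline described above; the `classify` helper is hypothetical, and the three observed values are taken from the table for this proteome.

```python
STANDARD_AMINO_ACIDS = 20

# Uniform expectation: 1 / (20 * 20) = 0.25 % = 2.5 per mille.
expected_per_mille = 1000.0 / STANDARD_AMINO_ACIDS ** 2


def classify(observed_per_mille, expected=expected_per_mille):
    """Label a dipeptide relative to the uniform expectation (assumed rule)."""
    if observed_per_mille > expected:
        return "overrepresented"
    if observed_per_mille < expected:
        return "underrepresented"
    return "as expected"


# Observed values (per mille) copied from the table.
observed = {"AlaAla": 11.933, "CysCys": 0.117, "ProPro": 1.545}
labels = {pair: classify(value) for pair, value in observed.items()}
```

With these inputs, AlaAla falls above the 2.5‰ baseline, while CysCys and ProPro fall below it.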

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.
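As a small worked example of the unit conversion, the sketch below turns a per-mille value into a fraction and a percentage. The helper names are illustrative; the input is the AlaAla value from the table.

```python
def per_mille_to_fraction(value_per_mille):
    """Convert a per-mille value to a plain fraction (multiply by 10^-3)."""
    return value_per_mille * 1e-3


def per_mille_to_percent(value_per_mille):
    """Convert a per-mille value to percent (divide by 10)."""
    return value_per_mille / 10.0


ala_ala = 11.933  # per mille, from the table
fraction = per_mille_to_fraction(ala_ala)  # about 0.011933
percent = per_mille_to_percent(ala_ala)    # about 1.1933
```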

For more information, see the sequence space article on Wikipedia.

Rows give the first residue of the dipeptide, columns the second. Each cell is the frequency in ‰ ± the estimated error.

| 1st / 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Ala | 11.933 ± 0.194 | 1.054 ± 0.052 | 7.149 ± 0.124 | 4.94 ± 0.106 | 3.14 ± 0.091 | 9.042 ± 0.144 | 2.174 ± 0.065 | 5.587 ± 0.114 | 3.237 ± 0.081 | 9.974 ± 0.154 | 3.144 ± 0.071 | 2.456 ± 0.072 | 3.897 ± 0.099 | 3.585 ± 0.084 | 7.736 ± 0.137 | 7.776 ± 0.137 | 6.113 ± 0.107 | 9.283 ± 0.158 | 1.305 ± 0.053 | 2.343 ± 0.063 | 0.0 ± 0.001 |
| Cys | 1.041 ± 0.053 | 0.117 ± 0.016 | 0.631 ± 0.037 | 0.412 ± 0.029 | 0.255 ± 0.018 | 1.065 ± 0.05 | 0.233 ± 0.021 | 0.323 ± 0.024 | 0.139 ± 0.017 | 0.79 ± 0.042 | 0.22 ± 0.02 | 0.161 ± 0.016 | 0.513 ± 0.031 | 0.189 ± 0.018 | 0.643 ± 0.034 | 0.645 ± 0.04 | 0.447 ± 0.031 | 0.777 ± 0.042 | 0.104 ± 0.012 | 0.174 ± 0.018 | 0.0 ± 0.0 |
| Asp | 8.21 ± 0.161 | 0.513 ± 0.028 | 6.049 ± 0.126 | 4.696 ± 0.099 | 2.084 ± 0.071 | 7.039 ± 0.14 | 1.743 ± 0.069 | 3.16 ± 0.08 | 1.752 ± 0.071 | 5.552 ± 0.113 | 1.796 ± 0.064 | 1.421 ± 0.057 | 4.194 ± 0.097 | 1.751 ± 0.058 | 4.55 ± 0.122 | 4.471 ± 0.092 | 3.083 ± 0.092 | 6.08 ± 0.109 | 0.874 ± 0.044 | 1.78 ± 0.067 | 0.0 ± 0.0 |
| Glu | 5.292 ± 0.123 | 0.444 ± 0.031 | 3.419 ± 0.1 | 2.799 ± 0.083 | 1.622 ± 0.049 | 4.071 ± 0.095 | 1.301 ± 0.043 | 2.482 ± 0.07 | 1.782 ± 0.071 | 4.462 ± 0.098 | 1.349 ± 0.052 | 1.582 ± 0.059 | 2.344 ± 0.073 | 1.692 ± 0.065 | 4.442 ± 0.11 | 3.888 ± 0.091 | 2.724 ± 0.071 | 3.479 ± 0.101 | 0.609 ± 0.035 | 1.274 ± 0.049 | 0.0 ± 0.0 |
| Phe | 3.36 ± 0.093 | 0.277 ± 0.022 | 2.291 ± 0.065 | 1.556 ± 0.055 | 1.016 ± 0.045 | 2.828 ± 0.076 | 0.753 ± 0.032 | 1.518 ± 0.059 | 0.647 ± 0.039 | 2.779 ± 0.075 | 0.746 ± 0.039 | 0.922 ± 0.037 | 1.411 ± 0.055 | 0.731 ± 0.04 | 1.778 ± 0.05 | 2.286 ± 0.074 | 1.994 ± 0.068 | 2.482 ± 0.068 | 0.348 ± 0.027 | 0.741 ± 0.04 | 0.0 ± 0.0 |
| Gly | 7.598 ± 0.146 | 0.858 ± 0.046 | 5.475 ± 0.121 | 4.238 ± 0.109 | 3.135 ± 0.064 | 6.507 ± 0.13 | 2.06 ± 0.064 | 4.804 ± 0.108 | 3.05 ± 0.091 | 7.448 ± 0.134 | 2.46 ± 0.079 | 2.359 ± 0.064 | 3.113 ± 0.083 | 2.363 ± 0.068 | 6.471 ± 0.117 | 6.705 ± 0.126 | 5.485 ± 0.116 | 7.123 ± 0.116 | 1.224 ± 0.046 | 2.44 ± 0.076 | 0.0 ± 0.0 |
| His | 2.443 ± 0.066 | 0.198 ± 0.019 | 2.051 ± 0.061 | 1.234 ± 0.05 | 0.499 ± 0.029 | 2.251 ± 0.066 | 0.621 ± 0.041 | 1.019 ± 0.05 | 0.445 ± 0.027 | 1.653 ± 0.051 | 0.566 ± 0.029 | 0.563 ± 0.033 | 1.575 ± 0.058 | 0.62 ± 0.034 | 1.879 ± 0.062 | 1.157 ± 0.045 | 1.104 ± 0.042 | 1.971 ± 0.059 | 0.312 ± 0.026 | 0.563 ± 0.033 | 0.0 ± 0.0 |
| Ile | 6.084 ± 0.113 | 0.455 ± 0.028 | 4.135 ± 0.098 | 2.755 ± 0.073 | 1.21 ± 0.051 | 4.396 ± 0.091 | 1.144 ± 0.044 | 2.953 ± 0.091 | 1.206 ± 0.056 | 4.071 ± 0.099 | 1.21 ± 0.045 | 1.411 ± 0.052 | 2.935 ± 0.073 | 1.158 ± 0.047 | 3.556 ± 0.09 | 3.136 ± 0.083 | 3.215 ± 0.081 | 5.006 ± 0.118 | 0.554 ± 0.036 | 0.931 ± 0.045 | 0.0 ± 0.0 |
| Lys | 3.78 ± 0.108 | 0.115 ± 0.013 | 2.033 ± 0.06 | 1.52 ± 0.058 | 0.636 ± 0.043 | 2.399 ± 0.074 | 0.475 ± 0.026 | 1.309 ± 0.065 | 1.146 ± 0.058 | 2.148 ± 0.08 | 0.532 ± 0.029 | 0.948 ± 0.048 | 1.481 ± 0.058 | 0.874 ± 0.05 | 1.837 ± 0.061 | 1.923 ± 0.076 | 1.939 ± 0.063 | 2.132 ± 0.077 | 0.282 ± 0.024 | 0.719 ± 0.038 | 0.0 ± 0.0 |
| Leu | 9.565 ± 0.164 | 0.909 ± 0.045 | 6.333 ± 0.111 | 3.947 ± 0.106 | 2.707 ± 0.071 | 7.035 ± 0.118 | 1.905 ± 0.066 | 4.592 ± 0.128 | 2.381 ± 0.075 | 7.528 ± 0.15 | 2.295 ± 0.07 | 2.224 ± 0.062 | 4.372 ± 0.089 | 1.923 ± 0.064 | 6.451 ± 0.121 | 6.863 ± 0.124 | 5.514 ± 0.112 | 7.054 ± 0.14 | 1.056 ± 0.048 | 1.785 ± 0.069 | 0.002 ± 0.001 |
| Met | 3.146 ± 0.081 | 0.203 ± 0.02 | 1.743 ± 0.057 | 1.215 ± 0.053 | 0.733 ± 0.041 | 2.18 ± 0.064 | 0.515 ± 0.029 | 1.303 ± 0.049 | 0.742 ± 0.043 | 2.332 ± 0.066 | 0.797 ± 0.043 | 0.779 ± 0.036 | 1.461 ± 0.057 | 0.598 ± 0.035 | 2.176 ± 0.061 | 2.22 ± 0.067 | 2.141 ± 0.066 | 2.137 ± 0.069 | 0.288 ± 0.024 | 0.471 ± 0.027 | 0.0 ± 0.0 |
| Asn | 2.781 ± 0.075 | 0.189 ± 0.019 | 1.774 ± 0.067 | 1.268 ± 0.044 | 0.693 ± 0.042 | 2.711 ± 0.088 | 0.57 ± 0.037 | 1.243 ± 0.053 | 0.73 ± 0.046 | 2.136 ± 0.068 | 0.649 ± 0.036 | 0.746 ± 0.051 | 1.938 ± 0.062 | 0.722 ± 0.043 | 1.811 ± 0.053 | 1.516 ± 0.064 | 1.52 ± 0.057 | 2.055 ± 0.061 | 0.315 ± 0.028 | 0.735 ± 0.043 | 0.0 ± 0.0 |
| Pro | 4.594 ± 0.096 | 0.379 ± 0.027 | 3.926 ± 0.091 | 2.99 ± 0.077 | 1.521 ± 0.054 | 3.965 ± 0.094 | 1.169 ± 0.046 | 2.106 ± 0.064 | 1.202 ± 0.054 | 3.901 ± 0.084 | 1.191 ± 0.046 | 1.144 ± 0.044 | 1.545 ± 0.069 | 1.73 ± 0.058 | 3.58 ± 0.104 | 3.824 ± 0.083 | 3.125 ± 0.091 | 4.205 ± 0.092 | 0.719 ± 0.034 | 1.234 ± 0.051 | 0.0 ± 0.0 |
| Gln | 3.105 ± 0.085 | 0.231 ± 0.019 | 1.521 ± 0.048 | 1.567 ± 0.054 | 0.808 ± 0.039 | 2.39 ± 0.076 | 0.546 ± 0.035 | 1.707 ± 0.056 | 0.929 ± 0.049 | 2.531 ± 0.075 | 0.821 ± 0.038 | 0.821 ± 0.046 | 1.129 ± 0.051 | 1.074 ± 0.054 | 2.311 ± 0.075 | 1.938 ± 0.066 | 1.716 ± 0.054 | 2.073 ± 0.06 | 0.539 ± 0.032 | 0.753 ± 0.042 | 0.0 ± 0.0 |
| Arg | 6.322 ± 0.128 | 0.64 ± 0.045 | 4.936 ± 0.12 | 4.106 ± 0.11 | 2.552 ± 0.072 | 5.16 ± 0.111 | 2.141 ± 0.071 | 4.333 ± 0.097 | 2.178 ± 0.064 | 6.713 ± 0.156 | 2.429 ± 0.068 | 1.985 ± 0.062 | 3.49 ± 0.082 | 2.381 ± 0.074 | 7.409 ± 0.187 | 5.127 ± 0.121 | 4.146 ± 0.092 | 5.36 ± 0.101 | 1.094 ± 0.048 | 2.108 ± 0.063 | 0.0 ± 0.0 |
| Ser | 7.481 ± 0.123 | 0.631 ± 0.037 | 4.9 ± 0.108 | 3.025 ± 0.08 | 2.051 ± 0.063 | 6.859 ± 0.129 | 1.622 ± 0.054 | 3.556 ± 0.086 | 1.785 ± 0.055 | 6.249 ± 0.106 | 2.044 ± 0.065 | 1.752 ± 0.069 | 3.338 ± 0.091 | 2.398 ± 0.077 | 5.499 ± 0.114 | 6.856 ± 0.23 | 4.834 ± 0.128 | 5.664 ± 0.112 | 0.972 ± 0.046 | 1.705 ± 0.066 | 0.0 ± 0.0 |
| Thr | 6.419 ± 0.122 | 0.466 ± 0.03 | 3.734 ± 0.091 | 2.489 ± 0.079 | 1.773 ± 0.064 | 5.474 ± 0.116 | 1.283 ± 0.056 | 3.461 ± 0.086 | 1.602 ± 0.058 | 5.21 ± 0.102 | 1.672 ± 0.061 | 1.633 ± 0.075 | 3.521 ± 0.088 | 1.648 ± 0.06 | 3.925 ± 0.1 | 4.236 ± 0.105 | 4.52 ± 0.127 | 5.756 ± 0.112 | 0.753 ± 0.039 | 1.417 ± 0.061 | 0.0 ± 0.0 |
| Val | 9.028 ± 0.155 | 0.841 ± 0.037 | 6.181 ± 0.124 | 4.449 ± 0.1 | 2.751 ± 0.082 | 6.135 ± 0.14 | 1.659 ± 0.058 | 4.449 ± 0.087 | 2.246 ± 0.073 | 7.567 ± 0.147 | 2.247 ± 0.074 | 2.169 ± 0.063 | 4.073 ± 0.085 | 1.855 ± 0.063 | 5.56 ± 0.11 | 6.19 ± 0.103 | 5.186 ± 0.101 | 8.177 ± 0.165 | 1.006 ± 0.045 | 1.73 ± 0.066 | 0.002 ± 0.002 |
| Trp | 1.061 ± 0.047 | 0.134 ± 0.014 | 0.702 ± 0.038 | 0.5 ± 0.037 | 0.493 ± 0.033 | 1.001 ± 0.045 | 0.29 ± 0.026 | 0.693 ± 0.033 | 0.444 ± 0.031 | 1.323 ± 0.052 | 0.502 ± 0.033 | 0.455 ± 0.033 | 0.546 ± 0.03 | 0.438 ± 0.032 | 1.104 ± 0.047 | 0.999 ± 0.042 | 0.891 ± 0.045 | 0.761 ± 0.036 | 0.28 ± 0.024 | 0.37 ± 0.027 | 0.0 ± 0.0 |
| Tyr | 2.623 ± 0.066 | 0.227 ± 0.023 | 1.815 ± 0.062 | 1.349 ± 0.056 | 0.788 ± 0.044 | 2.407 ± 0.082 | 0.517 ± 0.03 | 0.924 ± 0.042 | 0.583 ± 0.041 | 2.073 ± 0.059 | 0.508 ± 0.031 | 0.57 ± 0.043 | 1.175 ± 0.047 | 0.783 ± 0.037 | 1.895 ± 0.065 | 1.465 ± 0.05 | 1.292 ± 0.064 | 1.927 ± 0.062 | 0.352 ± 0.026 | 0.665 ± 0.034 | 0.0 ± 0.0 |
| Xaa | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.002 ± 0.002 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 |

Statistics based on 1589 proteins (545536 amino acids)

Note: The error was estimated by bootstrapping (100 resamples) at the protein level.
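A protein-level bootstrap of this kind can be sketched as below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each resample, and the standard deviation across resamples is taken as the error. The function names, the fixed seed, the overlapping-window counting, and the use of the standard deviation as the error measure are assumptions made for illustration; only "bootstrapping (x100) at the protein level" is stated by the source.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping dipeptides in one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_resamples=100, seed=0):
    """Estimate the error (in per mille) of one dipeptide frequency.

    Proteins, not individual residues, are the resampling unit.
    """
    rng = random.Random(seed)
    # Precompute (hits for `pair`, total dipeptides) for every protein.
    per_protein = []
    for seq in sequences:
        counts = pair_counts(seq)
        per_protein.append((counts[pair], sum(counts.values())))

    estimates = []
    for _ in range(n_resamples):
        sample = rng.choices(per_protein, k=len(per_protein))
        hits = sum(h for h, _ in sample)
        total = sum(t for _, t in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code (not real data).
toy_proteome = ["MAAL", "MAG", "AAAA", "GLLM"]
error_aa = bootstrap_error(toy_proteome, "AA")
```

Resampling whole proteins keeps the dipeptides of one protein together, so the estimate reflects variation between proteins and not only between individual positions.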

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under a Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944

Contact: Lukasz P. Kozlowski