Amino acid dipeptide frequency for Flavobacterium sp. 123

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
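As a rough illustration of what a dipeptide frequency is, the sketch below counts overlapping pairs of adjacent residues in a set of protein sequences and reports them in per mille. The function name `dipeptide_per_mille`, the one-letter residue codes, and the toy sequences are assumptions made for this example; the page does not state exactly how the counting or normalisation was done.

```python
from collections import Counter

def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over adjacent residue pairs.

    Assumption: pairs are counted with overlapping windows of length 2,
    and a pair never spans two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}

# Hypothetical toy proteome (one-letter codes), not data from this page.
toy = ["MKKLLA", "MKLA"]
freqs = dipeptide_per_mille(toy)
```

In this toy input there are 8 dipeptides in total, so a pair seen twice (such as `MK`) gets 250 per mille. A real proteome would be read from a FASTA file instead of a hard-coded list.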

There are 400 possible dipeptides (441 if Xaa is counted as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides occurred uniformly at random in proteins, each one would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.
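The arithmetic behind the expected value can be checked with a short sketch. It assumes that the baseline used for the red and blue marking is the uniform expectation described above, which the page implies but does not spell out; the `classify` helper is hypothetical, and the four observed values are copied from the table on this page.

```python
# Uniform expectation: every dipeptide equally likely.
expected_per_mille = 1000.0 / 20**2   # 2.5 per mille, i.e. 0.25%
expected_with_xaa = 1000.0 / 21**2    # slightly lower if Xaa counts as a 21st residue

# A few observed values (per mille) taken from the table on this page.
observed = {"LeuLeu": 8.587, "AlaAla": 4.453, "TrpTrp": 0.11, "CysCys": 0.108}

def classify(value, expected=expected_per_mille):
    """Label a frequency relative to the assumed uniform baseline."""
    if value > expected:
        return "over"    # would be marked in red
    if value < expected:
        return "under"   # would be marked in blue
    return "as expected"

labels = {pair: classify(v) for pair, v in observed.items()}
```

With this baseline, LeuLeu and AlaAla come out as overrepresented and TrpTrp and CysCys as underrepresented. A baseline built from the single amino acid frequencies would be a different, stricter comparison.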

All values are given in per mille (‰), so they need to be multiplied by 10^-3 to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Entries are listed as dipeptide: frequency ± error (‰), grouped by first residue. Residue order within each group:
Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa

Ala
AlaAla: 4.453 ± 0.109
AlaCys: 0.547 ± 0.033
AlaAsp: 3.322 ± 0.084
AlaGlu: 3.877 ± 0.113
AlaPhe: 3.356 ± 0.069
AlaGly: 4.237 ± 0.097
AlaHis: 1.006 ± 0.037
AlaIle: 5.901 ± 0.099
AlaLys: 5.151 ± 0.086
AlaLeu: 6.346 ± 0.107
AlaMet: 1.592 ± 0.054
AlaAsn: 3.708 ± 0.075
AlaPro: 2.06 ± 0.073
AlaGln: 2.468 ± 0.061
AlaArg: 1.821 ± 0.054
AlaSer: 4.498 ± 0.104
AlaThr: 4.401 ± 0.189
AlaVal: 4.27 ± 0.084
AlaTrp: 0.571 ± 0.023
AlaTyr: 2.305 ± 0.056
AlaXaa: 0.0 ± 0.0

Cys
CysAla: 0.497 ± 0.037
CysCys: 0.108 ± 0.01
CysAsp: 0.438 ± 0.024
CysGlu: 0.432 ± 0.025
CysPhe: 0.479 ± 0.031
CysGly: 0.646 ± 0.036
CysHis: 0.182 ± 0.017
CysIle: 0.639 ± 0.026
CysLys: 0.466 ± 0.023
CysLeu: 0.637 ± 0.028
CysMet: 0.154 ± 0.013
CysAsn: 0.447 ± 0.027
CysPro: 0.372 ± 0.029
CysGln: 0.224 ± 0.017
CysArg: 0.189 ± 0.012
CysSer: 0.675 ± 0.049
CysThr: 0.519 ± 0.033
CysVal: 0.471 ± 0.025
CysTrp: 0.064 ± 0.008
CysTyr: 0.328 ± 0.019
CysXaa: 0.0 ± 0.0

Asp
AspAla: 3.728 ± 0.091
AspCys: 0.455 ± 0.027
AspAsp: 2.427 ± 0.069
AspGlu: 3.409 ± 0.086
AspPhe: 3.595 ± 0.07
AspGly: 3.502 ± 0.109
AspHis: 0.71 ± 0.032
AspIle: 4.174 ± 0.074
AspLys: 4.29 ± 0.072
AspLeu: 5.089 ± 0.099
AspMet: 1.018 ± 0.037
AspAsn: 2.976 ± 0.077
AspPro: 1.546 ± 0.048
AspGln: 1.339 ± 0.037
AspArg: 1.609 ± 0.045
AspSer: 3.095 ± 0.063
AspThr: 2.686 ± 0.061
AspVal: 3.526 ± 0.071
AspTrp: 0.666 ± 0.029
AspTyr: 2.62 ± 0.054
AspXaa: 0.0 ± 0.0

Glu
GluAla: 4.3 ± 0.101
GluCys: 0.349 ± 0.019
GluAsp: 3.119 ± 0.069
GluGlu: 4.376 ± 0.093
GluPhe: 2.766 ± 0.071
GluGly: 3.511 ± 0.068
GluHis: 1.045 ± 0.033
GluIle: 5.851 ± 0.107
GluLys: 6.109 ± 0.123
GluLeu: 5.743 ± 0.098
GluMet: 1.629 ± 0.044
GluAsn: 4.769 ± 0.095
GluPro: 1.463 ± 0.043
GluGln: 2.161 ± 0.054
GluArg: 2.176 ± 0.064
GluSer: 3.329 ± 0.064
GluThr: 3.581 ± 0.069
GluVal: 3.945 ± 0.087
GluTrp: 0.57 ± 0.031
GluTyr: 2.25 ± 0.053
GluXaa: 0.0 ± 0.0

Phe
PheAla: 3.224 ± 0.065
PheCys: 0.443 ± 0.021
PheAsp: 3.359 ± 0.073
PheGlu: 3.508 ± 0.074
PhePhe: 2.887 ± 0.074
PheGly: 3.886 ± 0.081
PheHis: 0.854 ± 0.037
PheIle: 4.09 ± 0.085
PheLys: 3.705 ± 0.082
PheLeu: 4.819 ± 0.11
PheMet: 1.187 ± 0.041
PheAsn: 3.15 ± 0.072
PhePro: 1.811 ± 0.046
PheGln: 1.671 ± 0.045
PheArg: 1.498 ± 0.046
PheSer: 4.188 ± 0.08
PheThr: 2.991 ± 0.067
PheVal: 3.272 ± 0.073
PheTrp: 0.567 ± 0.029
PheTyr: 2.155 ± 0.062
PheXaa: 0.0 ± 0.0

Gly
GlyAla: 4.382 ± 0.18
GlyCys: 0.686 ± 0.041
GlyAsp: 3.136 ± 0.076
GlyGlu: 3.28 ± 0.069
GlyPhe: 3.693 ± 0.085
GlyGly: 4.452 ± 0.105
GlyHis: 1.102 ± 0.041
GlyIle: 6.027 ± 0.101
GlyLys: 5.026 ± 0.086
GlyLeu: 5.735 ± 0.101
GlyMet: 1.661 ± 0.046
GlyAsn: 3.637 ± 0.096
GlyPro: 1.356 ± 0.043
GlyGln: 1.891 ± 0.05
GlyArg: 1.886 ± 0.048
GlySer: 4.193 ± 0.107
GlyThr: 4.552 ± 0.192
GlyVal: 4.241 ± 0.082
GlyTrp: 0.708 ± 0.031
GlyTyr: 2.654 ± 0.066
GlyXaa: 0.0 ± 0.0

His
HisAla: 0.926 ± 0.033
HisCys: 0.167 ± 0.013
HisAsp: 0.777 ± 0.035
HisGlu: 0.954 ± 0.036
HisPhe: 1.205 ± 0.042
HisGly: 0.944 ± 0.032
HisHis: 0.415 ± 0.023
HisIle: 1.369 ± 0.047
HisLys: 1.103 ± 0.037
HisLeu: 1.663 ± 0.055
HisMet: 0.286 ± 0.019
HisAsn: 0.928 ± 0.034
HisPro: 0.846 ± 0.031
HisGln: 0.705 ± 0.029
HisArg: 0.56 ± 0.028
HisSer: 1.024 ± 0.034
HisThr: 0.843 ± 0.032
HisVal: 0.86 ± 0.032
HisTrp: 0.206 ± 0.016
HisTyr: 0.761 ± 0.027
HisXaa: 0.0 ± 0.0

Ile
IleAla: 6.223 ± 0.09
IleCys: 0.736 ± 0.031
IleAsp: 4.861 ± 0.078
IleGlu: 5.64 ± 0.112
IlePhe: 3.76 ± 0.099
IleGly: 5.55 ± 0.091
IleHis: 1.489 ± 0.043
IleIle: 6.837 ± 0.112
IleLys: 6.173 ± 0.104
IleLeu: 7.531 ± 0.128
IleMet: 1.579 ± 0.046
IleAsn: 5.026 ± 0.074
IlePro: 3.498 ± 0.067
IleGln: 2.918 ± 0.062
IleArg: 2.494 ± 0.062
IleSer: 6.031 ± 0.09
IleThr: 5.199 ± 0.124
IleVal: 5.257 ± 0.093
IleTrp: 0.697 ± 0.031
IleTyr: 2.846 ± 0.055
IleXaa: 0.0 ± 0.0

Lys
LysAla: 5.083 ± 0.095
LysCys: 0.378 ± 0.023
LysAsp: 4.095 ± 0.084
LysGlu: 5.949 ± 0.113
LysPhe: 3.081 ± 0.065
LysGly: 4.638 ± 0.069
LysHis: 1.184 ± 0.042
LysIle: 7.24 ± 0.115
LysLys: 7.199 ± 0.136
LysLeu: 6.53 ± 0.095
LysMet: 2.266 ± 0.057
LysAsn: 5.795 ± 0.104
LysPro: 2.584 ± 0.054
LysGln: 2.678 ± 0.065
LysArg: 2.532 ± 0.062
LysSer: 4.897 ± 0.074
LysThr: 4.93 ± 0.069
LysVal: 4.769 ± 0.08
LysTrp: 0.777 ± 0.033
LysTyr: 3.084 ± 0.065
LysXaa: 0.0 ± 0.0

Leu
LeuAla: 6.007 ± 0.095
LeuCys: 0.692 ± 0.031
LeuAsp: 4.885 ± 0.091
LeuGlu: 5.892 ± 0.1
LeuPhe: 5.186 ± 0.107
LeuGly: 5.803 ± 0.106
LeuHis: 1.479 ± 0.041
LeuIle: 7.21 ± 0.11
LeuLys: 7.471 ± 0.107
LeuLeu: 8.587 ± 0.145
LeuMet: 2.067 ± 0.056
LeuAsn: 5.624 ± 0.075
LeuPro: 3.508 ± 0.067
LeuGln: 3.061 ± 0.074
LeuArg: 2.777 ± 0.067
LeuSer: 6.624 ± 0.093
LeuThr: 5.178 ± 0.12
LeuVal: 5.469 ± 0.095
LeuTrp: 0.713 ± 0.031
LeuTyr: 3.105 ± 0.057
LeuXaa: 0.0 ± 0.0

Met
MetAla: 1.598 ± 0.054
MetCys: 0.152 ± 0.015
MetAsp: 1.157 ± 0.037
MetGlu: 1.339 ± 0.045
MetPhe: 0.869 ± 0.037
MetGly: 1.416 ± 0.046
MetHis: 0.4 ± 0.022
MetIle: 1.786 ± 0.046
MetLys: 2.325 ± 0.052
MetLeu: 1.903 ± 0.048
MetMet: 0.626 ± 0.029
MetAsn: 1.44 ± 0.042
MetPro: 0.833 ± 0.029
MetGln: 0.854 ± 0.031
MetArg: 0.812 ± 0.028
MetSer: 1.463 ± 0.038
MetThr: 1.205 ± 0.042
MetVal: 1.366 ± 0.041
MetTrp: 0.157 ± 0.014
MetTyr: 0.697 ± 0.031
MetXaa: 0.0 ± 0.0

Asn
AsnAla: 3.925 ± 0.072
AsnCys: 0.548 ± 0.063
AsnAsp: 3.185 ± 0.062
AsnGlu: 3.753 ± 0.068
AsnPhe: 3.174 ± 0.07
AsnGly: 4.127 ± 0.138
AsnHis: 1.118 ± 0.038
AsnIle: 4.697 ± 0.084
AsnLys: 4.878 ± 0.088
AsnLeu: 5.611 ± 0.082
AsnMet: 1.253 ± 0.037
AsnAsn: 3.832 ± 0.096
AsnPro: 2.912 ± 0.056
AsnGln: 2.46 ± 0.058
AsnArg: 1.998 ± 0.053
AsnSer: 4.153 ± 0.089
AsnThr: 3.754 ± 0.086
AsnVal: 3.637 ± 0.083
AsnTrp: 0.776 ± 0.036
AsnTyr: 2.857 ± 0.069
AsnXaa: 0.0 ± 0.0

Pro
ProAla: 2.045 ± 0.053
ProCys: 0.197 ± 0.016
ProAsp: 1.804 ± 0.054
ProGlu: 2.723 ± 0.078
ProPhe: 2.051 ± 0.049
ProGly: 1.776 ± 0.058
ProHis: 0.532 ± 0.028
ProIle: 2.997 ± 0.065
ProLys: 2.661 ± 0.066
ProLeu: 2.964 ± 0.065
ProMet: 0.758 ± 0.032
ProAsn: 2.35 ± 0.057
ProPro: 0.7 ± 0.037
ProGln: 1.006 ± 0.035
ProArg: 0.861 ± 0.033
ProSer: 2.222 ± 0.077
ProThr: 2.189 ± 0.079
ProVal: 2.396 ± 0.062
ProTrp: 0.305 ± 0.02
ProTyr: 1.352 ± 0.039
ProXaa: 0.0 ± 0.0

Gln
GlnAla: 1.978 ± 0.05
GlnCys: 0.206 ± 0.021
GlnAsp: 1.541 ± 0.045
GlnGlu: 2.362 ± 0.06
GlnPhe: 1.656 ± 0.043
GlnGly: 1.744 ± 0.101
GlnHis: 0.516 ± 0.029
GlnIle: 2.838 ± 0.055
GlnLys: 3.123 ± 0.068
GlnLeu: 3.307 ± 0.081
GlnMet: 0.838 ± 0.031
GlnAsn: 2.424 ± 0.058
GlnPro: 0.958 ± 0.04
GlnGln: 1.243 ± 0.043
GlnArg: 1.008 ± 0.036
GlnSer: 1.991 ± 0.051
GlnThr: 1.983 ± 0.06
GlnVal: 1.927 ± 0.047
GlnTrp: 0.37 ± 0.023
GlnTyr: 1.273 ± 0.039
GlnXaa: 0.0 ± 0.0

Arg
ArgAla: 1.99 ± 0.072
ArgCys: 0.189 ± 0.014
ArgAsp: 1.58 ± 0.048
ArgGlu: 1.918 ± 0.047
ArgPhe: 1.677 ± 0.044
ArgGly: 1.712 ± 0.046
ArgHis: 0.497 ± 0.026
ArgIle: 2.779 ± 0.063
ArgLys: 2.62 ± 0.065
ArgLeu: 2.895 ± 0.073
ArgMet: 0.865 ± 0.027
ArgAsn: 1.962 ± 0.051
ArgPro: 0.936 ± 0.039
ArgGln: 0.907 ± 0.031
ArgArg: 1.069 ± 0.043
ArgSer: 1.705 ± 0.045
ArgThr: 1.679 ± 0.045
ArgVal: 1.779 ± 0.051
ArgTrp: 0.334 ± 0.019
ArgTyr: 1.293 ± 0.042
ArgXaa: 0.0 ± 0.0

Ser
SerAla: 3.954 ± 0.105
SerCys: 0.696 ± 0.038
SerAsp: 3.523 ± 0.065
SerGlu: 3.89 ± 0.079
SerPhe: 4.082 ± 0.081
SerGly: 5.045 ± 0.142
SerHis: 1.157 ± 0.043
SerIle: 5.436 ± 0.095
SerLys: 5.131 ± 0.08
SerLeu: 5.916 ± 0.087
SerMet: 1.326 ± 0.039
SerAsn: 3.976 ± 0.086
SerPro: 2.157 ± 0.06
SerGln: 2.251 ± 0.055
SerArg: 1.935 ± 0.05
SerSer: 4.51 ± 0.108
SerThr: 3.697 ± 0.101
SerVal: 4.333 ± 0.091
SerTrp: 0.704 ± 0.036
SerTyr: 2.701 ± 0.067
SerXaa: 0.0 ± 0.0

Thr
ThrAla: 4.19 ± 0.164
ThrCys: 0.399 ± 0.029
ThrAsp: 3.142 ± 0.082
ThrGlu: 3.359 ± 0.078
ThrPhe: 3.136 ± 0.072
ThrGly: 4.215 ± 0.154
ThrHis: 0.956 ± 0.033
ThrIle: 5.801 ± 0.128
ThrLys: 4.095 ± 0.074
ThrLeu: 5.4 ± 0.098
ThrMet: 1.006 ± 0.038
ThrAsn: 3.738 ± 0.122
ThrPro: 2.503 ± 0.074
ThrGln: 1.861 ± 0.053
ThrArg: 1.555 ± 0.067
ThrSer: 3.8 ± 0.109
ThrThr: 4.042 ± 0.172
ThrVal: 4.124 ± 0.153
ThrTrp: 0.585 ± 0.047
ThrTyr: 2.403 ± 0.094
ThrXaa: 0.0 ± 0.0

Val
ValAla: 4.438 ± 0.075
ValCys: 0.563 ± 0.028
ValAsp: 3.427 ± 0.071
ValGlu: 3.557 ± 0.075
ValPhe: 3.596 ± 0.079
ValGly: 3.924 ± 0.095
ValHis: 0.925 ± 0.034
ValIle: 5.067 ± 0.078
ValLys: 4.457 ± 0.069
ValLeu: 6.133 ± 0.088
ValMet: 1.303 ± 0.042
ValAsn: 3.606 ± 0.08
ValPro: 2.164 ± 0.06
ValGln: 1.759 ± 0.05
ValArg: 1.883 ± 0.047
ValSer: 4.596 ± 0.088
ValThr: 4.019 ± 0.167
ValVal: 4.322 ± 0.1
ValTrp: 0.588 ± 0.027
ValTyr: 2.383 ± 0.059
ValXaa: 0.0 ± 0.0

Trp
TrpAla: 0.6 ± 0.028
TrpCys: 0.092 ± 0.01
TrpAsp: 0.557 ± 0.026
TrpGlu: 0.597 ± 0.028
TrpPhe: 0.53 ± 0.025
TrpGly: 0.666 ± 0.029
TrpHis: 0.226 ± 0.015
TrpIle: 0.779 ± 0.031
TrpLys: 0.78 ± 0.03
TrpLeu: 0.878 ± 0.037
TrpMet: 0.275 ± 0.019
TrpAsn: 0.768 ± 0.039
TrpPro: 0.217 ± 0.017
TrpGln: 0.365 ± 0.021
TrpArg: 0.329 ± 0.021
TrpSer: 0.609 ± 0.026
TrpThr: 0.538 ± 0.039
TrpVal: 0.586 ± 0.029
TrpTrp: 0.11 ± 0.011
TrpTyr: 0.403 ± 0.022
TrpXaa: 0.0 ± 0.0

Tyr
TyrAla: 2.347 ± 0.063
TyrCys: 0.363 ± 0.021
TyrAsp: 2.121 ± 0.056
TyrGlu: 2.226 ± 0.05
TyrPhe: 2.486 ± 0.053
TyrGly: 2.504 ± 0.06
TyrHis: 0.747 ± 0.029
TyrIle: 2.828 ± 0.055
TyrLys: 2.925 ± 0.07
TyrLeu: 3.654 ± 0.071
TyrMet: 0.713 ± 0.025
TyrAsn: 2.49 ± 0.065
TyrPro: 1.463 ± 0.043
TyrGln: 1.465 ± 0.048
TyrArg: 1.396 ± 0.042
TyrSer: 2.818 ± 0.077
TyrThr: 2.323 ± 0.085
TyrVal: 2.16 ± 0.056
TyrTrp: 0.439 ± 0.023
TyrTyr: 1.734 ± 0.048
TyrXaa: 0.0 ± 0.0

Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.0 ± 0.0

Statistics based on 2513 proteins (879023 amino acids)

Note: The error was estimated by bootstrapping (x100) at the protein level.
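The note above only states that the error comes from 100 bootstrap rounds at the protein level. One plausible reading, sketched below, is to resample whole proteins with replacement, recompute the dipeptide frequency for each resample, and report the standard deviation of those estimates. The function names, the fixed seed, the use of the standard deviation as the error, and the toy sequences are all assumptions for illustration, not the database's documented procedure.

```python
import random
import statistics
from collections import Counter

def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))

def bootstrap_error(proteins, pair, n_rounds=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each round resamples whole proteins with replacement, so residues
    from the same protein always stay together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(p) for p in proteins]
    estimates = []
    for _ in range(n_rounds):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)

# Hypothetical toy proteome (one-letter codes), not data from this page.
toy = ["MKKLLA", "MKLA", "MAAKL", "MLLLK"]
mean, err = bootstrap_error(toy, "LL")
```

Resampling at the protein level rather than the residue level keeps correlated residues together, which tends to give a more honest error estimate when a few long or repetitive proteins dominate a given dipeptide.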

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski