Amino acid dipeptide frequency for Flavobacterium noncentrifugens

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred uniformly at random in proteins, each would be present at a frequency of 0.25% (2.5‰). This is not the case in nature, so for better visibility all dipeptides that are more frequent than expected are marked in red in the table, and those that are underrepresented are marked in blue.
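As a rough illustration of how such frequencies could be computed, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. It is only a sketch under stated assumptions: the function name `dipeptide_permille`, the one-letter alphabet with "X" standing in for Xaa, and the toy sequences are all hypothetical, and whether the database counts pairs in exactly this way is not stated in the text.

```python
from collections import Counter
from itertools import product

# 20 standard residues in one-letter code; "X" stands in for Xaa (assumption).
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_permille(proteins):
    """Return {dipeptide: frequency in per mille} over overlapping pairs.

    Pairs are counted within each protein only, never across protein
    boundaries. Any residue outside the 20 standard ones is mapped to "X".
    """
    counts = Counter()
    for seq in proteins:
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))
    total = sum(counts.values())
    alphabet = STANDARD + "X"
    return {
        a + b: (1000.0 * counts[a + b] / total if total else 0.0)
        for a, b in product(alphabet, repeat=2)
    }


# Hypothetical toy input, not real proteome data.
toy = ["MKKLLA", "AAXG"]
freqs = dipeptide_permille(toy)

# Uniform expectation for the 400 standard dipeptides: 2.5 per mille (0.25%).
uniform_expectation = 1000.0 / 400
```

The returned dictionary always has 441 entries (21 × 21), so dipeptides that never occur show up with a frequency of 0.0, as in the Xaa rows of the table.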

All values are presented in per mille (‰) and therefore need to be multiplied by 10^-3 to obtain fractions.
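The unit conversion and the over/under-representation check can be sketched as follows. The helper names are hypothetical, and the comparison threshold is assumed to be the uniform expectation of 0.25% (2.5‰) for 400 dipeptides; the three sample values are taken from the table (AlaAla, CysCys, LeuLeu).

```python
# Uniform expectation for 400 dipeptides, in per mille (assumed threshold).
UNIFORM_PERMILLE = 1000.0 / 400  # 2.5


def permille_to_fraction(value):
    """Convert a per mille value to a plain fraction (multiply by 10^-3)."""
    return value * 1e-3


def classify(value, expected=UNIFORM_PERMILLE):
    """Label a per mille value relative to the expected frequency."""
    if value > expected:
        return "over"
    if value < expected:
        return "under"
    return "as expected"


# Values in per mille, copied from the table.
examples = {"AlaAla": 6.048, "CysCys": 0.118, "LeuLeu": 8.638}
labels = {name: classify(v) for name, v in examples.items()}
fractions = {name: permille_to_fraction(v) for name, v in examples.items()}
```

With these inputs, AlaAla and LeuLeu fall above the uniform expectation and CysCys falls below it.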

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 6.048 ± 0.125
AlaCys: 0.591 ± 0.027
AlaAsp: 3.84 ± 0.06
AlaGlu: 4.271 ± 0.07
AlaPhe: 3.847 ± 0.055
AlaGly: 4.886 ± 0.103
AlaHis: 1.11 ± 0.03
AlaIle: 5.625 ± 0.079
AlaLys: 4.86 ± 0.079
AlaLeu: 6.627 ± 0.086
AlaMet: 1.673 ± 0.041
AlaAsn: 4.179 ± 0.074
AlaPro: 2.307 ± 0.06
AlaGln: 2.645 ± 0.043
AlaArg: 1.937 ± 0.042
AlaSer: 4.658 ± 0.087
AlaThr: 4.829 ± 0.149
AlaVal: 4.998 ± 0.084
AlaTrp: 0.621 ± 0.025
AlaTyr: 2.465 ± 0.046
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 0.587 ± 0.035
CysCys: 0.118 ± 0.01
CysAsp: 0.412 ± 0.021
CysGlu: 0.458 ± 0.024
CysPhe: 0.417 ± 0.018
CysGly: 0.665 ± 0.027
CysHis: 0.182 ± 0.013
CysIle: 0.583 ± 0.025
CysLys: 0.477 ± 0.024
CysLeu: 0.631 ± 0.022
CysMet: 0.148 ± 0.012
CysAsn: 0.467 ± 0.026
CysPro: 0.333 ± 0.022
CysGln: 0.222 ± 0.015
CysArg: 0.237 ± 0.014
CysSer: 0.698 ± 0.035
CysThr: 0.53 ± 0.034
CysVal: 0.456 ± 0.021
CysTrp: 0.063 ± 0.008
CysTyr: 0.325 ± 0.017
CysXaa: 0.0 ± 0.0
Asp
AspAla: 4.073 ± 0.065
AspCys: 0.455 ± 0.022
AspAsp: 2.785 ± 0.057
AspGlu: 3.409 ± 0.062
AspPhe: 3.728 ± 0.065
AspGly: 3.863 ± 0.075
AspHis: 0.935 ± 0.03
AspIle: 4.013 ± 0.057
AspLys: 4.015 ± 0.076
AspLeu: 5.017 ± 0.068
AspMet: 1.109 ± 0.033
AspAsn: 2.995 ± 0.056
AspPro: 1.718 ± 0.035
AspGln: 1.554 ± 0.037
AspArg: 1.83 ± 0.041
AspSer: 2.977 ± 0.055
AspThr: 2.386 ± 0.045
AspVal: 3.433 ± 0.064
AspTrp: 0.736 ± 0.029
AspTyr: 2.707 ± 0.053
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.178 ± 0.077
GluCys: 0.316 ± 0.016
GluAsp: 2.848 ± 0.055
GluGlu: 3.651 ± 0.075
GluPhe: 2.824 ± 0.06
GluGly: 3.048 ± 0.052
GluHis: 0.953 ± 0.03
GluIle: 5.435 ± 0.078
GluLys: 5.797 ± 0.102
GluLeu: 5.307 ± 0.089
GluMet: 1.569 ± 0.039
GluAsn: 4.891 ± 0.075
GluPro: 1.517 ± 0.035
GluGln: 2.042 ± 0.048
GluArg: 2.132 ± 0.054
GluSer: 3.397 ± 0.062
GluThr: 3.763 ± 0.059
GluVal: 3.595 ± 0.057
GluTrp: 0.552 ± 0.023
GluTyr: 2.036 ± 0.043
GluXaa: 0.0 ± 0.0
Phe
PheAla: 3.711 ± 0.063
PheCys: 0.508 ± 0.021
PheAsp: 3.409 ± 0.051
PheGlu: 3.524 ± 0.061
PhePhe: 2.873 ± 0.059
PheGly: 3.808 ± 0.062
PheHis: 0.914 ± 0.027
PheIle: 3.773 ± 0.07
PheLys: 3.46 ± 0.064
PheLeu: 4.75 ± 0.074
PheMet: 1.16 ± 0.031
PheAsn: 3.092 ± 0.064
PhePro: 1.94 ± 0.038
PheGln: 1.595 ± 0.039
PheArg: 1.849 ± 0.041
PheSer: 4.165 ± 0.065
PheThr: 3.526 ± 0.065
PheVal: 3.185 ± 0.057
PheTrp: 0.576 ± 0.025
PheTyr: 2.329 ± 0.051
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 4.257 ± 0.107
GlyCys: 0.679 ± 0.037
GlyAsp: 3.131 ± 0.062
GlyGlu: 3.259 ± 0.055
GlyPhe: 3.876 ± 0.064
GlyGly: 4.53 ± 0.114
GlyHis: 1.059 ± 0.034
GlyIle: 5.664 ± 0.071
GlyLys: 5.199 ± 0.084
GlyLeu: 5.55 ± 0.081
GlyMet: 1.564 ± 0.037
GlyAsn: 4.153 ± 0.077
GlyPro: 1.329 ± 0.039
GlyGln: 2.048 ± 0.051
GlyArg: 1.925 ± 0.049
GlySer: 4.398 ± 0.094
GlyThr: 4.883 ± 0.148
GlyVal: 3.949 ± 0.071
GlyTrp: 0.744 ± 0.026
GlyTyr: 2.81 ± 0.065
GlyXaa: 0.0 ± 0.0
His
HisAla: 0.997 ± 0.027
HisCys: 0.188 ± 0.012
HisAsp: 0.884 ± 0.026
HisGlu: 0.956 ± 0.031
HisPhe: 1.245 ± 0.037
HisGly: 1.015 ± 0.03
HisHis: 0.46 ± 0.021
HisIle: 1.269 ± 0.038
HisLys: 1.083 ± 0.032
HisLeu: 1.689 ± 0.038
HisMet: 0.296 ± 0.016
HisAsn: 0.933 ± 0.028
HisPro: 0.863 ± 0.03
HisGln: 0.688 ± 0.021
HisArg: 0.576 ± 0.022
HisSer: 1.121 ± 0.038
HisThr: 0.888 ± 0.028
HisVal: 0.894 ± 0.028
HisTrp: 0.222 ± 0.015
HisTyr: 0.811 ± 0.028
HisXaa: 0.0 ± 0.0
Ile
IleAla: 6.077 ± 0.073
IleCys: 0.729 ± 0.028
IleAsp: 4.626 ± 0.069
IleGlu: 4.859 ± 0.082
IlePhe: 3.823 ± 0.063
IleGly: 5.176 ± 0.077
IleHis: 1.333 ± 0.036
IleIle: 5.661 ± 0.096
IleLys: 5.314 ± 0.078
IleLeu: 6.87 ± 0.092
IleMet: 1.424 ± 0.032
IleAsn: 4.383 ± 0.074
IlePro: 3.27 ± 0.055
IleGln: 2.497 ± 0.047
IleArg: 2.824 ± 0.054
IleSer: 5.868 ± 0.075
IleThr: 4.851 ± 0.089
IleVal: 4.931 ± 0.087
IleTrp: 0.72 ± 0.025
IleTyr: 2.736 ± 0.053
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.951 ± 0.083
LysCys: 0.361 ± 0.019
LysAsp: 4.07 ± 0.078
LysGlu: 4.786 ± 0.1
LysPhe: 3.258 ± 0.058
LysGly: 4.093 ± 0.068
LysHis: 1.21 ± 0.034
LysIle: 6.699 ± 0.087
LysLys: 6.42 ± 0.096
LysLeu: 6.196 ± 0.084
LysMet: 2.221 ± 0.043
LysAsn: 5.258 ± 0.086
LysPro: 2.597 ± 0.051
LysGln: 2.532 ± 0.052
LysArg: 2.477 ± 0.054
LysSer: 4.762 ± 0.073
LysThr: 4.843 ± 0.069
LysVal: 4.321 ± 0.065
LysTrp: 0.715 ± 0.025
LysTyr: 2.934 ± 0.049
LysXaa: 0.0 ± 0.0
Leu
LeuAla: 6.211 ± 0.088
LeuCys: 0.671 ± 0.026
LeuAsp: 4.629 ± 0.076
LeuGlu: 5.353 ± 0.084
LeuPhe: 4.879 ± 0.083
LeuGly: 5.536 ± 0.082
LeuHis: 1.623 ± 0.04
LeuIle: 6.386 ± 0.104
LeuLys: 7.159 ± 0.094
LeuLeu: 8.638 ± 0.113
LeuMet: 2.053 ± 0.049
LeuAsn: 5.272 ± 0.087
LeuPro: 3.704 ± 0.067
LeuGln: 3.459 ± 0.058
LeuArg: 3.061 ± 0.055
LeuSer: 6.741 ± 0.083
LeuThr: 5.277 ± 0.108
LeuVal: 5.017 ± 0.067
LeuTrp: 0.796 ± 0.032
LeuTyr: 3.059 ± 0.055
LeuXaa: 0.0 ± 0.0
Met
MetAla: 1.751 ± 0.043
MetCys: 0.134 ± 0.01
MetAsp: 1.106 ± 0.031
MetGlu: 1.336 ± 0.037
MetPhe: 0.875 ± 0.028
MetGly: 1.274 ± 0.037
MetHis: 0.422 ± 0.021
MetIle: 1.619 ± 0.04
MetLys: 2.327 ± 0.049
MetLeu: 2.064 ± 0.044
MetMet: 0.625 ± 0.027
MetAsn: 1.313 ± 0.033
MetPro: 0.927 ± 0.028
MetGln: 0.897 ± 0.03
MetArg: 0.809 ± 0.028
MetSer: 1.386 ± 0.035
MetThr: 1.297 ± 0.033
MetVal: 1.375 ± 0.04
MetTrp: 0.158 ± 0.012
MetTyr: 0.694 ± 0.025
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 4.632 ± 0.076
AsnCys: 0.502 ± 0.031
AsnAsp: 3.313 ± 0.061
AsnGlu: 3.426 ± 0.064
AsnPhe: 3.265 ± 0.053
AsnGly: 4.508 ± 0.096
AsnHis: 1.099 ± 0.032
AsnIle: 4.794 ± 0.074
AsnLys: 3.771 ± 0.058
AsnLeu: 5.606 ± 0.099
AsnMet: 1.269 ± 0.034
AsnAsn: 3.759 ± 0.076
AsnPro: 3.074 ± 0.057
AsnGln: 2.124 ± 0.047
AsnArg: 2.077 ± 0.039
AsnSer: 3.97 ± 0.071
AsnThr: 3.673 ± 0.081
AsnVal: 3.748 ± 0.058
AsnTrp: 0.821 ± 0.028
AsnTyr: 2.839 ± 0.06
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.619 ± 0.072
ProCys: 0.211 ± 0.016
ProAsp: 2.218 ± 0.045
ProGlu: 2.956 ± 0.056
ProPhe: 2.005 ± 0.038
ProGly: 2.168 ± 0.046
ProHis: 0.529 ± 0.021
ProIle: 2.588 ± 0.052
ProLys: 2.571 ± 0.057
ProLeu: 2.992 ± 0.059
ProMet: 0.819 ± 0.028
ProAsn: 2.297 ± 0.046
ProPro: 0.884 ± 0.034
ProGln: 1.171 ± 0.034
ProArg: 0.857 ± 0.031
ProSer: 2.125 ± 0.049
ProThr: 2.268 ± 0.072
ProVal: 2.781 ± 0.056
ProTrp: 0.324 ± 0.024
ProTyr: 1.313 ± 0.036
ProXaa: 0.0 ± 0.0
Gln
GlnAla: 2.127 ± 0.044
GlnCys: 0.236 ± 0.018
GlnAsp: 1.459 ± 0.038
GlnGlu: 1.897 ± 0.042
GlnPhe: 1.762 ± 0.043
GlnGly: 1.619 ± 0.038
GlnHis: 0.631 ± 0.024
GlnIle: 2.835 ± 0.052
GlnLys: 3.086 ± 0.062
GlnLeu: 3.339 ± 0.053
GlnMet: 0.885 ± 0.029
GlnAsn: 2.463 ± 0.045
GlnPro: 1.172 ± 0.04
GlnGln: 1.64 ± 0.047
GlnArg: 1.155 ± 0.029
GlnSer: 2.198 ± 0.043
GlnThr: 2.142 ± 0.048
GlnVal: 1.821 ± 0.037
GlnTrp: 0.403 ± 0.025
GlnTyr: 1.386 ± 0.038
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 1.938 ± 0.044
ArgCys: 0.198 ± 0.013
ArgAsp: 1.625 ± 0.041
ArgGlu: 2.065 ± 0.052
ArgPhe: 1.918 ± 0.037
ArgGly: 1.684 ± 0.044
ArgHis: 0.593 ± 0.022
ArgIle: 2.858 ± 0.049
ArgLys: 2.807 ± 0.058
ArgLeu: 2.978 ± 0.058
ArgMet: 0.92 ± 0.028
ArgAsn: 2.212 ± 0.046
ArgPro: 1.043 ± 0.033
ArgGln: 1.109 ± 0.03
ArgArg: 1.213 ± 0.036
ArgSer: 1.884 ± 0.042
ArgThr: 1.819 ± 0.045
ArgVal: 1.908 ± 0.039
ArgTrp: 0.328 ± 0.017
ArgTyr: 1.402 ± 0.037
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.74 ± 0.085
SerCys: 0.664 ± 0.031
SerAsp: 3.977 ± 0.06
SerGlu: 4.209 ± 0.073
SerPhe: 3.686 ± 0.054
SerGly: 5.649 ± 0.107
SerHis: 1.156 ± 0.031
SerIle: 4.862 ± 0.06
SerLys: 4.657 ± 0.093
SerLeu: 5.655 ± 0.093
SerMet: 1.338 ± 0.035
SerAsn: 4.021 ± 0.075
SerPro: 2.287 ± 0.048
SerGln: 2.252 ± 0.055
SerArg: 2.065 ± 0.044
SerSer: 4.052 ± 0.071
SerThr: 3.578 ± 0.068
SerVal: 4.271 ± 0.065
SerTrp: 0.739 ± 0.033
SerTyr: 2.675 ± 0.052
SerXaa: 0.0 ± 0.0
Thr
ThrAla: 5.258 ± 0.15
ThrCys: 0.405 ± 0.023
ThrAsp: 3.354 ± 0.063
ThrGlu: 3.359 ± 0.06
ThrPhe: 3.352 ± 0.063
ThrGly: 4.787 ± 0.13
ThrHis: 0.923 ± 0.027
ThrIle: 5.113 ± 0.093
ThrLys: 3.648 ± 0.063
ThrLeu: 5.431 ± 0.088
ThrMet: 0.986 ± 0.025
ThrAsn: 3.558 ± 0.083
ThrPro: 2.693 ± 0.085
ThrGln: 1.957 ± 0.051
ThrArg: 1.635 ± 0.04
ThrSer: 3.978 ± 0.085
ThrThr: 4.335 ± 0.15
ThrVal: 4.433 ± 0.111
ThrTrp: 0.567 ± 0.027
ThrTyr: 2.408 ± 0.075
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.708 ± 0.084
ValCys: 0.571 ± 0.028
ValAsp: 3.17 ± 0.055
ValGlu: 3.339 ± 0.059
ValPhe: 3.402 ± 0.064
ValGly: 3.632 ± 0.07
ValHis: 0.968 ± 0.032
ValIle: 4.737 ± 0.074
ValLys: 4.448 ± 0.072
ValLeu: 5.768 ± 0.074
ValMet: 1.364 ± 0.035
ValAsn: 3.689 ± 0.059
ValPro: 2.297 ± 0.047
ValGln: 1.812 ± 0.039
ValArg: 2.031 ± 0.046
ValSer: 4.834 ± 0.07
ValThr: 4.191 ± 0.104
ValVal: 4.357 ± 0.07
ValTrp: 0.55 ± 0.025
ValTyr: 2.416 ± 0.05
ValXaa: 0.001 ± 0.001
Trp
TrpAla: 0.675 ± 0.026
TrpCys: 0.095 ± 0.009
TrpAsp: 0.532 ± 0.023
TrpGlu: 0.603 ± 0.021
TrpPhe: 0.549 ± 0.02
TrpGly: 0.625 ± 0.028
TrpHis: 0.225 ± 0.015
TrpIle: 0.772 ± 0.031
TrpLys: 0.841 ± 0.025
TrpLeu: 0.893 ± 0.028
TrpMet: 0.271 ± 0.015
TrpAsn: 0.734 ± 0.026
TrpPro: 0.241 ± 0.014
TrpGln: 0.446 ± 0.023
TrpArg: 0.343 ± 0.016
TrpSer: 0.623 ± 0.029
TrpThr: 0.632 ± 0.028
TrpVal: 0.576 ± 0.028
TrpTrp: 0.133 ± 0.01
TrpTyr: 0.419 ± 0.024
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 2.48 ± 0.044
TyrCys: 0.381 ± 0.022
TyrAsp: 2.349 ± 0.051
TyrGlu: 2.134 ± 0.045
TyrPhe: 2.562 ± 0.048
TyrGly: 2.44 ± 0.057
TyrHis: 0.754 ± 0.029
TyrIle: 2.685 ± 0.049
TyrLys: 2.765 ± 0.061
TyrLeu: 3.521 ± 0.059
TyrMet: 0.694 ± 0.025
TyrAsn: 2.599 ± 0.06
TyrPro: 1.502 ± 0.037
TyrGln: 1.565 ± 0.038
TyrArg: 1.469 ± 0.04
TyrSer: 2.697 ± 0.057
TyrThr: 2.47 ± 0.076
TyrVal: 2.236 ± 0.049
TyrTrp: 0.462 ± 0.023
TyrTyr: 1.844 ± 0.05
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.0 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.001 ± 0.001
XaaPro: 0.0 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.002 ± 0.002
Statistics based on 3558 proteins (1205208 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
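A protein-level bootstrap of this kind might look like the sketch below: whole proteins are resampled with replacement, the dipeptide frequency is recomputed for each replicate, and the spread of the replicates is reported. The function names, the fixed seed, and the use of the standard deviation of the replicates as the "±" value are assumptions made for illustration; the text only states that 100 bootstrap replicates were drawn at the protein level.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping residue pairs within one protein sequence."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_permille(proteins, dipeptide, replicates=100, seed=0):
    """Return (mean, standard deviation) of a dipeptide frequency in per mille.

    Each replicate resamples whole proteins with replacement, so all pairs
    from one protein are kept or dropped together.
    """
    rng = random.Random(seed)
    per_protein = [pair_counts(s) for s in proteins]
    estimates = []
    for _ in range(replicates):
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[dipeptide] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Hypothetical toy sequences in one-letter code, not real proteome data.
toy = ["MKKLLAKK", "AAKKG", "MLLLK", "GKKAA"]
mean_kk, sd_kk = bootstrap_permille(toy, "KK")
```

Resampling proteins rather than individual dipeptides keeps the pairs within a protein together, which matters because neighbouring pairs in one sequence are not independent.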

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski