Amino acid dipeptide frequency for Brassica campestris (Field mustard)

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency, i.e. how often each ordered pair of adjacent residues occurs.
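As a rough illustration of the calculation, here is a minimal Python sketch. It assumes one-letter residue codes, overlapping pairs counted within each protein (never across protein boundaries), and normalisation by the total number of pairs; the function name `dipeptide_per_mille` and the toy sequences are hypothetical, and the exact procedure used for this table is not stated on the page.

```python
from collections import Counter


def dipeptide_per_mille(sequences):
    """Return {dipeptide: frequency in per mille} over all adjacent residue pairs."""
    counts = Counter()
    for seq in sequences:
        # Slide a window of width 2 along each protein; pairs never span two proteins.
        counts.update(seq[i:i + 2] for i in range(len(seq) - 1))
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: 1000.0 * n / total for pair, n in counts.items()}


# Toy input (not real Brassica campestris proteins).
toy_proteins = ["MAAL", "MAS"]
toy_freqs = dipeptide_per_mille(toy_proteins)
for pair, value in sorted(toy_freqs.items()):
    print(f"{pair}: {value:.1f}")
```

The returned frequencies sum to 1000 ‰ by construction, which is a convenient sanity check when applying the same idea to a full proteome.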

There are 400 possible dipeptides (441 if Xaa is counted as an additional, 21st amino acid representing all non-standard or unknown amino acids). Thus, if all dipeptides were equally likely in proteins, each would occur with a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility the dipeptides that occur more often than expected are marked in red, and those that are underrepresented are marked in blue in the table.
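The uniform baseline described above can be checked with a few lines of Python. This is only a sketch: it assumes the comparison is made against the equal-probability expectation (1/400 of all pairs), and the helper names `uniform_expectation_per_mille` and `classify` are hypothetical. The three sample values are taken from the table on this page.

```python
def uniform_expectation_per_mille(alphabet_size=20):
    """Expected per-mille frequency if every ordered pair were equally likely."""
    return 1000.0 / alphabet_size ** 2


def classify(freq_per_mille, expected):
    """Label a dipeptide relative to the uniform expectation."""
    if freq_per_mille > expected:
        return "over"    # shown in red on the original page
    if freq_per_mille < expected:
        return "under"   # shown in blue on the original page
    return "as expected"


expected_20 = uniform_expectation_per_mille()     # 20 standard amino acids
expected_21 = uniform_expectation_per_mille(21)   # including Xaa

# Values copied from the table (per mille).
samples = {"SerSer": 11.994, "LeuLeu": 9.528, "TrpTrp": 0.239}
labels = {pair: classify(value, expected_20) for pair, value in samples.items()}
print(expected_20, round(expected_21, 3), labels)
```

A baseline built from the single amino acid frequencies (the product of the two residue frequencies) would be a different, stricter notion of "expected"; the page text describes only the uniform one.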

All values are given in per mille (‰), so they need to be multiplied by 10⁻³ to obtain fractions.

For more information, see the sequence space article on Wikipedia.

Dipeptide frequencies in ‰ (value ± error); rows give the first residue, columns the second.

1st \ 2nd | Ala | Cys | Asp | Glu | Phe | Gly | His | Ile | Lys | Leu | Met | Asn | Pro | Gln | Arg | Ser | Thr | Val | Trp | Tyr | Xaa
Ala | 6.124 ± 0.03 | 1.162 ± 0.01 | 2.94 ± 0.014 | 4.15 ± 0.022 | 2.691 ± 0.014 | 3.902 ± 0.019 | 1.202 ± 0.009 | 3.364 ± 0.018 | 3.973 ± 0.02 | 6.221 ± 0.023 | 1.79 ± 0.01 | 2.419 ± 0.015 | 2.757 ± 0.017 | 1.933 ± 0.012 | 3.328 ± 0.016 | 6.03 ± 0.026 | 3.653 ± 0.014 | 4.854 ± 0.023 | 0.718 ± 0.007 | 1.755 ± 0.012 | 0.0 ± 0.0
Cys | 0.888 ± 0.007 | 0.544 ± 0.007 | 0.911 ± 0.008 | 0.908 ± 0.007 | 0.951 ± 0.007 | 1.421 ± 0.012 | 0.429 ± 0.006 | 0.928 ± 0.009 | 1.127 ± 0.01 | 1.888 ± 0.013 | 0.419 ± 0.005 | 0.796 ± 0.008 | 0.901 ± 0.008 | 0.53 ± 0.007 | 1.058 ± 0.009 | 1.757 ± 0.012 | 0.83 ± 0.007 | 1.216 ± 0.01 | 0.222 ± 0.004 | 0.578 ± 0.006 | 0.0 ± 0.0
Asp | 3.281 ± 0.015 | 0.935 ± 0.008 | 3.935 ± 0.021 | 4.258 ± 0.022 | 2.308 ± 0.013 | 3.747 ± 0.016 | 1.32 ± 0.01 | 2.815 ± 0.013 | 2.9 ± 0.017 | 5.106 ± 0.019 | 1.354 ± 0.01 | 2.064 ± 0.014 | 2.65 ± 0.012 | 1.782 ± 0.011 | 2.326 ± 0.014 | 4.293 ± 0.017 | 2.283 ± 0.011 | 3.878 ± 0.016 | 0.686 ± 0.006 | 1.591 ± 0.011 | 0.0 ± 0.0
Glu | 4.976 ± 0.022 | 0.899 ± 0.007 | 4.246 ± 0.021 | 7.296 ± 0.04 | 2.418 ± 0.013 | 3.612 ± 0.018 | 1.231 ± 0.009 | 3.775 ± 0.018 | 5.091 ± 0.03 | 5.717 ± 0.025 | 1.866 ± 0.012 | 2.975 ± 0.013 | 2.247 ± 0.013 | 2.045 ± 0.013 | 3.597 ± 0.019 | 4.903 ± 0.02 | 3.756 ± 0.017 | 4.221 ± 0.018 | 0.756 ± 0.008 | 1.673 ± 0.012 | 0.0 ± 0.0
Phe | 2.478 ± 0.014 | 0.875 ± 0.008 | 2.358 ± 0.012 | 2.306 ± 0.012 | 2.005 ± 0.014 | 3.064 ± 0.016 | 1.087 ± 0.008 | 2.022 ± 0.012 | 2.248 ± 0.012 | 4.308 ± 0.022 | 1.018 ± 0.008 | 1.701 ± 0.01 | 2.065 ± 0.012 | 1.457 ± 0.01 | 2.072 ± 0.011 | 4.092 ± 0.018 | 2.2 ± 0.013 | 2.859 ± 0.016 | 0.56 ± 0.006 | 1.268 ± 0.009 | 0.0 ± 0.0
Gly | 3.643 ± 0.02 | 1.264 ± 0.011 | 3.593 ± 0.016 | 3.99 ± 0.018 | 3.236 ± 0.015 | 5.966 ± 0.046 | 1.45 ± 0.012 | 3.231 ± 0.016 | 4.152 ± 0.019 | 5.757 ± 0.021 | 1.468 ± 0.011 | 2.971 ± 0.016 | 2.315 ± 0.014 | 1.958 ± 0.012 | 3.603 ± 0.017 | 5.958 ± 0.025 | 3.211 ± 0.016 | 4.312 ± 0.018 | 0.854 ± 0.008 | 2.124 ± 0.013 | 0.0 ± 0.0
His | 1.238 ± 0.009 | 0.506 ± 0.006 | 1.125 ± 0.009 | 1.302 ± 0.01 | 0.97 ± 0.008 | 1.667 ± 0.012 | 1.061 ± 0.012 | 1.117 ± 0.009 | 1.206 ± 0.009 | 2.275 ± 0.012 | 0.58 ± 0.006 | 0.956 ± 0.008 | 1.294 ± 0.01 | 1.045 ± 0.01 | 1.407 ± 0.01 | 1.811 ± 0.011 | 0.965 ± 0.008 | 1.577 ± 0.009 | 0.294 ± 0.004 | 0.701 ± 0.007 | 0.0 ± 0.0
Ile | 3.323 ± 0.016 | 1.062 ± 0.008 | 2.784 ± 0.014 | 2.997 ± 0.016 | 2.138 ± 0.012 | 3.219 ± 0.017 | 1.23 ± 0.009 | 2.587 ± 0.013 | 2.917 ± 0.012 | 4.743 ± 0.019 | 1.124 ± 0.008 | 2.109 ± 0.012 | 2.751 ± 0.016 | 1.837 ± 0.011 | 2.602 ± 0.014 | 4.616 ± 0.018 | 2.699 ± 0.016 | 3.387 ± 0.016 | 0.654 ± 0.007 | 1.442 ± 0.01 | 0.0 ± 0.0
Lys | 4.12 ± 0.02 | 0.964 ± 0.009 | 3.278 ± 0.017 | 4.909 ± 0.03 | 2.079 ± 0.012 | 3.599 ± 0.019 | 1.368 ± 0.009 | 3.269 ± 0.015 | 5.413 ± 0.028 | 6.079 ± 0.024 | 1.577 ± 0.01 | 2.641 ± 0.013 | 3.037 ± 0.016 | 2.292 ± 0.014 | 3.955 ± 0.019 | 4.765 ± 0.02 | 3.422 ± 0.015 | 3.869 ± 0.017 | 0.825 ± 0.007 | 1.566 ± 0.011 | 0.0 ± 0.0
Leu | 6.208 ± 0.024 | 1.835 ± 0.012 | 4.926 ± 0.021 | 6.16 ± 0.029 | 3.843 ± 0.02 | 5.582 ± 0.02 | 2.399 ± 0.013 | 4.42 ± 0.019 | 6.099 ± 0.025 | 9.528 ± 0.037 | 2.224 ± 0.014 | 3.728 ± 0.019 | 5.019 ± 0.023 | 3.794 ± 0.016 | 5.447 ± 0.017 | 8.556 ± 0.029 | 4.688 ± 0.019 | 6.521 ± 0.025 | 1.139 ± 0.009 | 2.45 ± 0.012 | 0.0 ± 0.0
Met | 2.063 ± 0.012 | 0.349 ± 0.005 | 1.434 ± 0.009 | 2.042 ± 0.013 | 0.963 ± 0.009 | 1.576 ± 0.01 | 0.478 ± 0.007 | 1.301 ± 0.009 | 1.719 ± 0.011 | 2.09 ± 0.011 | 0.815 ± 0.009 | 1.076 ± 0.009 | 0.96 ± 0.008 | 0.819 ± 0.009 | 1.289 ± 0.008 | 2.002 ± 0.012 | 1.128 ± 0.008 | 1.767 ± 0.011 | 0.275 ± 0.004 | 0.623 ± 0.007 | 0.0 ± 0.0
Asn | 2.577 ± 0.014 | 0.764 ± 0.008 | 2.001 ± 0.012 | 2.406 ± 0.011 | 1.656 ± 0.01 | 3.219 ± 0.016 | 1.155 ± 0.009 | 2.301 ± 0.013 | 2.498 ± 0.014 | 4.28 ± 0.02 | 1.107 ± 0.007 | 2.494 ± 0.019 | 2.361 ± 0.013 | 1.698 ± 0.011 | 2.132 ± 0.012 | 3.51 ± 0.016 | 2.075 ± 0.012 | 2.876 ± 0.014 | 0.528 ± 0.006 | 1.213 ± 0.01 | 0.0 ± 0.0
Pro | 2.883 ± 0.017 | 0.779 ± 0.008 | 2.425 ± 0.012 | 3.373 ± 0.019 | 1.953 ± 0.012 | 2.719 ± 0.015 | 1.074 ± 0.008 | 2.105 ± 0.011 | 2.836 ± 0.017 | 4.265 ± 0.017 | 1.012 ± 0.009 | 2.168 ± 0.013 | 4.369 ± 0.047 | 1.802 ± 0.013 | 2.613 ± 0.015 | 5.215 ± 0.023 | 2.61 ± 0.016 | 3.333 ± 0.018 | 0.618 ± 0.006 | 1.348 ± 0.012 | 0.0 ± 0.0
Gln | 2.249 ± 0.014 | 0.546 ± 0.006 | 1.651 ± 0.01 | 2.516 ± 0.016 | 1.285 ± 0.01 | 2.058 ± 0.014 | 0.836 ± 0.008 | 1.813 ± 0.01 | 2.141 ± 0.013 | 3.125 ± 0.015 | 0.933 ± 0.008 | 1.58 ± 0.011 | 1.717 ± 0.013 | 1.984 ± 0.022 | 2.185 ± 0.013 | 2.713 ± 0.013 | 1.878 ± 0.012 | 2.269 ± 0.011 | 0.437 ± 0.005 | 0.875 ± 0.008 | 0.0 ± 0.0
Arg | 3.12 ± 0.016 | 1.049 ± 0.008 | 2.819 ± 0.014 | 3.555 ± 0.018 | 2.494 ± 0.013 | 3.383 ± 0.02 | 1.24 ± 0.009 | 2.804 ± 0.011 | 3.863 ± 0.018 | 5.124 ± 0.018 | 1.321 ± 0.009 | 2.457 ± 0.013 | 2.375 ± 0.013 | 1.835 ± 0.011 | 4.324 ± 0.022 | 4.717 ± 0.022 | 2.551 ± 0.012 | 3.614 ± 0.014 | 0.762 ± 0.007 | 1.517 ± 0.011 | 0.0 ± 0.0
Ser | 5.126 ± 0.023 | 1.713 ± 0.012 | 4.609 ± 0.018 | 4.952 ± 0.02 | 4.136 ± 0.016 | 6.014 ± 0.026 | 2.036 ± 0.012 | 4.073 ± 0.019 | 5.035 ± 0.022 | 9.025 ± 0.032 | 2.053 ± 0.012 | 3.851 ± 0.018 | 4.962 ± 0.03 | 2.954 ± 0.014 | 4.7 ± 0.023 | 11.994 ± 0.056 | 4.573 ± 0.02 | 5.645 ± 0.02 | 1.143 ± 0.01 | 2.449 ± 0.013 | 0.0 ± 0.0
Thr | 3.425 ± 0.016 | 1.021 ± 0.009 | 2.396 ± 0.013 | 3.128 ± 0.015 | 2.111 ± 0.011 | 3.459 ± 0.017 | 1.079 ± 0.008 | 2.693 ± 0.012 | 2.999 ± 0.013 | 4.782 ± 0.019 | 1.262 ± 0.009 | 2.195 ± 0.012 | 2.709 ± 0.016 | 1.604 ± 0.011 | 2.671 ± 0.013 | 4.829 ± 0.02 | 3.4 ± 0.018 | 3.699 ± 0.016 | 0.696 ± 0.006 | 1.414 ± 0.01 | 0.0 ± 0.0
Val | 4.809 ± 0.022 | 1.201 ± 0.009 | 3.822 ± 0.016 | 4.632 ± 0.022 | 2.928 ± 0.015 | 3.982 ± 0.017 | 1.458 ± 0.01 | 3.456 ± 0.015 | 4.297 ± 0.019 | 6.367 ± 0.023 | 1.73 ± 0.011 | 2.694 ± 0.013 | 3.264 ± 0.016 | 2.118 ± 0.012 | 3.251 ± 0.016 | 5.99 ± 0.024 | 3.634 ± 0.015 | 5.276 ± 0.024 | 0.807 ± 0.007 | 2.079 ± 0.013 | 0.0 ± 0.0
Trp | 0.694 ± 0.007 | 0.241 ± 0.004 | 0.689 ± 0.007 | 0.786 ± 0.008 | 0.591 ± 0.006 | 0.72 ± 0.009 | 0.25 ± 0.004 | 0.721 ± 0.008 | 0.903 ± 0.008 | 1.189 ± 0.009 | 0.339 ± 0.005 | 0.673 ± 0.006 | 0.47 ± 0.005 | 0.388 ± 0.004 | 0.905 ± 0.008 | 1.041 ± 0.008 | 0.654 ± 0.007 | 0.783 ± 0.006 | 0.239 ± 0.005 | 0.346 ± 0.005 | 0.0 ± 0.0
Tyr | 1.739 ± 0.013 | 0.594 ± 0.007 | 1.568 ± 0.011 | 1.634 ± 0.011 | 1.285 ± 0.008 | 2.147 ± 0.013 | 0.717 ± 0.007 | 1.427 ± 0.009 | 1.611 ± 0.01 | 2.699 ± 0.014 | 0.776 ± 0.007 | 1.304 ± 0.009 | 1.275 ± 0.011 | 0.917 ± 0.007 | 1.461 ± 0.01 | 2.251 ± 0.012 | 1.358 ± 0.01 | 1.842 ± 0.011 | 0.408 ± 0.005 | 1.029 ± 0.012 | 0.0 ± 0.0
Xaa | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.0 ± 0.0 | 0.003 ± 0.001

Statistics based on 42060 proteins (16253173 amino acids)

Note: The error was estimated by bootstrapping (100 replicates) at the protein level.
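A protein-level bootstrap of this kind could look roughly like the Python sketch below. It assumes that whole proteins are resampled with replacement, that the frequency is recomputed for each of 100 replicates, and that the reported error is the standard deviation across replicates; the page does not state the exact estimator, and the function names `pair_counts` and `bootstrap_error` as well as the toy sequences are hypothetical.

```python
import random
import statistics
from collections import Counter


def pair_counts(seq):
    """Count overlapping adjacent residue pairs in one protein."""
    return Counter(seq[i:i + 2] for i in range(len(seq) - 1))


def bootstrap_error(sequences, pair, n_boot=100, seed=0):
    """Return (mean, standard deviation) of the per-mille frequency of `pair`
    over n_boot resamples of whole proteins drawn with replacement."""
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    per_protein = [pair_counts(s) for s in sequences]
    estimates = []
    for _ in range(n_boot):
        # Resample proteins, not individual residues or pairs.
        sample = rng.choices(per_protein, k=len(per_protein))
        total = sum(sum(c.values()) for c in sample)
        hits = sum(c[pair] for c in sample)
        estimates.append(1000.0 * hits / total if total else 0.0)
    return statistics.mean(estimates), statistics.stdev(estimates)


# Toy input (not real proteome data).
toy_proteins = ["MAAL", "MAS", "LLKMA", "MSSL"]
mean_ma, err_ma = bootstrap_error(toy_proteins, "MA")
print(f"MA: {mean_ma:.1f} ± {err_ma:.1f} per mille")
```

Resampling at the protein level keeps the pairs from one protein together, so the error reflects variation between proteins rather than treating every dipeptide as an independent observation.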

The dipeptide statistics above (along with other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski