Amino acid dipeptide frequency for Cucumis melo var. makuwa

Apart from single amino acid frequencies, one can also calculate the so-called amino acid dipeptide frequency.
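As a rough illustration of how such frequencies can be obtained, the sketch below counts overlapping residue pairs within each protein and reports them in per mille. The function name `dipeptide_frequencies`, the one-letter input format, and the choice to count overlapping pairs and normalise by the total pair count are assumptions made for this example; the page does not describe the exact procedure used by Proteome-pI.

```python
from collections import Counter
from itertools import product

# The 20 standard amino acids in one-letter code; anything else is treated as Xaa ('X').
STANDARD = "ACDEFGHIKLMNPQRSTVWY"


def dipeptide_frequencies(sequences):
    """Return {dipeptide: frequency in per mille} for all 441 pairs.

    Assumption: pairs are overlapping windows of length 2 and never span
    two different proteins.
    """
    counts = Counter()
    for seq in sequences:
        # Map non-standard or unknown residues to 'X'.
        cleaned = "".join(c if c in STANDARD else "X" for c in seq.upper())
        counts.update(cleaned[i:i + 2] for i in range(len(cleaned) - 1))

    total = sum(counts.values())
    if total == 0:
        return {}

    alphabet = STANDARD + "X"
    return {
        a + b: 1000.0 * counts[a + b] / total
        for a, b in product(alphabet, repeat=2)
    }


if __name__ == "__main__":
    freqs = dipeptide_frequencies(["MKLLSS", "MSSLB"])
    for pair in ("LL", "SS", "LX"):
        print(pair, round(freqs[pair], 3))
```

Counting within each protein keeps the last residue of one sequence from pairing with the first residue of the next, which would otherwise introduce pairs that do not exist in any protein.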

There are 400 possibilities (441 if we consider Xaa as an additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if dipeptides occurred randomly in proteins (with equal probability), each of the 400 amino acid dipeptides would be present at a frequency of 0.25% (2.5‰). As this is not the case in nature, for better visibility all dipeptides that are more frequent than expected are marked in red, and those that are underrepresented are marked in blue in the table.

All values are given in per mille (‰); to obtain fractions, multiply them by 10⁻³.
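The arithmetic behind the expected value and the unit conversion can be sketched as follows. The helper names are hypothetical, and using the uniform expectation as the exact threshold for the red/blue marking is an assumption based on the description above.

```python
def expected_permille(n_residues=20):
    """Uniform expectation for one dipeptide, in per mille (1000 / n^2)."""
    return 1000.0 / n_residues ** 2


def permille_to_fraction(value_permille):
    """Convert a table value in per mille to a plain fraction."""
    return value_permille * 1e-3


def classify(value_permille, expected=None):
    """Label a value relative to the uniform expectation (assumed threshold)."""
    if expected is None:
        expected = expected_permille()
    if value_permille > expected:
        return "over"       # would be marked in red
    if value_permille < expected:
        return "under"      # would be marked in blue
    return "expected"


if __name__ == "__main__":
    print(expected_permille())        # 400 possible dipeptides
    print(expected_permille(21))      # 441 possible dipeptides, with Xaa
    # Values taken from the table: SerSer, LeuLeu, TrpCys.
    print(permille_to_fraction(10.237))
    print(classify(8.825), classify(0.187))
```

With 20 residues the expectation is 2.5‰, so LeuLeu (8.825‰) falls above it and TrpCys (0.187‰) falls below it.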

For more information, see the sequence space article on Wikipedia.

Ala Cys Asp Glu Phe Gly His Ile Lys Leu Met Asn Pro Gln Arg Ser Thr Val Trp Tyr Xaa
Ala
AlaAla: 4.88 ± 0.029
AlaCys: 1.169 ± 0.01
AlaAsp: 2.943 ± 0.015
AlaGlu: 3.859 ± 0.017
AlaPhe: 2.593 ± 0.016
AlaGly: 3.366 ± 0.021
AlaHis: 1.372 ± 0.01
AlaIle: 3.615 ± 0.019
AlaLys: 3.764 ± 0.018
AlaLeu: 5.995 ± 0.026
AlaMet: 1.729 ± 0.013
AlaAsn: 2.538 ± 0.015
AlaPro: 3.089 ± 0.023
AlaGln: 2.075 ± 0.012
AlaArg: 3.348 ± 0.017
AlaSer: 5.592 ± 0.023
AlaThr: 3.554 ± 0.015
AlaVal: 4.163 ± 0.022
AlaTrp: 0.738 ± 0.008
AlaTyr: 1.771 ± 0.012
AlaXaa: 0.0 ± 0.0
Cys
CysAla: 1.053 ± 0.012
CysCys: 0.47 ± 0.006
CysAsp: 0.829 ± 0.009
CysGlu: 1.023 ± 0.009
CysPhe: 0.941 ± 0.008
CysGly: 1.217 ± 0.011
CysHis: 0.521 ± 0.007
CysIle: 1.027 ± 0.009
CysLys: 1.158 ± 0.011
CysLeu: 1.787 ± 0.012
CysMet: 0.425 ± 0.006
CysAsn: 0.749 ± 0.008
CysPro: 0.973 ± 0.009
CysGln: 0.647 ± 0.008
CysArg: 1.116 ± 0.01
CysSer: 1.637 ± 0.012
CysThr: 0.975 ± 0.011
CysVal: 1.133 ± 0.009
CysTrp: 0.275 ± 0.005
CysTyr: 0.535 ± 0.007
CysXaa: 0.0 ± 0.0
Asp
AspAla: 3.245 ± 0.017
AspCys: 1.024 ± 0.009
AspAsp: 3.558 ± 0.022
AspGlu: 3.978 ± 0.021
AspPhe: 2.554 ± 0.015
AspGly: 3.427 ± 0.017
AspHis: 1.261 ± 0.01
AspIle: 3.041 ± 0.015
AspLys: 2.716 ± 0.013
AspLeu: 5.12 ± 0.02
AspMet: 1.561 ± 0.012
AspAsn: 2.076 ± 0.014
AspPro: 2.583 ± 0.016
AspGln: 1.877 ± 0.012
AspArg: 2.769 ± 0.015
AspSer: 4.104 ± 0.018
AspThr: 2.09 ± 0.013
AspVal: 3.947 ± 0.018
AspTrp: 0.815 ± 0.007
AspTyr: 1.583 ± 0.012
AspXaa: 0.0 ± 0.0
Glu
GluAla: 4.745 ± 0.018
GluCys: 0.987 ± 0.008
GluAsp: 4.021 ± 0.017
GluGlu: 6.124 ± 0.032
GluPhe: 2.762 ± 0.014
GluGly: 3.809 ± 0.023
GluHis: 1.349 ± 0.009
GluIle: 4.042 ± 0.02
GluLys: 4.867 ± 0.027
GluLeu: 6.293 ± 0.025
GluMet: 1.842 ± 0.014
GluAsn: 3.081 ± 0.015
GluPro: 2.072 ± 0.011
GluGln: 2.377 ± 0.016
GluArg: 3.55 ± 0.019
GluSer: 4.192 ± 0.025
GluThr: 3.073 ± 0.016
GluVal: 4.714 ± 0.022
GluTrp: 0.78 ± 0.008
GluTyr: 1.619 ± 0.012
GluXaa: 0.0 ± 0.0
Phe
PheAla: 2.217 ± 0.014
PheCys: 0.878 ± 0.008
PheAsp: 2.602 ± 0.013
PheGlu: 2.756 ± 0.014
PhePhe: 1.924 ± 0.014
PheGly: 2.89 ± 0.019
PheHis: 1.222 ± 0.01
PheIle: 2.419 ± 0.014
PheLys: 2.409 ± 0.014
PheLeu: 4.439 ± 0.022
PheMet: 0.987 ± 0.009
PheAsn: 1.832 ± 0.014
PhePro: 1.969 ± 0.013
PheGln: 1.663 ± 0.013
PheArg: 2.25 ± 0.017
PheSer: 3.826 ± 0.018
PheThr: 2.125 ± 0.012
PheVal: 3.165 ± 0.019
PheTrp: 0.603 ± 0.006
PheTyr: 1.19 ± 0.009
PheXaa: 0.0 ± 0.0
Gly
GlyAla: 3.361 ± 0.021
GlyCys: 1.351 ± 0.013
GlyAsp: 3.057 ± 0.016
GlyGlu: 3.351 ± 0.016
GlyPhe: 2.799 ± 0.016
GlyGly: 4.284 ± 0.036
GlyHis: 1.69 ± 0.012
GlyIle: 3.358 ± 0.016
GlyLys: 4.172 ± 0.02
GlyLeu: 5.511 ± 0.021
GlyMet: 1.462 ± 0.01
GlyAsn: 2.612 ± 0.014
GlyPro: 2.15 ± 0.015
GlyGln: 1.878 ± 0.011
GlyArg: 3.686 ± 0.018
GlySer: 5.475 ± 0.024
GlyThr: 3.165 ± 0.015
GlyVal: 3.855 ± 0.018
GlyTrp: 0.86 ± 0.008
GlyTyr: 2.225 ± 0.014
GlyXaa: 0.001 ± 0.0
His
HisAla: 1.308 ± 0.009
HisCys: 0.469 ± 0.006
HisAsp: 1.174 ± 0.009
HisGlu: 1.36 ± 0.009
HisPhe: 1.204 ± 0.01
HisGly: 1.497 ± 0.011
HisHis: 0.865 ± 0.011
HisIle: 1.28 ± 0.01
HisLys: 1.223 ± 0.009
HisLeu: 2.631 ± 0.013
HisMet: 0.594 ± 0.006
HisAsn: 0.962 ± 0.008
HisPro: 1.462 ± 0.01
HisGln: 1.133 ± 0.01
HisArg: 1.394 ± 0.012
HisSer: 1.93 ± 0.013
HisThr: 1.027 ± 0.009
HisVal: 1.817 ± 0.012
HisTrp: 0.364 ± 0.005
HisTyr: 0.82 ± 0.008
HisXaa: 0.0 ± 0.0
Ile
IleAla: 3.546 ± 0.017
IleCys: 1.25 ± 0.012
IleAsp: 3.388 ± 0.016
IleGlu: 3.776 ± 0.018
IlePhe: 2.249 ± 0.014
IleGly: 3.181 ± 0.018
IleHis: 1.341 ± 0.01
IleIle: 2.667 ± 0.019
IleLys: 3.096 ± 0.016
IleLeu: 5.364 ± 0.021
IleMet: 1.101 ± 0.01
IleAsn: 2.204 ± 0.013
IlePro: 2.925 ± 0.019
IleGln: 1.981 ± 0.012
IleArg: 2.88 ± 0.013
IleSer: 4.771 ± 0.021
IleThr: 2.68 ± 0.015
IleVal: 3.7 ± 0.016
IleTrp: 0.742 ± 0.007
IleTyr: 1.482 ± 0.011
IleXaa: 0.0 ± 0.0
Lys
LysAla: 4.04 ± 0.019
LysCys: 1.194 ± 0.009
LysAsp: 3.454 ± 0.019
LysGlu: 5.135 ± 0.025
LysPhe: 2.52 ± 0.014
LysGly: 4.211 ± 0.02
LysHis: 1.414 ± 0.01
LysIle: 3.513 ± 0.018
LysLys: 5.266 ± 0.03
LysLeu: 5.828 ± 0.026
LysMet: 1.611 ± 0.011
LysAsn: 2.937 ± 0.017
LysPro: 2.589 ± 0.016
LysGln: 2.38 ± 0.013
LysArg: 3.681 ± 0.019
LysSer: 4.709 ± 0.022
LysThr: 3.231 ± 0.017
LysVal: 4.347 ± 0.023
LysTrp: 0.966 ± 0.009
LysTyr: 2.009 ± 0.013
LysXaa: 0.001 ± 0.0
Leu
LeuAla: 5.957 ± 0.025
LeuCys: 1.864 ± 0.012
LeuAsp: 4.834 ± 0.021
LeuGlu: 6.551 ± 0.026
LeuPhe: 3.64 ± 0.018
LeuGly: 5.55 ± 0.02
LeuHis: 2.53 ± 0.014
LeuIle: 4.673 ± 0.022
LeuLys: 6.806 ± 0.026
LeuLeu: 8.825 ± 0.035
LeuMet: 2.293 ± 0.012
LeuAsn: 4.037 ± 0.018
LeuPro: 4.957 ± 0.02
LeuGln: 4.094 ± 0.021
LeuArg: 5.842 ± 0.025
LeuSer: 8.153 ± 0.028
LeuThr: 4.633 ± 0.02
LeuVal: 6.148 ± 0.023
LeuTrp: 1.148 ± 0.01
LeuTyr: 2.514 ± 0.016
LeuXaa: 0.001 ± 0.0
Met
MetAla: 1.876 ± 0.011
MetCys: 0.361 ± 0.006
MetAsp: 1.621 ± 0.01
MetGlu: 2.053 ± 0.013
MetPhe: 0.903 ± 0.008
MetGly: 1.356 ± 0.011
MetHis: 0.576 ± 0.007
MetIle: 1.247 ± 0.011
MetLys: 1.947 ± 0.012
MetLeu: 2.374 ± 0.013
MetMet: 0.695 ± 0.008
MetAsn: 1.161 ± 0.011
MetPro: 1.043 ± 0.009
MetGln: 0.909 ± 0.008
MetArg: 1.33 ± 0.009
MetSer: 1.823 ± 0.011
MetThr: 1.251 ± 0.01
MetVal: 1.573 ± 0.011
MetTrp: 0.266 ± 0.005
MetTyr: 0.64 ± 0.007
MetXaa: 0.0 ± 0.0
Asn
AsnAla: 2.518 ± 0.014
AsnCys: 0.797 ± 0.008
AsnAsp: 2.32 ± 0.015
AsnGlu: 2.652 ± 0.016
AsnPhe: 1.891 ± 0.014
AsnGly: 2.899 ± 0.019
AsnHis: 1.165 ± 0.009
AsnIle: 2.335 ± 0.015
AsnLys: 2.532 ± 0.014
AsnLeu: 4.3 ± 0.022
AsnMet: 1.139 ± 0.009
AsnAsn: 2.243 ± 0.019
AsnPro: 2.224 ± 0.015
AsnGln: 1.742 ± 0.012
AsnArg: 2.222 ± 0.013
AsnSer: 3.647 ± 0.021
AsnThr: 1.85 ± 0.01
AsnVal: 2.953 ± 0.015
AsnTrp: 0.592 ± 0.007
AsnTyr: 1.249 ± 0.01
AsnXaa: 0.0 ± 0.0
Pro
ProAla: 2.626 ± 0.017
ProCys: 0.675 ± 0.007
ProAsp: 2.276 ± 0.014
ProGlu: 2.872 ± 0.017
ProPhe: 2.141 ± 0.013
ProGly: 2.246 ± 0.016
ProHis: 1.107 ± 0.009
ProIle: 2.685 ± 0.016
ProLys: 2.896 ± 0.015
ProLeu: 4.35 ± 0.018
ProMet: 1.061 ± 0.01
ProAsn: 2.129 ± 0.014
ProPro: 3.632 ± 0.033
ProGln: 1.723 ± 0.013
ProArg: 2.519 ± 0.016
ProSer: 5.12 ± 0.025
ProThr: 3.005 ± 0.015
ProVal: 3.142 ± 0.017
ProTrp: 0.596 ± 0.007
ProTyr: 1.418 ± 0.011
ProXaa: 0.001 ± 0.0
Gln
GlnAla: 2.359 ± 0.013
GlnCys: 0.568 ± 0.007
GlnAsp: 1.548 ± 0.01
GlnGlu: 2.374 ± 0.015
GlnPhe: 1.438 ± 0.011
GlnGly: 2.087 ± 0.015
GlnHis: 0.806 ± 0.009
GlnIle: 1.975 ± 0.011
GlnLys: 2.58 ± 0.016
GlnLeu: 3.756 ± 0.021
GlnMet: 0.996 ± 0.009
GlnAsn: 1.74 ± 0.013
GlnPro: 2.044 ± 0.015
GlnGln: 1.938 ± 0.021
GlnArg: 2.325 ± 0.015
GlnSer: 3.01 ± 0.019
GlnThr: 1.94 ± 0.012
GlnVal: 2.241 ± 0.013
GlnTrp: 0.456 ± 0.006
GlnTyr: 0.934 ± 0.008
GlnXaa: 0.0 ± 0.0
Arg
ArgAla: 3.495 ± 0.018
ArgCys: 1.08 ± 0.009
ArgAsp: 2.725 ± 0.017
ArgGlu: 3.63 ± 0.017
ArgPhe: 2.381 ± 0.013
ArgGly: 3.535 ± 0.021
ArgHis: 1.435 ± 0.011
ArgIle: 3.034 ± 0.013
ArgLys: 4.248 ± 0.021
ArgLeu: 5.549 ± 0.022
ArgMet: 1.577 ± 0.013
ArgAsn: 2.469 ± 0.013
ArgPro: 2.614 ± 0.017
ArgGln: 2.092 ± 0.014
ArgArg: 4.305 ± 0.022
ArgSer: 4.629 ± 0.024
ArgThr: 2.722 ± 0.014
ArgVal: 3.283 ± 0.016
ArgTrp: 0.849 ± 0.007
ArgTyr: 1.609 ± 0.011
ArgXaa: 0.0 ± 0.0
Ser
SerAla: 4.603 ± 0.023
SerCys: 1.475 ± 0.011
SerAsp: 4.281 ± 0.018
SerGlu: 4.367 ± 0.023
SerPhe: 4.216 ± 0.022
SerGly: 5.105 ± 0.025
SerHis: 2.071 ± 0.014
SerIle: 4.641 ± 0.021
SerLys: 5.419 ± 0.022
SerLeu: 8.027 ± 0.031
SerMet: 2.141 ± 0.013
SerAsn: 3.755 ± 0.019
SerPro: 4.128 ± 0.026
SerGln: 3.094 ± 0.018
SerArg: 4.895 ± 0.02
SerSer: 10.237 ± 0.05
SerThr: 5.232 ± 0.022
SerVal: 4.978 ± 0.019
SerTrp: 1.149 ± 0.009
SerTyr: 2.346 ± 0.014
SerXaa: 0.001 ± 0.0
Thr
ThrAla: 3.235 ± 0.019
ThrCys: 0.879 ± 0.008
ThrAsp: 2.355 ± 0.014
ThrGlu: 2.846 ± 0.014
ThrPhe: 2.37 ± 0.013
ThrGly: 2.735 ± 0.015
ThrHis: 1.22 ± 0.009
ThrIle: 3.012 ± 0.017
ThrLys: 3.12 ± 0.016
ThrLeu: 4.713 ± 0.018
ThrMet: 1.274 ± 0.009
ThrAsn: 2.314 ± 0.015
ThrPro: 2.741 ± 0.015
ThrGln: 1.596 ± 0.011
ThrArg: 2.986 ± 0.022
ThrSer: 4.632 ± 0.019
ThrThr: 3.388 ± 0.021
ThrVal: 3.368 ± 0.016
ThrTrp: 0.79 ± 0.007
ThrTyr: 1.537 ± 0.014
ThrXaa: 0.0 ± 0.0
Val
ValAla: 4.487 ± 0.018
ValCys: 1.188 ± 0.008
ValAsp: 4.193 ± 0.017
ValGlu: 4.971 ± 0.023
ValPhe: 3.07 ± 0.016
ValGly: 4.064 ± 0.019
ValHis: 1.446 ± 0.009
ValIle: 3.515 ± 0.018
ValLys: 4.039 ± 0.016
ValLeu: 6.221 ± 0.02
ValMet: 1.493 ± 0.011
ValAsn: 2.461 ± 0.014
ValPro: 3.197 ± 0.017
ValGln: 2.357 ± 0.013
ValArg: 3.468 ± 0.017
ValSer: 5.47 ± 0.022
ValThr: 3.157 ± 0.016
ValVal: 5.503 ± 0.022
ValTrp: 0.805 ± 0.007
ValTyr: 1.773 ± 0.012
ValXaa: 0.0 ± 0.0
Trp
TrpAla: 0.749 ± 0.008
TrpCys: 0.187 ± 0.004
TrpAsp: 0.682 ± 0.007
TrpGlu: 0.939 ± 0.008
TrpPhe: 0.467 ± 0.006
TrpGly: 0.804 ± 0.008
TrpHis: 0.298 ± 0.005
TrpIle: 0.749 ± 0.007
TrpLys: 1.219 ± 0.01
TrpLeu: 1.293 ± 0.01
TrpMet: 0.3 ± 0.004
TrpAsn: 0.669 ± 0.007
TrpPro: 0.565 ± 0.007
TrpGln: 0.41 ± 0.005
TrpArg: 1.026 ± 0.009
TrpSer: 0.983 ± 0.009
TrpThr: 0.689 ± 0.007
TrpVal: 0.844 ± 0.008
TrpTrp: 0.296 ± 0.004
TrpTyr: 0.346 ± 0.005
TrpXaa: 0.0 ± 0.0
Tyr
TyrAla: 1.854 ± 0.013
TyrCys: 0.626 ± 0.007
TyrAsp: 1.468 ± 0.011
TyrGlu: 1.682 ± 0.012
TyrPhe: 1.3 ± 0.012
TyrGly: 2.038 ± 0.012
TyrHis: 0.828 ± 0.007
TyrIle: 1.496 ± 0.013
TyrLys: 1.557 ± 0.012
TyrLeu: 2.674 ± 0.014
TyrMet: 0.722 ± 0.007
TyrAsn: 1.305 ± 0.011
TyrPro: 1.259 ± 0.01
TyrGln: 1.15 ± 0.013
TyrArg: 1.668 ± 0.01
TyrSer: 2.222 ± 0.014
TyrThr: 1.323 ± 0.01
TyrVal: 2.003 ± 0.015
TyrTrp: 0.424 ± 0.005
TyrTyr: 0.971 ± 0.01
TyrXaa: 0.0 ± 0.0
Xaa
XaaAla: 0.0 ± 0.0
XaaCys: 0.0 ± 0.0
XaaAsp: 0.0 ± 0.0
XaaGlu: 0.0 ± 0.0
XaaPhe: 0.0 ± 0.0
XaaGly: 0.001 ± 0.0
XaaHis: 0.0 ± 0.0
XaaIle: 0.0 ± 0.0
XaaLys: 0.0 ± 0.0
XaaLeu: 0.0 ± 0.0
XaaMet: 0.0 ± 0.0
XaaAsn: 0.0 ± 0.0
XaaPro: 0.001 ± 0.0
XaaGln: 0.0 ± 0.0
XaaArg: 0.0 ± 0.0
XaaSer: 0.0 ± 0.0
XaaThr: 0.0 ± 0.0
XaaVal: 0.0 ± 0.0
XaaTrp: 0.0 ± 0.0
XaaTyr: 0.0 ± 0.0
XaaXaa: 0.025 ± 0.007
Statistics based on 37791 proteins (14011924 amino acids)

Note: The error has been estimated by bootstrapping (100×) at the protein level.
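A minimal sketch of what a protein-level bootstrap could look like is given below: whole proteins are resampled with replacement, the frequency is recomputed for each resample, and the spread of the resampled estimates is taken as the error. Reporting the standard deviation, the function names, and the fixed seed are assumptions for illustration; the page states only that bootstrapping was repeated 100 times at the protein level.

```python
import random
import statistics


def dipeptide_permille(sequences, dipeptide):
    """Frequency of one dipeptide in per mille over overlapping pairs."""
    hits = total = 0
    for seq in sequences:
        for i in range(len(seq) - 1):
            total += 1
            if seq[i:i + 2] == dipeptide:
                hits += 1
    return 1000.0 * hits / total if total else 0.0


def bootstrap_error(sequences, dipeptide, n_resamples=100, seed=0):
    """Standard deviation of the frequency across protein-level resamples.

    Assumption: the reported error is the standard deviation of the
    bootstrap estimates; the source does not say which statistic is used.
    """
    rng = random.Random(seed)
    estimates = []
    for _ in range(n_resamples):
        # Resample whole proteins with replacement, not individual residues.
        sample = rng.choices(sequences, k=len(sequences))
        estimates.append(dipeptide_permille(sample, dipeptide))
    return statistics.stdev(estimates)


if __name__ == "__main__":
    proteins = ["MSSLLK", "MKLSSA", "MAAGLL", "MSSSSV"]
    print(dipeptide_permille(proteins, "SS"))
    print(bootstrap_error(proteins, "SS"))
```

Resampling proteins rather than residues keeps each sequence intact, so correlations between neighbouring residues within a protein are preserved in every resample.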

The dipeptide statistics above (among other statistics for this proteome) can be downloaded from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under the Creative Commons Attribution-NoDerivs license; for more details, see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944
Contact: Lukasz P. Kozlowski