Amino acid dipepetide frequency for Arabis alpina (Alpine rock-cress)

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
5.936AlaAla: 5.936 ± 0.037
1.099AlaCys: 1.099 ± 0.012
3.16AlaAsp: 3.16 ± 0.018
4.447AlaGlu: 4.447 ± 0.025
2.625AlaPhe: 2.625 ± 0.021
3.971AlaGly: 3.971 ± 0.026
1.178AlaHis: 1.178 ± 0.012
3.533AlaIle: 3.533 ± 0.021
4.229AlaLys: 4.229 ± 0.027
6.268AlaLeu: 6.268 ± 0.033
1.768AlaMet: 1.768 ± 0.016
2.519AlaAsn: 2.519 ± 0.018
2.814AlaPro: 2.814 ± 0.022
1.939AlaGln: 1.939 ± 0.017
3.553AlaArg: 3.553 ± 0.024
5.916AlaSer: 5.916 ± 0.031
3.712AlaThr: 3.712 ± 0.021
4.619AlaVal: 4.619 ± 0.026
0.716AlaTrp: 0.716 ± 0.009
1.716AlaTyr: 1.716 ± 0.013
0.004AlaXaa: 0.004 ± 0.001
Cys
0.896CysAla: 0.896 ± 0.01
0.472CysCys: 0.472 ± 0.009
0.885CysAsp: 0.885 ± 0.011
0.862CysGlu: 0.862 ± 0.01
0.891CysPhe: 0.891 ± 0.01
1.35CysGly: 1.35 ± 0.014
0.405CysHis: 0.405 ± 0.006
0.893CysIle: 0.893 ± 0.01
1.061CysLys: 1.061 ± 0.014
1.806CysLeu: 1.806 ± 0.015
0.375CysMet: 0.375 ± 0.006
0.753CysAsn: 0.753 ± 0.01
0.825CysPro: 0.825 ± 0.011
0.495CysGln: 0.495 ± 0.008
0.977CysArg: 0.977 ± 0.011
1.706CysSer: 1.706 ± 0.017
0.776CysThr: 0.776 ± 0.01
1.144CysVal: 1.144 ± 0.012
0.21CysTrp: 0.21 ± 0.005
0.529CysTyr: 0.529 ± 0.008
0.001CysXaa: 0.001 ± 0.0
Asp
3.433AspAla: 3.433 ± 0.022
0.928AspCys: 0.928 ± 0.01
4.329AspAsp: 4.329 ± 0.048
4.569AspGlu: 4.569 ± 0.03
2.477AspPhe: 2.477 ± 0.017
3.859AspGly: 3.859 ± 0.027
1.255AspHis: 1.255 ± 0.013
2.842AspIle: 2.842 ± 0.019
2.856AspLys: 2.856 ± 0.02
5.304AspLeu: 5.304 ± 0.03
1.371AspMet: 1.371 ± 0.014
2.01AspAsn: 2.01 ± 0.017
2.848AspPro: 2.848 ± 0.018
1.805AspGln: 1.805 ± 0.014
2.478AspArg: 2.478 ± 0.02
4.426AspSer: 4.426 ± 0.025
2.333AspThr: 2.333 ± 0.018
4.038AspVal: 4.038 ± 0.022
0.681AspTrp: 0.681 ± 0.009
1.593AspTyr: 1.593 ± 0.014
0.002AspXaa: 0.002 ± 0.0
Glu
5.117GluAla: 5.117 ± 0.029
0.874GluCys: 0.874 ± 0.011
4.476GluAsp: 4.476 ± 0.032
7.571GluGlu: 7.571 ± 0.055
2.569GluPhe: 2.569 ± 0.019
3.573GluGly: 3.573 ± 0.022
1.199GluHis: 1.199 ± 0.014
4.129GluIle: 4.129 ± 0.023
5.249GluLys: 5.249 ± 0.036
6.037GluLeu: 6.037 ± 0.033
1.887GluMet: 1.887 ± 0.017
3.056GluAsn: 3.056 ± 0.02
2.31GluPro: 2.31 ± 0.019
2.134GluGln: 2.134 ± 0.019
3.602GluArg: 3.602 ± 0.025
4.869GluSer: 4.869 ± 0.029
3.771GluThr: 3.771 ± 0.024
4.419GluVal: 4.419 ± 0.024
0.727GluTrp: 0.727 ± 0.01
1.663GluTyr: 1.663 ± 0.014
0.002GluXaa: 0.002 ± 0.0
Phe
2.547PheAla: 2.547 ± 0.019
0.83PheCys: 0.83 ± 0.009
2.437PheAsp: 2.437 ± 0.018
2.349PheGlu: 2.349 ± 0.019
1.971PhePhe: 1.971 ± 0.017
3.013PheGly: 3.013 ± 0.022
1.034PheHis: 1.034 ± 0.011
1.953PheIle: 1.953 ± 0.016
2.185PheLys: 2.185 ± 0.017
4.248PheLeu: 4.248 ± 0.026
0.991PheMet: 0.991 ± 0.012
1.629PheAsn: 1.629 ± 0.016
2.096PhePro: 2.096 ± 0.016
1.414PheGln: 1.414 ± 0.012
2.071PheArg: 2.071 ± 0.015
3.981PheSer: 3.981 ± 0.023
2.156PheThr: 2.156 ± 0.017
2.897PheVal: 2.897 ± 0.02
0.539PheTrp: 0.539 ± 0.008
1.232PheTyr: 1.232 ± 0.012
0.001PheXaa: 0.001 ± 0.0
Gly
3.568GlyAla: 3.568 ± 0.023
1.208GlyCys: 1.208 ± 0.014
3.671GlyAsp: 3.671 ± 0.021
4.041GlyGlu: 4.041 ± 0.029
3.207GlyPhe: 3.207 ± 0.021
5.694GlyGly: 5.694 ± 0.062
1.421GlyHis: 1.421 ± 0.015
3.53GlyIle: 3.53 ± 0.023
4.216GlyLys: 4.216 ± 0.027
5.844GlyLeu: 5.844 ± 0.03
1.44GlyMet: 1.44 ± 0.014
3.011GlyAsn: 3.011 ± 0.019
2.293GlyPro: 2.293 ± 0.017
1.913GlyGln: 1.913 ± 0.016
3.518GlyArg: 3.518 ± 0.022
5.877GlySer: 5.877 ± 0.033
3.23GlyThr: 3.23 ± 0.021
4.355GlyVal: 4.355 ± 0.024
0.815GlyTrp: 0.815 ± 0.011
2.115GlyTyr: 2.115 ± 0.019
0.004GlyXaa: 0.004 ± 0.001
His
1.165HisAla: 1.165 ± 0.013
0.452HisCys: 0.452 ± 0.006
1.097HisAsp: 1.097 ± 0.013
1.271HisGlu: 1.271 ± 0.013
0.955HisPhe: 0.955 ± 0.01
1.662HisGly: 1.662 ± 0.015
0.898HisHis: 0.898 ± 0.012
1.132HisIle: 1.132 ± 0.011
1.153HisLys: 1.153 ± 0.013
2.188HisLeu: 2.188 ± 0.016
0.516HisMet: 0.516 ± 0.008
0.914HisAsn: 0.914 ± 0.01
1.188HisPro: 1.188 ± 0.012
0.948HisGln: 0.948 ± 0.011
1.385HisArg: 1.385 ± 0.014
1.684HisSer: 1.684 ± 0.015
0.909HisThr: 0.909 ± 0.012
1.493HisVal: 1.493 ± 0.013
0.286HisTrp: 0.286 ± 0.006
0.655HisTyr: 0.655 ± 0.009
0.001HisXaa: 0.001 ± 0.0
Ile
3.538IleAla: 3.538 ± 0.022
1.011IleCys: 1.011 ± 0.011
3.075IleAsp: 3.075 ± 0.019
3.334IleGlu: 3.334 ± 0.021
2.118IlePhe: 2.118 ± 0.016
3.448IleGly: 3.448 ± 0.022
1.214IleHis: 1.214 ± 0.012
2.548IleIle: 2.548 ± 0.018
2.906IleLys: 2.906 ± 0.02
4.775IleLeu: 4.775 ± 0.027
1.091IleMet: 1.091 ± 0.012
2.121IleAsn: 2.121 ± 0.019
2.931IlePro: 2.931 ± 0.025
1.854IleGln: 1.854 ± 0.016
2.703IleArg: 2.703 ± 0.018
4.828IleSer: 4.828 ± 0.024
2.682IleThr: 2.682 ± 0.016
3.6IleVal: 3.6 ± 0.017
0.652IleTrp: 0.652 ± 0.008
1.407IleTyr: 1.407 ± 0.014
0.001IleXaa: 0.001 ± 0.0
Lys
4.244LysAla: 4.244 ± 0.026
0.969LysCys: 0.969 ± 0.011
3.337LysAsp: 3.337 ± 0.023
4.884LysGlu: 4.884 ± 0.033
2.145LysPhe: 2.145 ± 0.017
3.59LysGly: 3.59 ± 0.025
1.281LysHis: 1.281 ± 0.013
3.341LysIle: 3.341 ± 0.022
5.608LysLys: 5.608 ± 0.042
6.119LysLeu: 6.119 ± 0.026
1.568LysMet: 1.568 ± 0.015
2.689LysAsn: 2.689 ± 0.02
3.022LysPro: 3.022 ± 0.025
2.234LysGln: 2.234 ± 0.02
3.887LysArg: 3.887 ± 0.022
4.886LysSer: 4.886 ± 0.029
3.419LysThr: 3.419 ± 0.022
3.863LysVal: 3.863 ± 0.028
0.807LysTrp: 0.807 ± 0.01
1.533LysTyr: 1.533 ± 0.014
0.002LysXaa: 0.002 ± 0.0
Leu
6.384LeuAla: 6.384 ± 0.03
1.766LeuCys: 1.766 ± 0.017
5.013LeuAsp: 5.013 ± 0.024
6.366LeuGlu: 6.366 ± 0.034
3.698LeuPhe: 3.698 ± 0.025
5.645LeuGly: 5.645 ± 0.029
2.315LeuHis: 2.315 ± 0.017
4.484LeuIle: 4.484 ± 0.029
6.092LeuLys: 6.092 ± 0.03
9.455LeuLeu: 9.455 ± 0.048
2.178LeuMet: 2.178 ± 0.017
3.739LeuAsn: 3.739 ± 0.02
4.984LeuPro: 4.984 ± 0.024
3.744LeuGln: 3.744 ± 0.022
5.43LeuArg: 5.43 ± 0.026
8.378LeuSer: 8.378 ± 0.041
4.7LeuThr: 4.7 ± 0.027
6.512LeuVal: 6.512 ± 0.027
1.101LeuTrp: 1.101 ± 0.013
2.351LeuTyr: 2.351 ± 0.018
0.003LeuXaa: 0.003 ± 0.001
Met
2.02MetAla: 2.02 ± 0.015
0.324MetCys: 0.324 ± 0.005
1.356MetAsp: 1.356 ± 0.012
2.058MetGlu: 2.058 ± 0.016
0.886MetPhe: 0.886 ± 0.011
1.473MetGly: 1.473 ± 0.013
0.456MetHis: 0.456 ± 0.007
1.337MetIle: 1.337 ± 0.014
1.688MetLys: 1.688 ± 0.016
2.009MetLeu: 2.009 ± 0.015
0.737MetMet: 0.737 ± 0.011
1.006MetAsn: 1.006 ± 0.011
0.925MetPro: 0.925 ± 0.011
0.773MetGln: 0.773 ± 0.01
1.283MetArg: 1.283 ± 0.011
1.902MetSer: 1.902 ± 0.017
1.192MetThr: 1.192 ± 0.013
1.81MetVal: 1.81 ± 0.013
0.249MetTrp: 0.249 ± 0.006
0.581MetTyr: 0.581 ± 0.008
0.001MetXaa: 0.001 ± 0.0
Asn
2.484AsnAla: 2.484 ± 0.017
0.728AsnCys: 0.728 ± 0.008
2.083AsnAsp: 2.083 ± 0.015
2.475AsnGlu: 2.475 ± 0.017
1.693AsnPhe: 1.693 ± 0.015
3.201AsnGly: 3.201 ± 0.02
1.108AsnHis: 1.108 ± 0.011
2.272AsnIle: 2.272 ± 0.018
2.408AsnLys: 2.408 ± 0.018
4.442AsnLeu: 4.442 ± 0.031
1.035AsnMet: 1.035 ± 0.011
2.279AsnAsn: 2.279 ± 0.021
2.389AsnPro: 2.389 ± 0.019
1.658AsnGln: 1.658 ± 0.016
2.16AsnArg: 2.16 ± 0.018
3.455AsnSer: 3.455 ± 0.022
2.048AsnThr: 2.048 ± 0.016
2.815AsnVal: 2.815 ± 0.018
0.554AsnTrp: 0.554 ± 0.009
1.175AsnTyr: 1.175 ± 0.012
0.0AsnXaa: 0.0 ± 0.0
Pro
2.889ProAla: 2.889 ± 0.022
0.702ProCys: 0.702 ± 0.009
2.478ProAsp: 2.478 ± 0.019
3.401ProGlu: 3.401 ± 0.023
1.881ProPhe: 1.881 ± 0.016
2.643ProGly: 2.643 ± 0.018
0.97ProHis: 0.97 ± 0.011
2.347ProIle: 2.347 ± 0.017
2.861ProLys: 2.861 ± 0.021
4.148ProLeu: 4.148 ± 0.021
1.073ProMet: 1.073 ± 0.014
2.128ProAsn: 2.128 ± 0.015
4.27ProPro: 4.27 ± 0.063
1.697ProGln: 1.697 ± 0.015
2.643ProArg: 2.643 ± 0.018
5.064ProSer: 5.064 ± 0.031
2.792ProThr: 2.792 ± 0.022
3.314ProVal: 3.314 ± 0.023
0.605ProTrp: 0.605 ± 0.008
1.281ProTyr: 1.281 ± 0.014
0.004ProXaa: 0.004 ± 0.001
Gln
2.259GlnAla: 2.259 ± 0.016
0.523GlnCys: 0.523 ± 0.008
1.654GlnAsp: 1.654 ± 0.013
2.436GlnGlu: 2.436 ± 0.022
1.28GlnPhe: 1.28 ± 0.01
2.051GlnGly: 2.051 ± 0.017
0.777GlnHis: 0.777 ± 0.009
1.864GlnIle: 1.864 ± 0.013
2.117GlnLys: 2.117 ± 0.018
3.083GlnLeu: 3.083 ± 0.02
0.863GlnMet: 0.863 ± 0.01
1.601GlnAsn: 1.601 ± 0.016
1.601GlnPro: 1.601 ± 0.017
1.791GlnGln: 1.791 ± 0.026
2.169GlnArg: 2.169 ± 0.016
2.694GlnSer: 2.694 ± 0.018
1.754GlnThr: 1.754 ± 0.013
2.247GlnVal: 2.247 ± 0.015
0.428GlnTrp: 0.428 ± 0.007
0.853GlnTyr: 0.853 ± 0.01
0.001GlnXaa: 0.001 ± 0.0
Arg
3.346ArgAla: 3.346 ± 0.023
1.005ArgCys: 1.005 ± 0.011
2.874ArgAsp: 2.874 ± 0.02
3.479ArgGlu: 3.479 ± 0.025
2.478ArgPhe: 2.478 ± 0.018
3.42ArgGly: 3.42 ± 0.025
1.227ArgHis: 1.227 ± 0.012
3.019ArgIle: 3.019 ± 0.018
3.859ArgLys: 3.859 ± 0.026
5.127ArgLeu: 5.127 ± 0.024
1.248ArgMet: 1.248 ± 0.011
2.476ArgAsn: 2.476 ± 0.017
2.334ArgPro: 2.334 ± 0.018
1.788ArgGln: 1.788 ± 0.017
4.305ArgArg: 4.305 ± 0.03
4.642ArgSer: 4.642 ± 0.03
2.624ArgThr: 2.624 ± 0.018
3.699ArgVal: 3.699 ± 0.02
0.71ArgTrp: 0.71 ± 0.01
1.471ArgTyr: 1.471 ± 0.012
0.002ArgXaa: 0.002 ± 0.0
Ser
5.102SerAla: 5.102 ± 0.029
1.589SerCys: 1.589 ± 0.016
4.764SerAsp: 4.764 ± 0.022
5.073SerGlu: 5.073 ± 0.028
3.918SerPhe: 3.918 ± 0.023
6.083SerGly: 6.083 ± 0.034
1.914SerHis: 1.914 ± 0.017
4.233SerIle: 4.233 ± 0.025
5.043SerLys: 5.043 ± 0.026
8.795SerLeu: 8.795 ± 0.043
1.982SerMet: 1.982 ± 0.014
3.761SerAsn: 3.761 ± 0.024
4.702SerPro: 4.702 ± 0.034
2.857SerGln: 2.857 ± 0.019
4.739SerArg: 4.739 ± 0.026
11.799SerSer: 11.799 ± 0.066
4.543SerThr: 4.543 ± 0.026
5.608SerVal: 5.608 ± 0.028
1.105SerTrp: 1.105 ± 0.012
2.3SerTyr: 2.3 ± 0.019
0.004SerXaa: 0.004 ± 0.001
Thr
3.445ThrAla: 3.445 ± 0.023
0.907ThrCys: 0.907 ± 0.012
2.552ThrAsp: 2.552 ± 0.016
3.28ThrGlu: 3.28 ± 0.024
2.06ThrPhe: 2.06 ± 0.018
3.477ThrGly: 3.477 ± 0.02
1.032ThrHis: 1.032 ± 0.01
2.781ThrIle: 2.781 ± 0.018
3.107ThrLys: 3.107 ± 0.02
4.673ThrLeu: 4.673 ± 0.026
1.261ThrMet: 1.261 ± 0.012
2.186ThrAsn: 2.186 ± 0.019
2.722ThrPro: 2.722 ± 0.023
1.574ThrGln: 1.574 ± 0.014
2.738ThrArg: 2.738 ± 0.019
4.789ThrSer: 4.789 ± 0.028
3.544ThrThr: 3.544 ± 0.031
3.676ThrVal: 3.676 ± 0.023
0.657ThrTrp: 0.657 ± 0.009
1.348ThrTyr: 1.348 ± 0.016
0.003ThrXaa: 0.003 ± 0.001
Val
4.933ValAla: 4.933 ± 0.026
1.13ValCys: 1.13 ± 0.012
3.972ValAsp: 3.972 ± 0.022
4.885ValGlu: 4.885 ± 0.025
2.903ValPhe: 2.903 ± 0.021
4.067ValGly: 4.067 ± 0.025
1.373ValHis: 1.373 ± 0.013
3.501ValIle: 3.501 ± 0.02
4.34ValLys: 4.34 ± 0.025
6.258ValLeu: 6.258 ± 0.028
1.637ValMet: 1.637 ± 0.014
2.662ValAsn: 2.662 ± 0.019
3.26ValPro: 3.26 ± 0.021
2.146ValGln: 2.146 ± 0.017
3.244ValArg: 3.244 ± 0.02
5.868ValSer: 5.868 ± 0.026
3.706ValThr: 3.706 ± 0.02
5.283ValVal: 5.283 ± 0.026
0.736ValTrp: 0.736 ± 0.01
1.994ValTyr: 1.994 ± 0.016
0.003ValXaa: 0.003 ± 0.001
Trp
0.683TrpAla: 0.683 ± 0.01
0.223TrpCys: 0.223 ± 0.005
0.661TrpAsp: 0.661 ± 0.01
0.802TrpGlu: 0.802 ± 0.009
0.579TrpPhe: 0.579 ± 0.009
0.686TrpGly: 0.686 ± 0.012
0.23TrpHis: 0.23 ± 0.005
0.728TrpIle: 0.728 ± 0.01
0.897TrpLys: 0.897 ± 0.009
1.147TrpLeu: 1.147 ± 0.012
0.315TrpMet: 0.315 ± 0.006
0.65TrpAsn: 0.65 ± 0.008
0.458TrpPro: 0.458 ± 0.007
0.371TrpGln: 0.371 ± 0.007
0.829TrpArg: 0.829 ± 0.01
0.993TrpSer: 0.993 ± 0.011
0.624TrpThr: 0.624 ± 0.008
0.752TrpVal: 0.752 ± 0.01
0.223TrpTrp: 0.223 ± 0.005
0.32TrpTyr: 0.32 ± 0.007
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.729TyrAla: 1.729 ± 0.015
0.573TyrCys: 0.573 ± 0.008
1.558TyrAsp: 1.558 ± 0.014
1.645TyrGlu: 1.645 ± 0.015
1.237TyrPhe: 1.237 ± 0.014
2.061TyrGly: 2.061 ± 0.018
0.666TyrHis: 0.666 ± 0.01
1.367TyrIle: 1.367 ± 0.013
1.553TyrLys: 1.553 ± 0.014
2.612TyrLeu: 2.612 ± 0.018
0.731TyrMet: 0.731 ± 0.01
1.269TyrAsn: 1.269 ± 0.014
1.215TyrPro: 1.215 ± 0.013
0.906TyrGln: 0.906 ± 0.011
1.418TyrArg: 1.418 ± 0.013
2.155TyrSer: 2.155 ± 0.017
1.295TyrThr: 1.295 ± 0.014
1.757TyrVal: 1.757 ± 0.016
0.371TyrTrp: 0.371 ± 0.007
0.944TyrTyr: 0.944 ± 0.012
0.001TyrXaa: 0.001 ± 0.0
Xaa
0.003XaaAla: 0.003 ± 0.001
0.001XaaCys: 0.001 ± 0.0
0.002XaaAsp: 0.002 ± 0.0
0.003XaaGlu: 0.003 ± 0.001
0.002XaaPhe: 0.002 ± 0.0
0.004XaaGly: 0.004 ± 0.001
0.001XaaHis: 0.001 ± 0.0
0.001XaaIle: 0.001 ± 0.0
0.002XaaLys: 0.002 ± 0.0
0.003XaaLeu: 0.003 ± 0.001
0.001XaaMet: 0.001 ± 0.0
0.002XaaAsn: 0.002 ± 0.0
0.004XaaPro: 0.004 ± 0.001
0.001XaaGln: 0.001 ± 0.0
0.002XaaArg: 0.002 ± 0.0
0.003XaaSer: 0.003 ± 0.001
0.002XaaThr: 0.002 ± 0.0
0.004XaaVal: 0.004 ± 0.001
0.001XaaTrp: 0.001 ± 0.0
0.001XaaTyr: 0.001 ± 0.0
0.391XaaXaa: 0.391 ± 0.068
Statistics based on 23245 proteins (9104347 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski