Amino acid dipepetide frequency for Orpheovirus IHUMI-LCC2

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.243AlaAla: 1.243 ± 0.073
0.532AlaCys: 0.532 ± 0.046
1.572AlaAsp: 1.572 ± 0.07
1.359AlaGlu: 1.359 ± 0.077
1.255AlaPhe: 1.255 ± 0.069
1.725AlaGly: 1.725 ± 0.141
0.415AlaHis: 0.415 ± 0.036
2.657AlaIle: 2.657 ± 0.118
1.999AlaLys: 1.999 ± 0.084
2.925AlaLeu: 2.925 ± 0.105
0.624AlaMet: 0.624 ± 0.048
1.879AlaAsn: 1.879 ± 0.079
0.867AlaPro: 0.867 ± 0.062
0.83AlaGln: 0.83 ± 0.074
0.966AlaArg: 0.966 ± 0.069
2.076AlaSer: 2.076 ± 0.074
1.366AlaThr: 1.366 ± 0.093
1.578AlaVal: 1.578 ± 0.078
0.335AlaTrp: 0.335 ± 0.046
1.661AlaTyr: 1.661 ± 0.072
0.0AlaXaa: 0.0 ± 0.0
Cys
0.492CysAla: 0.492 ± 0.039
0.351CysCys: 0.351 ± 0.041
1.055CysAsp: 1.055 ± 0.073
0.929CysGlu: 0.929 ± 0.051
0.541CysPhe: 0.541 ± 0.043
1.064CysGly: 1.064 ± 0.075
0.461CysHis: 0.461 ± 0.039
1.725CysIle: 1.725 ± 0.094
1.898CysLys: 1.898 ± 0.091
1.442CysLeu: 1.442 ± 0.078
0.449CysMet: 0.449 ± 0.041
1.725CysAsn: 1.725 ± 0.095
0.735CysPro: 0.735 ± 0.064
0.618CysGln: 0.618 ± 0.042
0.757CysArg: 0.757 ± 0.054
1.052CysSer: 1.052 ± 0.059
0.873CysThr: 0.873 ± 0.056
0.794CysVal: 0.794 ± 0.054
0.225CysTrp: 0.225 ± 0.027
1.252CysTyr: 1.252 ± 0.085
0.0CysXaa: 0.0 ± 0.0
Asp
1.956AspAla: 1.956 ± 0.08
0.957AspCys: 0.957 ± 0.087
5.167AspAsp: 5.167 ± 0.166
4.084AspGlu: 4.084 ± 0.136
2.073AspPhe: 2.073 ± 0.086
4.124AspGly: 4.124 ± 0.145
0.938AspHis: 0.938 ± 0.052
8.716AspIle: 8.716 ± 0.217
5.275AspLys: 5.275 ± 0.136
4.53AspLeu: 4.53 ± 0.135
1.956AspMet: 1.956 ± 0.081
6.151AspAsn: 6.151 ± 0.158
1.532AspPro: 1.532 ± 0.095
1.0AspGln: 1.0 ± 0.049
2.528AspArg: 2.528 ± 0.078
3.066AspSer: 3.066 ± 0.105
2.879AspThr: 2.879 ± 0.097
4.192AspVal: 4.192 ± 0.109
0.643AspTrp: 0.643 ± 0.047
3.977AspTyr: 3.977 ± 0.13
0.0AspXaa: 0.0 ± 0.0
Glu
1.479GluAla: 1.479 ± 0.081
1.35GluCys: 1.35 ± 0.088
4.838GluAsp: 4.838 ± 0.179
4.835GluGlu: 4.835 ± 0.174
1.935GluPhe: 1.935 ± 0.08
3.149GluGly: 3.149 ± 0.105
1.101GluHis: 1.101 ± 0.061
5.435GluIle: 5.435 ± 0.148
4.121GluLys: 4.121 ± 0.133
5.472GluLeu: 5.472 ± 0.132
1.396GluMet: 1.396 ± 0.068
4.503GluAsn: 4.503 ± 0.111
0.867GluPro: 0.867 ± 0.054
1.363GluGln: 1.363 ± 0.072
2.633GluArg: 2.633 ± 0.096
2.943GluSer: 2.943 ± 0.112
2.144GluThr: 2.144 ± 0.081
3.377GluVal: 3.377 ± 0.112
0.83GluTrp: 0.83 ± 0.06
4.887GluTyr: 4.887 ± 0.138
0.0GluXaa: 0.0 ± 0.0
Phe
1.123PheAla: 1.123 ± 0.062
0.597PheCys: 0.597 ± 0.04
2.353PheAsp: 2.353 ± 0.096
1.685PheGlu: 1.685 ± 0.08
1.212PhePhe: 1.212 ± 0.07
1.95PheGly: 1.95 ± 0.075
0.76PheHis: 0.76 ± 0.043
4.306PheIle: 4.306 ± 0.131
2.457PheLys: 2.457 ± 0.087
3.663PheLeu: 3.663 ± 0.109
1.166PheMet: 1.166 ± 0.062
3.263PheAsn: 3.263 ± 0.116
1.289PhePro: 1.289 ± 0.063
0.824PheGln: 0.824 ± 0.054
1.313PheArg: 1.313 ± 0.06
2.743PheSer: 2.743 ± 0.082
2.153PheThr: 2.153 ± 0.085
2.11PheVal: 2.11 ± 0.083
0.397PheTrp: 0.397 ± 0.034
2.565PheTyr: 2.565 ± 0.094
0.0PheXaa: 0.0 ± 0.0
Gly
1.596GlyAla: 1.596 ± 0.11
1.135GlyCys: 1.135 ± 0.066
3.519GlyAsp: 3.519 ± 0.167
2.62GlyGlu: 2.62 ± 0.113
1.713GlyPhe: 1.713 ± 0.078
3.223GlyGly: 3.223 ± 0.252
0.827GlyHis: 0.827 ± 0.052
4.367GlyIle: 4.367 ± 0.128
4.475GlyLys: 4.475 ± 0.182
4.14GlyLeu: 4.14 ± 0.119
1.172GlyMet: 1.172 ± 0.062
5.016GlyAsn: 5.016 ± 0.209
1.236GlyPro: 1.236 ± 0.124
1.575GlyGln: 1.575 ± 0.102
2.042GlyArg: 2.042 ± 0.089
3.531GlySer: 3.531 ± 0.199
2.703GlyThr: 2.703 ± 0.168
2.793GlyVal: 2.793 ± 0.154
0.489GlyTrp: 0.489 ± 0.039
3.426GlyTyr: 3.426 ± 0.115
0.0GlyXaa: 0.0 ± 0.0
His
0.544HisAla: 0.544 ± 0.035
0.329HisCys: 0.329 ± 0.033
1.0HisAsp: 1.0 ± 0.061
0.867HisGlu: 0.867 ± 0.057
0.612HisPhe: 0.612 ± 0.043
1.144HisGly: 1.144 ± 0.072
0.461HisHis: 0.461 ± 0.046
2.205HisIle: 2.205 ± 0.087
1.516HisLys: 1.516 ± 0.07
1.446HisLeu: 1.446 ± 0.07
0.455HisMet: 0.455 ± 0.037
1.735HisAsn: 1.735 ± 0.081
0.877HisPro: 0.877 ± 0.06
0.412HisGln: 0.412 ± 0.042
0.812HisArg: 0.812 ± 0.046
1.012HisSer: 1.012 ± 0.059
0.806HisThr: 0.806 ± 0.05
0.861HisVal: 0.861 ± 0.057
0.141HisTrp: 0.141 ± 0.017
1.058HisTyr: 1.058 ± 0.068
0.0HisXaa: 0.0 ± 0.0
Ile
2.325IleAla: 2.325 ± 0.098
1.842IleCys: 1.842 ± 0.077
6.01IleAsp: 6.01 ± 0.163
5.536IleGlu: 5.536 ± 0.144
4.155IlePhe: 4.155 ± 0.138
4.149IleGly: 4.149 ± 0.122
1.768IleHis: 1.768 ± 0.076
10.626IleIle: 10.626 ± 0.262
8.313IleLys: 8.313 ± 0.197
10.405IleLeu: 10.405 ± 0.225
2.584IleMet: 2.584 ± 0.092
9.239IleAsn: 9.239 ± 0.203
4.004IlePro: 4.004 ± 0.122
2.58IleGln: 2.58 ± 0.088
3.42IleArg: 3.42 ± 0.102
7.363IleSer: 7.363 ± 0.164
4.672IleThr: 4.672 ± 0.135
4.684IleVal: 4.684 ± 0.109
0.886IleTrp: 0.886 ± 0.054
6.127IleTyr: 6.127 ± 0.154
0.0IleXaa: 0.0 ± 0.0
Lys
1.649LysAla: 1.649 ± 0.075
1.682LysCys: 1.682 ± 0.094
6.21LysAsp: 6.21 ± 0.159
5.927LysGlu: 5.927 ± 0.171
3.571LysPhe: 3.571 ± 0.103
3.011LysGly: 3.011 ± 0.113
1.522LysHis: 1.522 ± 0.074
7.096LysIle: 7.096 ± 0.176
5.269LysLys: 5.269 ± 0.131
7.883LysLeu: 7.883 ± 0.166
1.821LysMet: 1.821 ± 0.07
6.127LysAsn: 6.127 ± 0.167
1.347LysPro: 1.347 ± 0.074
1.455LysGln: 1.455 ± 0.079
2.796LysArg: 2.796 ± 0.092
4.287LysSer: 4.287 ± 0.127
3.146LysThr: 3.146 ± 0.096
4.097LysVal: 4.097 ± 0.134
1.107LysTrp: 1.107 ± 0.079
7.271LysTyr: 7.271 ± 0.222
0.0LysXaa: 0.0 ± 0.0
Leu
2.608LeuAla: 2.608 ± 0.093
2.058LeuCys: 2.058 ± 0.082
6.133LeuAsp: 6.133 ± 0.141
5.281LeuGlu: 5.281 ± 0.147
3.878LeuPhe: 3.878 ± 0.138
4.134LeuGly: 4.134 ± 0.129
2.371LeuHis: 2.371 ± 0.101
6.948LeuIle: 6.948 ± 0.158
5.773LeuLys: 5.773 ± 0.145
10.165LeuLeu: 10.165 ± 0.212
2.156LeuMet: 2.156 ± 0.089
6.567LeuAsn: 6.567 ± 0.151
3.623LeuPro: 3.623 ± 0.119
3.512LeuGln: 3.512 ± 0.122
3.562LeuArg: 3.562 ± 0.105
8.043LeuSer: 8.043 ± 0.179
4.183LeuThr: 4.183 ± 0.153
4.291LeuVal: 4.291 ± 0.136
0.892LeuTrp: 0.892 ± 0.05
7.369LeuTyr: 7.369 ± 0.208
0.0LeuXaa: 0.0 ± 0.0
Met
0.677MetAla: 0.677 ± 0.043
0.369MetCys: 0.369 ± 0.038
2.045MetAsp: 2.045 ± 0.079
2.75MetGlu: 2.75 ± 0.086
0.83MetPhe: 0.83 ± 0.051
1.19MetGly: 1.19 ± 0.07
0.338MetHis: 0.338 ± 0.034
2.018MetIle: 2.018 ± 0.075
2.23MetLys: 2.23 ± 0.078
2.27MetLeu: 2.27 ± 0.079
0.833MetMet: 0.833 ± 0.051
1.919MetAsn: 1.919 ± 0.074
0.437MetPro: 0.437 ± 0.039
0.609MetGln: 0.609 ± 0.041
0.867MetArg: 0.867 ± 0.05
1.808MetSer: 1.808 ± 0.072
1.089MetThr: 1.089 ± 0.051
1.286MetVal: 1.286 ± 0.08
0.234MetTrp: 0.234 ± 0.027
1.409MetTyr: 1.409 ± 0.061
0.0MetXaa: 0.0 ± 0.0
Asn
2.334AsnAla: 2.334 ± 0.097
1.046AsnCys: 1.046 ± 0.089
5.016AsnAsp: 5.016 ± 0.131
3.977AsnGlu: 3.977 ± 0.11
2.968AsnPhe: 2.968 ± 0.11
5.795AsnGly: 5.795 ± 0.254
1.021AsnHis: 1.021 ± 0.059
11.967AsnIle: 11.967 ± 0.253
7.831AsnLys: 7.831 ± 0.204
6.911AsnLeu: 6.911 ± 0.174
2.694AsnMet: 2.694 ± 0.103
8.686AsnAsn: 8.686 ± 0.225
2.5AsnPro: 2.5 ± 0.097
1.344AsnGln: 1.344 ± 0.08
2.946AsnArg: 2.946 ± 0.105
4.706AsnSer: 4.706 ± 0.143
3.958AsnThr: 3.958 ± 0.112
4.881AsnVal: 4.881 ± 0.118
0.667AsnTrp: 0.667 ± 0.048
5.204AsnTyr: 5.204 ± 0.179
0.0AsnXaa: 0.0 ± 0.0
Pro
0.818ProAla: 0.818 ± 0.067
0.467ProCys: 0.467 ± 0.058
1.618ProAsp: 1.618 ± 0.078
1.778ProGlu: 1.778 ± 0.099
1.246ProPhe: 1.246 ± 0.065
1.381ProGly: 1.381 ± 0.13
0.507ProHis: 0.507 ± 0.038
2.977ProIle: 2.977 ± 0.123
2.076ProLys: 2.076 ± 0.081
2.547ProLeu: 2.547 ± 0.082
0.56ProMet: 0.56 ± 0.046
2.827ProAsn: 2.827 ± 0.091
1.329ProPro: 1.329 ± 0.116
1.11ProGln: 1.11 ± 0.143
1.03ProArg: 1.03 ± 0.093
2.605ProSer: 2.605 ± 0.11
1.999ProThr: 1.999 ± 0.099
1.575ProVal: 1.575 ± 0.082
0.258ProTrp: 0.258 ± 0.028
1.947ProTyr: 1.947 ± 0.083
0.0ProXaa: 0.0 ± 0.0
Gln
0.818GlnAla: 0.818 ± 0.057
0.458GlnCys: 0.458 ± 0.04
1.482GlnAsp: 1.482 ± 0.07
1.596GlnGlu: 1.596 ± 0.064
1.052GlnPhe: 1.052 ± 0.063
1.326GlnGly: 1.326 ± 0.134
0.431GlnHis: 0.431 ± 0.034
1.962GlnIle: 1.962 ± 0.09
1.412GlnLys: 1.412 ± 0.083
2.667GlnLeu: 2.667 ± 0.094
0.501GlnMet: 0.501 ± 0.04
1.898GlnAsn: 1.898 ± 0.082
0.987GlnPro: 0.987 ± 0.14
1.246GlnGln: 1.246 ± 0.313
1.061GlnArg: 1.061 ± 0.064
1.95GlnSer: 1.95 ± 0.111
1.23GlnThr: 1.23 ± 0.064
1.089GlnVal: 1.089 ± 0.06
0.311GlnTrp: 0.311 ± 0.035
2.184GlnTyr: 2.184 ± 0.088
0.0GlnXaa: 0.0 ± 0.0
Arg
1.058ArgAla: 1.058 ± 0.072
0.75ArgCys: 0.75 ± 0.052
2.651ArgAsp: 2.651 ± 0.105
2.537ArgGlu: 2.537 ± 0.08
1.381ArgPhe: 1.381 ± 0.064
1.879ArgGly: 1.879 ± 0.099
0.729ArgHis: 0.729 ± 0.044
3.245ArgIle: 3.245 ± 0.101
3.106ArgLys: 3.106 ± 0.09
3.712ArgLeu: 3.712 ± 0.113
1.012ArgMet: 1.012 ± 0.056
2.774ArgAsn: 2.774 ± 0.093
0.895ArgPro: 0.895 ± 0.063
0.91ArgGln: 0.91 ± 0.062
1.673ArgArg: 1.673 ± 0.097
2.359ArgSer: 2.359 ± 0.111
1.692ArgThr: 1.692 ± 0.077
1.633ArgVal: 1.633 ± 0.08
0.495ArgTrp: 0.495 ± 0.041
2.547ArgTyr: 2.547 ± 0.075
0.0ArgXaa: 0.0 ± 0.0
Ser
1.941SerAla: 1.941 ± 0.096
1.252SerCys: 1.252 ± 0.076
3.912SerAsp: 3.912 ± 0.116
3.183SerGlu: 3.183 ± 0.094
2.574SerPhe: 2.574 ± 0.097
3.239SerGly: 3.239 ± 0.143
1.335SerHis: 1.335 ± 0.063
6.428SerIle: 6.428 ± 0.153
5.176SerLys: 5.176 ± 0.14
7.012SerLeu: 7.012 ± 0.177
1.59SerMet: 1.59 ± 0.067
5.576SerAsn: 5.576 ± 0.16
2.048SerPro: 2.048 ± 0.114
1.99SerGln: 1.99 ± 0.106
2.273SerArg: 2.273 ± 0.098
5.278SerSer: 5.278 ± 0.163
3.276SerThr: 3.276 ± 0.111
2.934SerVal: 2.934 ± 0.09
0.698SerTrp: 0.698 ± 0.05
4.626SerTyr: 4.626 ± 0.129
0.0SerXaa: 0.0 ± 0.0
Thr
1.381ThrAla: 1.381 ± 0.094
0.766ThrCys: 0.766 ± 0.046
2.331ThrAsp: 2.331 ± 0.1
2.362ThrGlu: 2.362 ± 0.088
2.208ThrPhe: 2.208 ± 0.085
2.58ThrGly: 2.58 ± 0.195
0.809ThrHis: 0.809 ± 0.046
4.736ThrIle: 4.736 ± 0.139
3.18ThrLys: 3.18 ± 0.096
5.047ThrLeu: 5.047 ± 0.15
0.984ThrMet: 0.984 ± 0.055
3.848ThrAsn: 3.848 ± 0.124
1.833ThrPro: 1.833 ± 0.094
1.246ThrGln: 1.246 ± 0.075
1.516ThrArg: 1.516 ± 0.068
3.611ThrSer: 3.611 ± 0.121
2.547ThrThr: 2.547 ± 0.16
2.193ThrVal: 2.193 ± 0.096
0.526ThrTrp: 0.526 ± 0.039
3.202ThrTyr: 3.202 ± 0.115
0.0ThrXaa: 0.0 ± 0.0
Val
1.676ValAla: 1.676 ± 0.074
1.076ValCys: 1.076 ± 0.069
3.555ValAsp: 3.555 ± 0.099
3.073ValGlu: 3.073 ± 0.116
1.947ValPhe: 1.947 ± 0.075
2.777ValGly: 2.777 ± 0.115
0.926ValHis: 0.926 ± 0.056
4.377ValIle: 4.377 ± 0.133
3.986ValLys: 3.986 ± 0.123
4.573ValLeu: 4.573 ± 0.121
1.178ValMet: 1.178 ± 0.062
4.352ValAsn: 4.352 ± 0.127
1.922ValPro: 1.922 ± 0.109
1.393ValGln: 1.393 ± 0.07
1.978ValArg: 1.978 ± 0.084
3.519ValSer: 3.519 ± 0.106
2.51ValThr: 2.51 ± 0.103
2.897ValVal: 2.897 ± 0.119
0.655ValTrp: 0.655 ± 0.048
3.174ValTyr: 3.174 ± 0.097
0.0ValXaa: 0.0 ± 0.0
Trp
0.274TrpAla: 0.274 ± 0.028
0.237TrpCys: 0.237 ± 0.029
0.683TrpAsp: 0.683 ± 0.044
0.486TrpGlu: 0.486 ± 0.042
0.348TrpPhe: 0.348 ± 0.03
0.384TrpGly: 0.384 ± 0.041
0.237TrpHis: 0.237 ± 0.031
1.036TrpIle: 1.036 ± 0.066
0.935TrpLys: 0.935 ± 0.059
0.935TrpLeu: 0.935 ± 0.071
0.323TrpMet: 0.323 ± 0.036
1.172TrpAsn: 1.172 ± 0.064
0.16TrpPro: 0.16 ± 0.024
0.169TrpGln: 0.169 ± 0.02
0.351TrpArg: 0.351 ± 0.032
0.474TrpSer: 0.474 ± 0.041
0.538TrpThr: 0.538 ± 0.039
0.566TrpVal: 0.566 ± 0.061
0.135TrpTrp: 0.135 ± 0.027
1.098TrpTyr: 1.098 ± 0.086
0.0TrpXaa: 0.0 ± 0.0
Tyr
1.873TyrAla: 1.873 ± 0.074
1.184TyrCys: 1.184 ± 0.063
4.595TyrAsp: 4.595 ± 0.12
3.792TyrGlu: 3.792 ± 0.145
2.43TyrPhe: 2.43 ± 0.094
3.374TyrGly: 3.374 ± 0.102
1.335TyrHis: 1.335 ± 0.067
7.861TyrIle: 7.861 ± 0.227
6.244TyrLys: 6.244 ± 0.195
5.801TyrLeu: 5.801 ± 0.142
1.75TyrMet: 1.75 ± 0.084
7.197TyrAsn: 7.197 ± 0.189
2.168TyrPro: 2.168 ± 0.092
1.473TyrGln: 1.473 ± 0.08
2.516TyrArg: 2.516 ± 0.09
3.915TyrSer: 3.915 ± 0.122
3.143TyrThr: 3.143 ± 0.116
3.715TyrVal: 3.715 ± 0.114
0.615TyrTrp: 0.615 ± 0.049
4.629TyrTyr: 4.629 ± 0.153
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 1199 proteins (325136 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski