Amino acid dipepetide frequency for Ofaie virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
0.667AlaAla: 0.667 ± 0.088
0.934AlaCys: 0.934 ± 0.171
3.47AlaAsp: 3.47 ± 0.36
2.803AlaGlu: 2.803 ± 0.272
2.536AlaPhe: 2.536 ± 0.67
1.868AlaGly: 1.868 ± 0.139
0.934AlaHis: 0.934 ± 0.07
3.07AlaIle: 3.07 ± 0.355
3.336AlaLys: 3.336 ± 0.198
4.538AlaLeu: 4.538 ± 0.452
1.601AlaMet: 1.601 ± 0.259
3.737AlaAsn: 3.737 ± 0.278
0.934AlaPro: 0.934 ± 0.07
1.868AlaGln: 1.868 ± 0.379
2.002AlaArg: 2.002 ± 0.263
1.601AlaSer: 1.601 ± 0.222
3.87AlaThr: 3.87 ± 0.116
3.87AlaVal: 3.87 ± 0.364
0.0AlaTrp: 0.0 ± 0.0
2.669AlaTyr: 2.669 ± 0.13
0.0AlaXaa: 0.0 ± 0.0
Cys
1.201CysAla: 1.201 ± 0.254
0.133CysCys: 0.133 ± 0.079
1.068CysAsp: 1.068 ± 0.148
2.002CysGlu: 2.002 ± 0.263
1.201CysPhe: 1.201 ± 0.254
1.335CysGly: 1.335 ± 0.175
0.934CysHis: 0.934 ± 0.07
1.735CysIle: 1.735 ± 0.18
2.002CysLys: 2.002 ± 0.023
2.135CysLeu: 2.135 ± 0.184
0.667CysMet: 0.667 ± 0.088
1.868CysAsn: 1.868 ± 0.62
0.801CysPro: 0.801 ± 0.009
0.0CysGln: 0.0 ± 0.0
0.801CysArg: 0.801 ± 0.009
1.868CysSer: 1.868 ± 0.101
1.468CysThr: 1.468 ± 0.097
2.269CysVal: 2.269 ± 0.346
0.0CysTrp: 0.0 ± 0.0
1.468CysTyr: 1.468 ± 0.097
0.0CysXaa: 0.0 ± 0.0
Asp
3.203AspAla: 3.203 ± 0.445
1.601AspCys: 1.601 ± 0.259
3.47AspAsp: 3.47 ± 0.121
2.936AspGlu: 2.936 ± 0.434
3.203AspPhe: 3.203 ± 0.277
1.601AspGly: 1.601 ± 0.222
1.335AspHis: 1.335 ± 0.065
4.671AspIle: 4.671 ± 0.348
5.071AspLys: 5.071 ± 0.138
7.741AspLeu: 7.741 ± 0.488
1.468AspMet: 1.468 ± 0.097
2.803AspAsn: 2.803 ± 0.032
2.536AspPro: 2.536 ± 0.292
1.735AspGln: 1.735 ± 0.301
2.269AspArg: 2.269 ± 0.135
3.07AspSer: 3.07 ± 0.366
4.538AspThr: 4.538 ± 0.693
2.269AspVal: 2.269 ± 0.135
0.4AspTrp: 0.4 ± 0.005
5.605AspTyr: 5.605 ± 0.177
0.0AspXaa: 0.0 ± 0.0
Glu
1.201GluAla: 1.201 ± 0.014
1.201GluCys: 1.201 ± 0.254
2.002GluAsp: 2.002 ± 0.023
1.601GluGlu: 1.601 ± 0.018
2.269GluPhe: 2.269 ± 0.375
1.335GluGly: 1.335 ± 0.416
1.868GluHis: 1.868 ± 0.139
3.47GluIle: 3.47 ± 0.36
1.468GluLys: 1.468 ± 0.337
5.872GluLeu: 5.872 ± 0.334
0.4GluMet: 0.4 ± 0.005
2.936GluAsn: 2.936 ± 0.047
2.536GluPro: 2.536 ± 0.051
2.936GluGln: 2.936 ± 0.194
1.468GluArg: 1.468 ± 0.097
3.07GluSer: 3.07 ± 0.355
3.203GluThr: 3.203 ± 0.277
2.269GluVal: 2.269 ± 0.375
0.267GluTrp: 0.267 ± 0.083
2.002GluTyr: 2.002 ± 0.218
0.0GluXaa: 0.0 ± 0.0
Phe
1.335PheAla: 1.335 ± 0.175
1.201PheCys: 1.201 ± 0.014
4.271PheAsp: 4.271 ± 0.352
2.803PheGlu: 2.803 ± 0.272
1.201PhePhe: 1.201 ± 0.014
2.002PheGly: 2.002 ± 0.218
0.4PheHis: 0.4 ± 0.005
2.669PheIle: 2.669 ± 0.11
1.735PheLys: 1.735 ± 0.782
2.936PheLeu: 2.936 ± 0.528
1.735PheMet: 1.735 ± 0.06
2.536PheAsn: 2.536 ± 0.189
0.934PhePro: 0.934 ± 0.07
1.335PheGln: 1.335 ± 0.416
0.801PheArg: 0.801 ± 0.009
3.203PheSer: 3.203 ± 0.036
3.47PheThr: 3.47 ± 0.36
2.936PheVal: 2.936 ± 0.674
0.267PheTrp: 0.267 ± 0.083
2.803PheTyr: 2.803 ± 0.753
0.0PheXaa: 0.0 ± 0.0
Gly
1.868GlyAla: 1.868 ± 0.342
0.801GlyCys: 0.801 ± 0.009
1.868GlyAsp: 1.868 ± 0.139
0.801GlyGlu: 0.801 ± 0.231
1.601GlyPhe: 1.601 ± 0.259
0.934GlyGly: 0.934 ± 0.31
1.068GlyHis: 1.068 ± 0.333
1.068GlyIle: 1.068 ± 0.148
0.934GlyLys: 0.934 ± 0.31
4.538GlyLeu: 4.538 ± 0.029
0.934GlyMet: 0.934 ± 0.07
2.135GlyAsn: 2.135 ± 0.184
1.335GlyPro: 1.335 ± 0.175
0.667GlyGln: 0.667 ± 0.088
2.135GlyArg: 2.135 ± 0.296
1.868GlySer: 1.868 ± 0.101
3.603GlyThr: 3.603 ± 0.2
1.335GlyVal: 1.335 ± 0.305
0.267GlyTrp: 0.267 ± 0.157
2.402GlyTyr: 2.402 ± 0.508
0.0GlyXaa: 0.0 ± 0.0
His
1.468HisAla: 1.468 ± 0.337
0.801HisCys: 0.801 ± 0.009
1.601HisAsp: 1.601 ± 0.463
0.534HisGlu: 0.534 ± 0.074
2.135HisPhe: 2.135 ± 0.184
1.468HisGly: 1.468 ± 0.097
0.934HisHis: 0.934 ± 0.07
2.669HisIle: 2.669 ± 0.37
2.536HisLys: 2.536 ± 0.532
2.536HisLeu: 2.536 ± 0.532
2.269HisMet: 2.269 ± 0.106
1.868HisAsn: 1.868 ± 0.101
2.536HisPro: 2.536 ± 0.189
1.468HisGln: 1.468 ± 0.144
0.667HisArg: 0.667 ± 0.153
1.868HisSer: 1.868 ± 0.379
2.803HisThr: 2.803 ± 0.209
1.335HisVal: 1.335 ± 0.305
0.133HisTrp: 0.133 ± 0.079
1.601HisTyr: 1.601 ± 0.018
0.0HisXaa: 0.0 ± 0.0
Ile
3.87IleAla: 3.87 ± 0.364
1.201IleCys: 1.201 ± 0.014
4.804IleAsp: 4.804 ± 0.667
2.402IleGlu: 2.402 ± 0.454
2.269IlePhe: 2.269 ± 0.346
2.669IleGly: 2.669 ± 0.11
1.735IleHis: 1.735 ± 0.06
4.004IleIle: 4.004 ± 0.195
5.605IleLys: 5.605 ± 0.177
6.94IleLeu: 6.94 ± 0.001
1.468IleMet: 1.468 ± 0.097
6.006IleAsn: 6.006 ± 0.172
4.404IlePro: 4.404 ± 0.05
3.203IleGln: 3.203 ± 0.036
4.671IleArg: 4.671 ± 0.374
3.603IleSer: 3.603 ± 0.2
5.739IleThr: 5.739 ± 0.255
4.004IleVal: 4.004 ± 0.045
1.068IleTrp: 1.068 ± 0.092
4.137IleTyr: 4.137 ± 0.754
0.0IleXaa: 0.0 ± 0.0
Lys
2.135LysAla: 2.135 ± 0.056
0.534LysCys: 0.534 ± 0.166
3.87LysAsp: 3.87 ± 0.116
3.47LysGlu: 3.47 ± 0.36
1.868LysPhe: 1.868 ± 0.139
0.534LysGly: 0.534 ± 0.074
4.271LysHis: 4.271 ± 0.112
5.872LysIle: 5.872 ± 0.147
1.601LysLys: 1.601 ± 0.259
6.806LysLeu: 6.806 ± 0.163
0.4LysMet: 0.4 ± 0.054
3.07LysAsn: 3.07 ± 0.115
2.803LysPro: 2.803 ± 0.449
2.135LysGln: 2.135 ± 0.056
1.868LysArg: 1.868 ± 0.379
4.938LysSer: 4.938 ± 0.216
4.538LysThr: 4.538 ± 0.51
3.336LysVal: 3.336 ± 0.042
0.133LysTrp: 0.133 ± 0.079
3.603LysTyr: 3.603 ± 0.44
0.0LysXaa: 0.0 ± 0.0
Leu
4.538LeuAla: 4.538 ± 0.029
3.07LeuCys: 3.07 ± 0.115
6.406LeuAsp: 6.406 ± 0.168
3.603LeuGlu: 3.603 ± 0.281
4.404LeuPhe: 4.404 ± 0.29
2.002LeuGly: 2.002 ± 0.218
3.07LeuHis: 3.07 ± 0.125
7.073LeuIle: 7.073 ± 1.042
6.139LeuLys: 6.139 ± 0.47
10.677LeuLeu: 10.677 ± 0.039
1.735LeuMet: 1.735 ± 0.42
8.141LeuAsn: 8.141 ± 0.228
3.203LeuPro: 3.203 ± 0.277
4.804LeuGln: 4.804 ± 0.295
4.538LeuArg: 4.538 ± 0.269
8.808LeuSer: 8.808 ± 0.621
7.474LeuThr: 7.474 ± 0.076
6.806LeuVal: 6.806 ± 0.404
0.801LeuTrp: 0.801 ± 0.231
7.741LeuTyr: 7.741 ± 0.008
0.0LeuXaa: 0.0 ± 0.0
Met
2.402MetAla: 2.402 ± 0.268
1.068MetCys: 1.068 ± 0.092
0.801MetAsp: 0.801 ± 0.009
0.534MetGlu: 0.534 ± 0.166
0.133MetPhe: 0.133 ± 0.079
0.4MetGly: 0.4 ± 0.005
1.201MetHis: 1.201 ± 0.014
1.201MetIle: 1.201 ± 0.254
1.068MetLys: 1.068 ± 0.092
3.47MetLeu: 3.47 ± 0.121
0.534MetMet: 0.534 ± 0.074
0.534MetAsn: 0.534 ± 0.074
1.201MetPro: 1.201 ± 0.254
2.269MetGln: 2.269 ± 0.346
0.801MetArg: 0.801 ± 0.231
1.068MetSer: 1.068 ± 0.092
1.201MetThr: 1.201 ± 0.254
1.601MetVal: 1.601 ± 0.259
0.133MetTrp: 0.133 ± 0.079
0.934MetTyr: 0.934 ± 0.07
0.0MetXaa: 0.0 ± 0.0
Asn
2.803AsnAla: 2.803 ± 0.032
2.669AsnCys: 2.669 ± 0.11
2.669AsnAsp: 2.669 ± 0.591
4.671AsnGlu: 4.671 ± 0.133
3.737AsnPhe: 3.737 ± 0.203
2.669AsnGly: 2.669 ± 0.11
2.402AsnHis: 2.402 ± 0.027
6.806AsnIle: 6.806 ± 0.404
3.47AsnLys: 3.47 ± 0.361
7.207AsnLeu: 7.207 ± 0.322
2.002AsnMet: 2.002 ± 0.218
4.004AsnAsn: 4.004 ± 0.045
4.404AsnPro: 4.404 ± 0.531
2.402AsnGln: 2.402 ± 0.027
1.735AsnArg: 1.735 ± 0.06
2.402AsnSer: 2.402 ± 0.213
5.071AsnThr: 5.071 ± 0.343
4.404AsnVal: 4.404 ± 0.05
0.267AsnTrp: 0.267 ± 0.157
3.336AsnTyr: 3.336 ± 0.198
0.0AsnXaa: 0.0 ± 0.0
Pro
1.868ProAla: 1.868 ± 0.342
0.534ProCys: 0.534 ± 0.166
2.936ProAsp: 2.936 ± 0.047
0.934ProGlu: 0.934 ± 0.31
0.801ProPhe: 0.801 ± 0.009
0.667ProGly: 0.667 ± 0.153
1.735ProHis: 1.735 ± 0.06
3.603ProIle: 3.603 ± 0.2
2.536ProLys: 2.536 ± 0.292
6.673ProLeu: 6.673 ± 0.396
0.801ProMet: 0.801 ± 0.231
3.336ProAsn: 3.336 ± 0.042
1.335ProPro: 1.335 ± 0.546
1.868ProGln: 1.868 ± 0.379
1.868ProArg: 1.868 ± 0.139
3.603ProSer: 3.603 ± 0.281
3.603ProThr: 3.603 ± 0.2
2.936ProVal: 2.936 ± 0.047
0.534ProTrp: 0.534 ± 0.166
2.002ProTyr: 2.002 ± 0.023
0.0ProXaa: 0.0 ± 0.0
Gln
2.002GlnAla: 2.002 ± 0.263
1.468GlnCys: 1.468 ± 0.337
3.203GlnAsp: 3.203 ± 0.277
2.002GlnGlu: 2.002 ± 0.218
0.934GlnPhe: 0.934 ± 0.31
0.801GlnGly: 0.801 ± 0.472
1.601GlnHis: 1.601 ± 0.222
2.669GlnIle: 2.669 ± 0.351
1.735GlnLys: 1.735 ± 0.301
4.538GlnLeu: 4.538 ± 0.029
0.0GlnMet: 0.0 ± 0.0
1.868GlnAsn: 1.868 ± 0.101
1.601GlnPro: 1.601 ± 0.222
2.135GlnGln: 2.135 ± 0.296
1.468GlnArg: 1.468 ± 0.097
1.868GlnSer: 1.868 ± 0.101
2.936GlnThr: 2.936 ± 0.047
2.135GlnVal: 2.135 ± 0.184
0.267GlnTrp: 0.267 ± 0.157
2.536GlnTyr: 2.536 ± 0.292
0.0GlnXaa: 0.0 ± 0.0
Arg
1.601ArgAla: 1.601 ± 0.018
1.201ArgCys: 1.201 ± 0.227
2.803ArgAsp: 2.803 ± 0.032
1.335ArgGlu: 1.335 ± 0.065
1.601ArgPhe: 1.601 ± 0.222
0.667ArgGly: 0.667 ± 0.088
1.068ArgHis: 1.068 ± 0.148
2.269ArgIle: 2.269 ± 0.375
3.07ArgLys: 3.07 ± 0.125
3.603ArgLeu: 3.603 ± 0.44
0.133ArgMet: 0.133 ± 0.091
4.938ArgAsn: 4.938 ± 0.216
1.335ArgPro: 1.335 ± 0.305
1.468ArgGln: 1.468 ± 0.097
2.803ArgArg: 2.803 ± 0.032
1.735ArgSer: 1.735 ± 0.301
2.936ArgThr: 2.936 ± 0.434
3.07ArgVal: 3.07 ± 0.115
0.133ArgTrp: 0.133 ± 0.079
2.936ArgTyr: 2.936 ± 0.434
0.0ArgXaa: 0.0 ± 0.0
Ser
3.603SerAla: 3.603 ± 0.041
1.335SerCys: 1.335 ± 0.175
4.004SerAsp: 4.004 ± 0.045
3.07SerGlu: 3.07 ± 0.115
3.07SerPhe: 3.07 ± 0.355
2.269SerGly: 2.269 ± 0.375
1.868SerHis: 1.868 ± 0.86
5.071SerIle: 5.071 ± 0.138
2.803SerLys: 2.803 ± 0.209
5.205SerLeu: 5.205 ± 0.422
1.601SerMet: 1.601 ± 0.018
3.603SerAsn: 3.603 ± 0.281
1.868SerPro: 1.868 ± 0.62
2.002SerGln: 2.002 ± 0.218
2.002SerArg: 2.002 ± 0.023
1.201SerSer: 1.201 ± 0.467
4.671SerThr: 4.671 ± 0.374
3.47SerVal: 3.47 ± 0.121
0.133SerTrp: 0.133 ± 0.079
3.603SerTyr: 3.603 ± 0.522
0.0SerXaa: 0.0 ± 0.0
Thr
4.938ThrAla: 4.938 ± 0.265
2.536ThrCys: 2.536 ± 0.051
3.203ThrAsp: 3.203 ± 0.036
2.402ThrGlu: 2.402 ± 0.027
2.402ThrPhe: 2.402 ± 0.027
3.87ThrGly: 3.87 ± 0.605
2.002ThrHis: 2.002 ± 0.458
4.538ThrIle: 4.538 ± 0.029
5.338ThrLys: 5.338 ± 0.702
8.408ThrLeu: 8.408 ± 0.145
0.667ThrMet: 0.667 ± 0.088
5.872ThrAsn: 5.872 ± 0.147
5.338ThrPro: 5.338 ± 0.221
2.002ThrGln: 2.002 ± 0.458
3.203ThrArg: 3.203 ± 0.036
4.137ThrSer: 4.137 ± 0.207
3.87ThrThr: 3.87 ± 0.124
5.205ThrVal: 5.205 ± 0.299
0.534ThrTrp: 0.534 ± 0.074
6.139ThrTyr: 6.139 ± 0.23
0.0ThrXaa: 0.0 ± 0.0
Val
2.803ValAla: 2.803 ± 0.272
1.468ValCys: 1.468 ± 0.144
4.004ValAsp: 4.004 ± 0.526
2.936ValGlu: 2.936 ± 0.047
2.402ValPhe: 2.402 ± 0.454
2.002ValGly: 2.002 ± 0.023
2.402ValHis: 2.402 ± 0.268
6.273ValIle: 6.273 ± 0.089
3.203ValLys: 3.203 ± 0.204
4.538ValLeu: 4.538 ± 0.51
1.868ValMet: 1.868 ± 0.342
4.804ValAsn: 4.804 ± 0.535
3.336ValPro: 3.336 ± 0.283
1.601ValGln: 1.601 ± 0.018
2.669ValArg: 2.669 ± 0.11
3.07ValSer: 3.07 ± 0.115
5.605ValThr: 5.605 ± 0.304
4.804ValVal: 4.804 ± 0.776
0.0ValTrp: 0.0 ± 0.0
3.87ValTyr: 3.87 ± 0.364
0.0ValXaa: 0.0 ± 0.0
Trp
0.534TrpAla: 0.534 ± 0.074
0.0TrpCys: 0.0 ± 0.0
0.801TrpAsp: 0.801 ± 0.231
0.0TrpGlu: 0.0 ± 0.0
0.4TrpPhe: 0.4 ± 0.005
0.0TrpGly: 0.0 ± 0.0
0.133TrpHis: 0.133 ± 0.079
0.267TrpIle: 0.267 ± 0.157
0.133TrpLys: 0.133 ± 0.079
0.801TrpLeu: 0.801 ± 0.009
0.0TrpMet: 0.0 ± 0.0
0.267TrpAsn: 0.267 ± 0.157
0.267TrpPro: 0.267 ± 0.083
0.0TrpGln: 0.0 ± 0.0
0.0TrpArg: 0.0 ± 0.0
0.133TrpSer: 0.133 ± 0.079
0.667TrpThr: 0.667 ± 0.088
0.4TrpVal: 0.4 ± 0.005
0.0TrpTrp: 0.0 ± 0.0
0.667TrpTyr: 0.667 ± 0.153
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.269TyrAla: 2.269 ± 0.106
1.201TyrCys: 1.201 ± 0.227
4.271TyrAsp: 4.271 ± 0.369
2.536TyrGlu: 2.536 ± 0.051
2.402TyrPhe: 2.402 ± 0.213
3.07TyrGly: 3.07 ± 0.355
2.269TyrHis: 2.269 ± 0.375
4.804TyrIle: 4.804 ± 0.054
4.137TyrLys: 4.137 ± 0.274
5.872TyrLeu: 5.872 ± 0.147
2.135TyrMet: 2.135 ± 0.425
4.804TyrAsn: 4.804 ± 0.295
1.468TyrPro: 1.468 ± 0.144
1.868TyrGln: 1.868 ± 0.139
2.936TyrArg: 2.936 ± 0.047
3.336TyrSer: 3.336 ± 0.042
5.338TyrThr: 5.338 ± 0.221
5.205TyrVal: 5.205 ± 0.54
0.133TyrTrp: 0.133 ± 0.079
4.671TyrTyr: 4.671 ± 0.374
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 2 proteins (7494 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski