Amino acid dipepetide frequency for Mycobacterium virus Larva

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
20.963AlaAla: 20.963 ± 1.985
1.119AlaCys: 1.119 ± 0.239
8.497AlaAsp: 8.497 ± 0.723
7.53AlaGlu: 7.53 ± 0.79
3.155AlaPhe: 3.155 ± 0.635
12.822AlaGly: 12.822 ± 0.913
2.493AlaHis: 2.493 ± 0.361
4.681AlaIle: 4.681 ± 0.553
4.172AlaLys: 4.172 ± 0.46
10.532AlaLeu: 10.532 ± 0.748
2.849AlaMet: 2.849 ± 0.418
2.798AlaAsn: 2.798 ± 0.454
6.055AlaPro: 6.055 ± 0.619
4.376AlaGln: 4.376 ± 0.449
8.395AlaArg: 8.395 ± 0.816
5.444AlaSer: 5.444 ± 0.615
6.614AlaThr: 6.614 ± 0.683
10.736AlaVal: 10.736 ± 0.934
2.442AlaTrp: 2.442 ± 0.429
2.595AlaTyr: 2.595 ± 0.386
0.0AlaXaa: 0.0 ± 0.0
Cys
1.272CysAla: 1.272 ± 0.267
0.051CysCys: 0.051 ± 0.049
0.661CysAsp: 0.661 ± 0.177
0.458CysGlu: 0.458 ± 0.135
0.153CysPhe: 0.153 ± 0.093
1.272CysGly: 1.272 ± 0.326
0.356CysHis: 0.356 ± 0.171
0.356CysIle: 0.356 ± 0.145
0.458CysLys: 0.458 ± 0.122
0.305CysLeu: 0.305 ± 0.12
0.204CysMet: 0.204 ± 0.117
0.305CysAsn: 0.305 ± 0.132
0.56CysPro: 0.56 ± 0.138
0.509CysGln: 0.509 ± 0.152
0.458CysArg: 0.458 ± 0.145
0.661CysSer: 0.661 ± 0.217
0.763CysThr: 0.763 ± 0.214
0.712CysVal: 0.712 ± 0.207
0.407CysTrp: 0.407 ± 0.138
0.254CysTyr: 0.254 ± 0.115
0.0CysXaa: 0.0 ± 0.0
Asp
8.904AspAla: 8.904 ± 0.796
0.916AspCys: 0.916 ± 0.269
5.902AspAsp: 5.902 ± 0.647
5.19AspGlu: 5.19 ± 0.574
1.119AspPhe: 1.119 ± 0.251
6.106AspGly: 6.106 ± 0.645
1.119AspHis: 1.119 ± 0.217
2.239AspIle: 2.239 ± 0.312
1.679AspLys: 1.679 ± 0.396
6.309AspLeu: 6.309 ± 0.461
1.476AspMet: 1.476 ± 0.325
1.883AspAsn: 1.883 ± 0.285
3.714AspPro: 3.714 ± 0.497
1.628AspGln: 1.628 ± 0.273
5.444AspArg: 5.444 ± 0.663
2.137AspSer: 2.137 ± 0.317
3.969AspThr: 3.969 ± 0.436
5.037AspVal: 5.037 ± 0.391
1.272AspTrp: 1.272 ± 0.206
1.526AspTyr: 1.526 ± 0.289
0.0AspXaa: 0.0 ± 0.0
Glu
6.767GluAla: 6.767 ± 0.796
0.661GluCys: 0.661 ± 0.173
3.104GluAsp: 3.104 ± 0.385
2.697GluGlu: 2.697 ± 0.427
1.577GluPhe: 1.577 ± 0.33
3.562GluGly: 3.562 ± 0.513
2.239GluHis: 2.239 ± 0.382
1.628GluIle: 1.628 ± 0.322
1.577GluLys: 1.577 ± 0.277
5.648GluLeu: 5.648 ± 0.583
1.476GluMet: 1.476 ± 0.264
0.916GluAsn: 0.916 ± 0.188
3.155GluPro: 3.155 ± 0.496
2.442GluGln: 2.442 ± 0.391
5.444GluArg: 5.444 ± 0.801
2.748GluSer: 2.748 ± 0.378
2.442GluThr: 2.442 ± 0.323
4.02GluVal: 4.02 ± 0.492
0.967GluTrp: 0.967 ± 0.173
1.628GluTyr: 1.628 ± 0.226
0.0GluXaa: 0.0 ± 0.0
Phe
2.849PheAla: 2.849 ± 0.418
0.204PheCys: 0.204 ± 0.097
2.849PheAsp: 2.849 ± 0.402
1.628PheGlu: 1.628 ± 0.279
0.56PhePhe: 0.56 ± 0.141
2.9PheGly: 2.9 ± 0.373
0.56PheHis: 0.56 ± 0.18
1.119PheIle: 1.119 ± 0.194
1.018PheLys: 1.018 ± 0.254
1.628PheLeu: 1.628 ± 0.268
0.56PheMet: 0.56 ± 0.156
0.865PheAsn: 0.865 ± 0.238
1.272PhePro: 1.272 ± 0.258
0.56PheGln: 0.56 ± 0.165
1.679PheArg: 1.679 ± 0.264
1.73PheSer: 1.73 ± 0.263
2.086PheThr: 2.086 ± 0.355
1.628PheVal: 1.628 ± 0.29
0.305PheTrp: 0.305 ± 0.126
0.458PheTyr: 0.458 ± 0.143
0.0PheXaa: 0.0 ± 0.0
Gly
9.108GlyAla: 9.108 ± 0.941
1.018GlyCys: 1.018 ± 0.238
6.36GlyAsp: 6.36 ± 0.687
4.885GlyGlu: 4.885 ± 0.506
2.29GlyPhe: 2.29 ± 0.406
8.701GlyGly: 8.701 ± 1.592
1.628GlyHis: 1.628 ± 0.38
3.867GlyIle: 3.867 ± 0.392
3.358GlyLys: 3.358 ± 0.471
5.597GlyLeu: 5.597 ± 0.85
2.391GlyMet: 2.391 ± 0.44
3.358GlyAsn: 3.358 ± 0.348
4.07GlyPro: 4.07 ± 0.505
2.646GlyGln: 2.646 ± 0.403
6.818GlyArg: 6.818 ± 0.642
4.732GlySer: 4.732 ± 0.714
6.157GlyThr: 6.157 ± 0.577
6.106GlyVal: 6.106 ± 0.567
1.832GlyTrp: 1.832 ± 0.307
2.035GlyTyr: 2.035 ± 0.328
0.0GlyXaa: 0.0 ± 0.0
His
2.442HisAla: 2.442 ± 0.419
0.458HisCys: 0.458 ± 0.16
1.526HisAsp: 1.526 ± 0.284
1.272HisGlu: 1.272 ± 0.232
0.509HisPhe: 0.509 ± 0.184
2.29HisGly: 2.29 ± 0.397
0.763HisHis: 0.763 ± 0.205
0.763HisIle: 0.763 ± 0.181
0.763HisLys: 0.763 ± 0.197
2.188HisLeu: 2.188 ± 0.384
0.458HisMet: 0.458 ± 0.15
0.661HisAsn: 0.661 ± 0.175
1.476HisPro: 1.476 ± 0.323
0.712HisGln: 0.712 ± 0.263
1.73HisArg: 1.73 ± 0.288
0.967HisSer: 0.967 ± 0.231
1.119HisThr: 1.119 ± 0.24
1.73HisVal: 1.73 ± 0.311
0.305HisTrp: 0.305 ± 0.171
0.509HisTyr: 0.509 ± 0.169
0.0HisXaa: 0.0 ± 0.0
Ile
5.444IleAla: 5.444 ± 0.569
0.153IleCys: 0.153 ± 0.082
3.918IleAsp: 3.918 ± 0.48
2.9IleGlu: 2.9 ± 0.464
0.712IlePhe: 0.712 ± 0.212
3.867IleGly: 3.867 ± 0.507
0.814IleHis: 0.814 ± 0.218
1.221IleIle: 1.221 ± 0.223
1.526IleLys: 1.526 ± 0.284
2.442IleLeu: 2.442 ± 0.321
0.407IleMet: 0.407 ± 0.149
1.628IleAsn: 1.628 ± 0.313
1.984IlePro: 1.984 ± 0.337
0.865IleGln: 0.865 ± 0.216
3.002IleArg: 3.002 ± 0.391
2.239IleSer: 2.239 ± 0.354
2.544IleThr: 2.544 ± 0.262
3.358IleVal: 3.358 ± 0.447
0.611IleTrp: 0.611 ± 0.241
0.763IleTyr: 0.763 ± 0.218
0.0IleXaa: 0.0 ± 0.0
Lys
4.07LysAla: 4.07 ± 0.655
0.458LysCys: 0.458 ± 0.184
1.018LysAsp: 1.018 ± 0.228
0.763LysGlu: 0.763 ± 0.152
1.17LysPhe: 1.17 ± 0.269
1.933LysGly: 1.933 ± 0.34
1.17LysHis: 1.17 ± 0.315
1.221LysIle: 1.221 ± 0.36
0.865LysLys: 0.865 ± 0.169
4.07LysLeu: 4.07 ± 0.483
0.916LysMet: 0.916 ± 0.207
1.119LysAsn: 1.119 ± 0.278
2.544LysPro: 2.544 ± 0.473
1.476LysGln: 1.476 ± 0.333
2.9LysArg: 2.9 ± 0.45
1.781LysSer: 1.781 ± 0.304
2.391LysThr: 2.391 ± 0.341
2.697LysVal: 2.697 ± 0.313
0.661LysTrp: 0.661 ± 0.175
0.56LysTyr: 0.56 ± 0.146
0.0LysXaa: 0.0 ± 0.0
Leu
11.143LeuAla: 11.143 ± 0.638
1.018LeuCys: 1.018 ± 0.232
5.699LeuAsp: 5.699 ± 0.527
3.155LeuGlu: 3.155 ± 0.366
2.34LeuPhe: 2.34 ± 0.336
6.004LeuGly: 6.004 ± 0.647
1.628LeuHis: 1.628 ± 0.289
2.798LeuIle: 2.798 ± 0.553
2.951LeuLys: 2.951 ± 0.394
5.088LeuLeu: 5.088 ± 0.467
1.272LeuMet: 1.272 ± 0.238
1.984LeuAsn: 1.984 ± 0.32
4.732LeuPro: 4.732 ± 0.609
3.46LeuGln: 3.46 ± 0.434
6.818LeuArg: 6.818 ± 0.769
4.325LeuSer: 4.325 ± 0.627
5.597LeuThr: 5.597 ± 0.524
5.139LeuVal: 5.139 ± 0.419
1.526LeuTrp: 1.526 ± 0.34
1.679LeuTyr: 1.679 ± 0.294
0.0LeuXaa: 0.0 ± 0.0
Met
1.933MetAla: 1.933 ± 0.262
0.204MetCys: 0.204 ± 0.114
0.763MetAsp: 0.763 ± 0.241
0.661MetGlu: 0.661 ± 0.181
0.916MetPhe: 0.916 ± 0.244
1.73MetGly: 1.73 ± 0.269
0.407MetHis: 0.407 ± 0.136
0.763MetIle: 0.763 ± 0.18
0.509MetLys: 0.509 ± 0.175
2.493MetLeu: 2.493 ± 0.41
0.305MetMet: 0.305 ± 0.109
0.916MetAsn: 0.916 ± 0.218
1.272MetPro: 1.272 ± 0.233
0.661MetGln: 0.661 ± 0.194
1.374MetArg: 1.374 ± 0.292
2.035MetSer: 2.035 ± 0.375
2.137MetThr: 2.137 ± 0.314
1.526MetVal: 1.526 ± 0.31
0.458MetTrp: 0.458 ± 0.138
0.407MetTyr: 0.407 ± 0.147
0.0MetXaa: 0.0 ± 0.0
Asn
3.867AsnAla: 3.867 ± 0.618
0.204AsnCys: 0.204 ± 0.09
1.832AsnAsp: 1.832 ± 0.291
1.119AsnGlu: 1.119 ± 0.217
0.458AsnPhe: 0.458 ± 0.205
3.612AsnGly: 3.612 ± 0.39
0.611AsnHis: 0.611 ± 0.17
0.916AsnIle: 0.916 ± 0.247
0.967AsnLys: 0.967 ± 0.229
1.781AsnLeu: 1.781 ± 0.378
0.56AsnMet: 0.56 ± 0.138
0.865AsnAsn: 0.865 ± 0.233
2.798AsnPro: 2.798 ± 0.36
0.916AsnGln: 0.916 ± 0.202
1.577AsnArg: 1.577 ± 0.317
1.17AsnSer: 1.17 ± 0.247
2.035AsnThr: 2.035 ± 0.377
2.29AsnVal: 2.29 ± 0.271
0.56AsnTrp: 0.56 ± 0.129
0.56AsnTyr: 0.56 ± 0.145
0.0AsnXaa: 0.0 ± 0.0
Pro
8.904ProAla: 8.904 ± 0.737
0.254ProCys: 0.254 ± 0.109
3.663ProAsp: 3.663 ± 0.589
3.358ProGlu: 3.358 ± 0.338
1.374ProPhe: 1.374 ± 0.262
4.579ProGly: 4.579 ± 0.441
1.221ProHis: 1.221 ± 0.229
2.646ProIle: 2.646 ± 0.276
2.646ProLys: 2.646 ± 0.498
3.562ProLeu: 3.562 ± 0.471
1.17ProMet: 1.17 ± 0.283
1.425ProAsn: 1.425 ± 0.288
2.951ProPro: 2.951 ± 0.438
2.086ProGln: 2.086 ± 0.391
3.409ProArg: 3.409 ± 0.609
2.595ProSer: 2.595 ± 0.378
3.409ProThr: 3.409 ± 0.409
3.969ProVal: 3.969 ± 0.49
0.865ProTrp: 0.865 ± 0.219
1.17ProTyr: 1.17 ± 0.216
0.0ProXaa: 0.0 ± 0.0
Gln
4.935GlnAla: 4.935 ± 0.583
0.204GlnCys: 0.204 ± 0.108
1.272GlnAsp: 1.272 ± 0.254
1.73GlnGlu: 1.73 ± 0.234
0.916GlnPhe: 0.916 ± 0.165
1.679GlnGly: 1.679 ± 0.317
0.712GlnHis: 0.712 ± 0.229
1.883GlnIle: 1.883 ± 0.278
1.068GlnLys: 1.068 ± 0.232
3.205GlnLeu: 3.205 ± 0.475
1.221GlnMet: 1.221 ± 0.245
0.712GlnAsn: 0.712 ± 0.236
2.239GlnPro: 2.239 ± 0.34
1.323GlnGln: 1.323 ± 0.298
3.409GlnArg: 3.409 ± 0.36
1.781GlnSer: 1.781 ± 0.308
1.832GlnThr: 1.832 ± 0.259
2.646GlnVal: 2.646 ± 0.403
0.967GlnTrp: 0.967 ± 0.201
0.916GlnTyr: 0.916 ± 0.234
0.0GlnXaa: 0.0 ± 0.0
Arg
8.395ArgAla: 8.395 ± 0.85
0.916ArgCys: 0.916 ± 0.226
4.528ArgAsp: 4.528 ± 0.506
4.07ArgGlu: 4.07 ± 0.613
2.29ArgPhe: 2.29 ± 0.304
5.749ArgGly: 5.749 ± 0.656
1.883ArgHis: 1.883 ± 0.357
4.325ArgIle: 4.325 ± 0.373
2.798ArgLys: 2.798 ± 0.367
6.004ArgLeu: 6.004 ± 0.498
1.73ArgMet: 1.73 ± 0.34
2.493ArgAsn: 2.493 ± 0.395
3.969ArgPro: 3.969 ± 0.611
3.155ArgGln: 3.155 ± 0.374
6.92ArgArg: 6.92 ± 0.881
3.358ArgSer: 3.358 ± 0.489
4.477ArgThr: 4.477 ± 0.437
5.699ArgVal: 5.699 ± 0.665
1.679ArgTrp: 1.679 ± 0.293
1.832ArgTyr: 1.832 ± 0.331
0.0ArgXaa: 0.0 ± 0.0
Ser
5.8SerAla: 5.8 ± 0.604
0.56SerCys: 0.56 ± 0.171
3.002SerAsp: 3.002 ± 0.339
3.307SerGlu: 3.307 ± 0.446
1.068SerPhe: 1.068 ± 0.194
4.528SerGly: 4.528 ± 0.723
0.865SerHis: 0.865 ± 0.231
2.188SerIle: 2.188 ± 0.392
1.374SerLys: 1.374 ± 0.225
3.155SerLeu: 3.155 ± 0.415
1.425SerMet: 1.425 ± 0.325
1.374SerAsn: 1.374 ± 0.254
2.493SerPro: 2.493 ± 0.42
1.73SerGln: 1.73 ± 0.305
3.256SerArg: 3.256 ± 0.424
3.053SerSer: 3.053 ± 0.4
4.427SerThr: 4.427 ± 0.44
3.867SerVal: 3.867 ± 0.544
1.679SerTrp: 1.679 ± 0.294
1.119SerTyr: 1.119 ± 0.202
0.0SerXaa: 0.0 ± 0.0
Thr
8.548ThrAla: 8.548 ± 0.91
0.509ThrCys: 0.509 ± 0.197
3.511ThrAsp: 3.511 ± 0.438
3.409ThrGlu: 3.409 ± 0.456
2.086ThrPhe: 2.086 ± 0.334
5.902ThrGly: 5.902 ± 0.555
1.476ThrHis: 1.476 ± 0.297
3.205ThrIle: 3.205 ± 0.351
2.29ThrLys: 2.29 ± 0.341
4.579ThrLeu: 4.579 ± 0.577
1.018ThrMet: 1.018 ± 0.231
1.73ThrAsn: 1.73 ± 0.278
3.46ThrPro: 3.46 ± 0.481
1.577ThrGln: 1.577 ± 0.265
4.274ThrArg: 4.274 ± 0.503
3.002ThrSer: 3.002 ± 0.47
4.477ThrThr: 4.477 ± 0.603
7.123ThrVal: 7.123 ± 0.651
1.323ThrTrp: 1.323 ± 0.275
1.272ThrTyr: 1.272 ± 0.277
0.0ThrXaa: 0.0 ± 0.0
Val
8.446ValAla: 8.446 ± 0.699
0.814ValCys: 0.814 ± 0.189
7.072ValAsp: 7.072 ± 0.745
4.376ValGlu: 4.376 ± 0.487
2.544ValPhe: 2.544 ± 0.387
6.106ValGly: 6.106 ± 0.579
2.188ValHis: 2.188 ± 0.354
3.104ValIle: 3.104 ± 0.341
2.9ValLys: 2.9 ± 0.483
5.495ValLeu: 5.495 ± 0.621
1.425ValMet: 1.425 ± 0.238
2.34ValAsn: 2.34 ± 0.251
4.172ValPro: 4.172 ± 0.554
2.697ValGln: 2.697 ± 0.326
5.749ValArg: 5.749 ± 0.667
4.02ValSer: 4.02 ± 0.433
5.139ValThr: 5.139 ± 0.65
6.564ValVal: 6.564 ± 0.713
1.17ValTrp: 1.17 ± 0.252
1.476ValTyr: 1.476 ± 0.248
0.0ValXaa: 0.0 ± 0.0
Trp
2.188TrpAla: 2.188 ± 0.363
0.204TrpCys: 0.204 ± 0.093
1.17TrpAsp: 1.17 ± 0.311
0.916TrpGlu: 0.916 ± 0.198
0.611TrpPhe: 0.611 ± 0.152
1.221TrpGly: 1.221 ± 0.252
0.356TrpHis: 0.356 ± 0.143
0.661TrpIle: 0.661 ± 0.18
0.407TrpLys: 0.407 ± 0.119
2.544TrpLeu: 2.544 ± 0.356
0.305TrpMet: 0.305 ± 0.122
0.712TrpAsn: 0.712 ± 0.189
1.068TrpPro: 1.068 ± 0.253
0.916TrpGln: 0.916 ± 0.255
1.883TrpArg: 1.883 ± 0.264
1.119TrpSer: 1.119 ± 0.239
1.425TrpThr: 1.425 ± 0.263
1.272TrpVal: 1.272 ± 0.26
0.305TrpTrp: 0.305 ± 0.139
0.458TrpTyr: 0.458 ± 0.13
0.0TrpXaa: 0.0 ± 0.0
Tyr
2.34TyrAla: 2.34 ± 0.307
0.254TyrCys: 0.254 ± 0.103
1.221TyrAsp: 1.221 ± 0.305
1.272TyrGlu: 1.272 ± 0.275
0.611TyrPhe: 0.611 ± 0.162
2.391TyrGly: 2.391 ± 0.354
0.153TyrHis: 0.153 ± 0.089
0.814TyrIle: 0.814 ± 0.235
0.458TyrLys: 0.458 ± 0.131
1.73TyrLeu: 1.73 ± 0.297
0.254TyrMet: 0.254 ± 0.108
0.611TyrAsn: 0.611 ± 0.145
1.272TyrPro: 1.272 ± 0.231
0.916TyrGln: 0.916 ± 0.234
1.73TyrArg: 1.73 ± 0.314
1.374TyrSer: 1.374 ± 0.263
1.73TyrThr: 1.73 ± 0.332
1.679TyrVal: 1.679 ± 0.299
0.407TyrTrp: 0.407 ± 0.133
0.56TyrTyr: 0.56 ± 0.23
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 97 proteins (19655 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski