Amino acid dipepetide frequency for Huangpi Tick Virus 1

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
3.17AlaAla: 3.17 ± 0.81
1.585AlaCys: 1.585 ± 0.682
1.585AlaAsp: 1.585 ± 0.578
3.346AlaGlu: 3.346 ± 0.744
2.642AlaPhe: 2.642 ± 0.456
4.051AlaGly: 4.051 ± 2.227
0.352AlaHis: 0.352 ± 0.435
3.875AlaIle: 3.875 ± 0.866
2.818AlaLys: 2.818 ± 0.867
5.284AlaLeu: 5.284 ± 1.285
1.233AlaMet: 1.233 ± 0.234
2.818AlaAsn: 2.818 ± 0.725
1.409AlaPro: 1.409 ± 1.446
2.113AlaGln: 2.113 ± 0.61
2.994AlaArg: 2.994 ± 0.536
3.875AlaSer: 3.875 ± 0.631
2.29AlaThr: 2.29 ± 0.304
5.636AlaVal: 5.636 ± 1.26
1.585AlaTrp: 1.585 ± 1.29
1.585AlaTyr: 1.585 ± 0.428
0.0AlaXaa: 0.0 ± 0.0
Cys
1.761CysAla: 1.761 ± 0.742
1.233CysCys: 1.233 ± 0.421
1.057CysAsp: 1.057 ± 0.479
1.937CysGlu: 1.937 ± 0.565
1.761CysPhe: 1.761 ± 0.262
0.704CysGly: 0.704 ± 0.377
0.881CysHis: 0.881 ± 0.288
1.409CysIle: 1.409 ± 0.511
1.057CysLys: 1.057 ± 0.285
2.29CysLeu: 2.29 ± 0.59
0.528CysMet: 0.528 ± 0.283
1.585CysAsn: 1.585 ± 0.94
2.29CysPro: 2.29 ± 0.9
0.881CysGln: 0.881 ± 0.251
1.057CysArg: 1.057 ± 0.285
2.642CysSer: 2.642 ± 0.863
1.585CysThr: 1.585 ± 0.601
1.409CysVal: 1.409 ± 0.903
0.352CysTrp: 0.352 ± 0.16
0.528CysTyr: 0.528 ± 0.142
0.0CysXaa: 0.0 ± 0.0
Asp
2.29AspAla: 2.29 ± 0.304
2.29AspCys: 2.29 ± 0.918
2.113AspAsp: 2.113 ± 0.515
2.818AspGlu: 2.818 ± 1.022
3.17AspPhe: 3.17 ± 0.5
1.937AspGly: 1.937 ± 0.56
0.704AspHis: 0.704 ± 0.32
2.994AspIle: 2.994 ± 0.235
2.29AspLys: 2.29 ± 0.754
5.284AspLeu: 5.284 ± 1.386
1.409AspMet: 1.409 ± 0.754
1.409AspAsn: 1.409 ± 0.247
1.409AspPro: 1.409 ± 0.903
1.233AspGln: 1.233 ± 0.421
3.17AspArg: 3.17 ± 0.856
4.755AspSer: 4.755 ± 0.619
3.346AspThr: 3.346 ± 1.409
2.466AspVal: 2.466 ± 0.624
1.937AspTrp: 1.937 ± 1.107
0.881AspTyr: 0.881 ± 0.38
0.0AspXaa: 0.0 ± 0.0
Glu
4.403GluAla: 4.403 ± 1.527
1.233GluCys: 1.233 ± 0.234
2.994GluAsp: 2.994 ± 0.353
6.34GluGlu: 6.34 ± 0.849
1.937GluPhe: 1.937 ± 0.583
3.875GluGly: 3.875 ± 0.64
2.466GluHis: 2.466 ± 1.114
4.403GluIle: 4.403 ± 1.113
4.755GluLys: 4.755 ± 0.418
6.164GluLeu: 6.164 ± 2.103
1.585GluMet: 1.585 ± 0.602
2.466GluAsn: 2.466 ± 0.821
2.818GluPro: 2.818 ± 0.983
3.698GluGln: 3.698 ± 1.264
2.29GluArg: 2.29 ± 0.607
4.579GluSer: 4.579 ± 1.716
4.227GluThr: 4.227 ± 1.335
6.692GluVal: 6.692 ± 1.025
0.528GluTrp: 0.528 ± 1.005
2.29GluTyr: 2.29 ± 0.601
0.0GluXaa: 0.0 ± 0.0
Phe
1.585PheAla: 1.585 ± 0.427
1.233PheCys: 1.233 ± 0.292
1.057PheAsp: 1.057 ± 0.334
3.17PheGlu: 3.17 ± 0.5
2.642PhePhe: 2.642 ± 0.669
2.994PheGly: 2.994 ± 0.881
1.057PheHis: 1.057 ± 0.479
2.113PheIle: 2.113 ± 0.084
2.642PheLys: 2.642 ± 1.207
4.227PheLeu: 4.227 ± 0.134
1.233PheMet: 1.233 ± 0.421
1.409PheAsn: 1.409 ± 0.362
1.233PhePro: 1.233 ± 1.259
1.585PheGln: 1.585 ± 0.578
1.937PheArg: 1.937 ± 0.565
5.46PheSer: 5.46 ± 0.794
2.642PheThr: 2.642 ± 1.207
2.642PheVal: 2.642 ± 0.724
0.528PheTrp: 0.528 ± 0.142
1.057PheTyr: 1.057 ± 0.406
0.0PheXaa: 0.0 ± 0.0
Gly
2.113GlyAla: 2.113 ± 1.047
1.761GlyCys: 1.761 ± 1.061
4.051GlyAsp: 4.051 ± 1.857
2.466GlyGlu: 2.466 ± 0.819
2.113GlyPhe: 2.113 ± 1.131
3.17GlyGly: 3.17 ± 0.997
2.29GlyHis: 2.29 ± 0.055
3.522GlyIle: 3.522 ± 0.524
4.403GlyLys: 4.403 ± 0.106
5.46GlyLeu: 5.46 ± 1.354
0.881GlyMet: 0.881 ± 0.38
1.585GlyAsn: 1.585 ± 0.682
2.29GlyPro: 2.29 ± 0.9
1.233GlyGln: 1.233 ± 0.443
2.642GlyArg: 2.642 ± 0.456
3.698GlySer: 3.698 ± 0.163
2.29GlyThr: 2.29 ± 0.601
2.29GlyVal: 2.29 ± 0.674
0.352GlyTrp: 0.352 ± 0.189
1.761GlyTyr: 1.761 ± 1.17
0.0GlyXaa: 0.0 ± 0.0
His
1.761HisAla: 1.761 ± 0.093
0.704HisCys: 0.704 ± 0.181
0.704HisAsp: 0.704 ± 0.32
1.585HisGlu: 1.585 ± 0.355
0.881HisPhe: 0.881 ± 0.53
1.585HisGly: 1.585 ± 0.601
0.528HisHis: 0.528 ± 0.142
1.761HisIle: 1.761 ± 0.093
1.761HisLys: 1.761 ± 0.262
2.466HisLeu: 2.466 ± 0.436
1.057HisMet: 1.057 ± 0.566
0.881HisAsn: 0.881 ± 0.288
0.881HisPro: 0.881 ± 0.309
0.528HisGln: 0.528 ± 0.921
0.881HisArg: 0.881 ± 0.472
2.466HisSer: 2.466 ± 0.624
2.113HisThr: 2.113 ± 0.543
1.937HisVal: 1.937 ± 0.759
0.528HisTrp: 0.528 ± 0.373
0.704HisTyr: 0.704 ± 0.378
0.0HisXaa: 0.0 ± 0.0
Ile
2.642IleAla: 2.642 ± 0.196
2.113IleCys: 2.113 ± 0.543
2.642IleAsp: 2.642 ± 0.754
4.051IleGlu: 4.051 ± 0.504
1.761IlePhe: 1.761 ± 0.575
2.642IleGly: 2.642 ± 0.38
1.937IleHis: 1.937 ± 0.61
4.051IleIle: 4.051 ± 0.915
5.988IleLys: 5.988 ± 0.812
5.636IleLeu: 5.636 ± 1.227
0.704IleMet: 0.704 ± 0.285
1.761IleAsn: 1.761 ± 0.742
2.466IlePro: 2.466 ± 0.624
2.113IleGln: 2.113 ± 0.216
3.17IleArg: 3.17 ± 0.332
6.516IleSer: 6.516 ± 1.191
2.466IleThr: 2.466 ± 0.468
3.346IleVal: 3.346 ± 0.337
0.528IleTrp: 0.528 ± 0.142
1.585IleTyr: 1.585 ± 0.578
0.0IleXaa: 0.0 ± 0.0
Lys
5.107LysAla: 5.107 ± 2.566
1.233LysCys: 1.233 ± 0.443
3.522LysAsp: 3.522 ± 0.852
4.755LysGlu: 4.755 ± 0.619
2.113LysPhe: 2.113 ± 0.216
2.818LysGly: 2.818 ± 0.966
1.057LysHis: 1.057 ± 0.334
1.761LysIle: 1.761 ± 0.944
5.812LysLys: 5.812 ± 0.422
8.982LysLeu: 8.982 ± 1.534
1.937LysMet: 1.937 ± 0.137
2.466LysAsn: 2.466 ± 0.583
3.875LysPro: 3.875 ± 0.34
2.29LysGln: 2.29 ± 0.601
2.113LysArg: 2.113 ± 0.216
4.755LysSer: 4.755 ± 1.417
4.403LysThr: 4.403 ± 1.624
5.812LysVal: 5.812 ± 1.217
1.585LysTrp: 1.585 ± 0.682
3.17LysTyr: 3.17 ± 0.585
0.0LysXaa: 0.0 ± 0.0
Leu
6.516LeuAla: 6.516 ± 0.451
1.761LeuCys: 1.761 ± 0.575
5.107LeuAsp: 5.107 ± 0.365
6.516LeuGlu: 6.516 ± 0.86
4.579LeuPhe: 4.579 ± 1.075
4.051LeuGly: 4.051 ± 0.571
2.466LeuHis: 2.466 ± 0.395
4.931LeuIle: 4.931 ± 0.972
8.454LeuLys: 8.454 ± 0.864
10.039LeuLeu: 10.039 ± 2.479
2.466LeuMet: 2.466 ± 0.821
4.931LeuAsn: 4.931 ± 0.239
5.988LeuPro: 5.988 ± 1.679
3.698LeuGln: 3.698 ± 0.788
4.579LeuArg: 4.579 ± 1.202
9.334LeuSer: 9.334 ± 1.418
6.164LeuThr: 6.164 ± 0.532
7.221LeuVal: 7.221 ± 2.438
0.528LeuTrp: 0.528 ± 0.142
1.585LeuTyr: 1.585 ± 0.578
0.0LeuXaa: 0.0 ± 0.0
Met
1.409MetAla: 1.409 ± 0.752
0.704MetCys: 0.704 ± 0.32
1.057MetAsp: 1.057 ± 0.257
1.233MetGlu: 1.233 ± 0.234
1.057MetPhe: 1.057 ± 0.334
0.704MetGly: 0.704 ± 0.592
0.352MetHis: 0.352 ± 0.441
1.761MetIle: 1.761 ± 0.653
2.994MetLys: 2.994 ± 1.252
3.17MetLeu: 3.17 ± 0.5
0.352MetMet: 0.352 ± 0.189
0.881MetAsn: 0.881 ± 0.251
0.704MetPro: 0.704 ± 0.378
0.352MetGln: 0.352 ± 0.189
1.409MetArg: 1.409 ± 0.247
1.585MetSer: 1.585 ± 0.602
1.761MetThr: 1.761 ± 0.624
1.761MetVal: 1.761 ± 0.262
0.352MetTrp: 0.352 ± 0.16
0.704MetTyr: 0.704 ± 0.32
0.0MetXaa: 0.0 ± 0.0
Asn
2.113AsnAla: 2.113 ± 0.682
0.704AsnCys: 0.704 ± 0.32
1.233AsnAsp: 1.233 ± 0.292
2.29AsnGlu: 2.29 ± 0.674
1.409AsnPhe: 1.409 ± 0.247
2.642AsnGly: 2.642 ± 0.724
1.233AsnHis: 1.233 ± 0.443
3.522AsnIle: 3.522 ± 1.006
2.113AsnLys: 2.113 ± 0.812
4.227AsnLeu: 4.227 ± 0.134
1.057AsnMet: 1.057 ± 0.435
2.113AsnAsn: 2.113 ± 0.216
1.233AsnPro: 1.233 ± 0.234
1.409AsnGln: 1.409 ± 0.916
2.113AsnArg: 2.113 ± 0.383
5.284AsnSer: 5.284 ± 0.694
2.466AsnThr: 2.466 ± 0.678
1.409AsnVal: 1.409 ± 1.308
1.409AsnTrp: 1.409 ± 0.639
1.057AsnTyr: 1.057 ± 0.334
0.0AsnXaa: 0.0 ± 0.0
Pro
1.937ProAla: 1.937 ± 0.56
1.057ProCys: 1.057 ± 0.406
1.761ProAsp: 1.761 ± 0.093
5.636ProGlu: 5.636 ± 0.883
1.937ProPhe: 1.937 ± 0.56
1.937ProGly: 1.937 ± 1.567
0.704ProHis: 0.704 ± 0.32
2.29ProIle: 2.29 ± 1.537
1.585ProLys: 1.585 ± 0.848
3.522ProLeu: 3.522 ± 0.955
0.881ProMet: 0.881 ± 0.472
1.409ProAsn: 1.409 ± 0.425
1.409ProPro: 1.409 ± 0.425
0.881ProGln: 0.881 ± 0.251
1.585ProArg: 1.585 ± 0.85
4.403ProSer: 4.403 ± 0.565
3.17ProThr: 3.17 ± 0.829
2.994ProVal: 2.994 ± 1.379
0.704ProTrp: 0.704 ± 0.377
1.233ProTyr: 1.233 ± 0.312
0.0ProXaa: 0.0 ± 0.0
Gln
1.585GlnAla: 1.585 ± 0.644
1.233GlnCys: 1.233 ± 0.312
1.057GlnAsp: 1.057 ± 0.334
3.17GlnGlu: 3.17 ± 0.5
2.466GlnPhe: 2.466 ± 0.436
1.057GlnGly: 1.057 ± 0.257
1.409GlnHis: 1.409 ± 0.425
1.585GlnIle: 1.585 ± 0.682
1.937GlnLys: 1.937 ± 0.61
3.346GlnLeu: 3.346 ± 0.935
0.881GlnMet: 0.881 ± 0.472
1.937GlnAsn: 1.937 ± 0.433
0.881GlnPro: 0.881 ± 0.251
2.818GlnGln: 2.818 ± 0.983
2.113GlnArg: 2.113 ± 0.084
2.113GlnSer: 2.113 ± 0.667
1.761GlnThr: 1.761 ± 0.357
3.17GlnVal: 3.17 ± 0.301
0.528GlnTrp: 0.528 ± 0.921
1.233GlnTyr: 1.233 ± 0.827
0.0GlnXaa: 0.0 ± 0.0
Arg
2.818ArgAla: 2.818 ± 0.341
1.409ArgCys: 1.409 ± 0.639
2.642ArgAsp: 2.642 ± 0.932
2.642ArgGlu: 2.642 ± 0.487
1.761ArgPhe: 1.761 ± 0.45
1.409ArgGly: 1.409 ± 0.916
1.057ArgHis: 1.057 ± 0.334
3.346ArgIle: 3.346 ± 0.929
3.346ArgLys: 3.346 ± 0.379
6.34ArgLeu: 6.34 ± 1.482
1.409ArgMet: 1.409 ± 0.204
2.466ArgAsn: 2.466 ± 0.395
2.29ArgPro: 2.29 ± 0.711
2.113ArgGln: 2.113 ± 0.514
3.17ArgArg: 3.17 ± 0.997
3.698ArgSer: 3.698 ± 0.782
2.466ArgThr: 2.466 ± 0.468
3.522ArgVal: 3.522 ± 1.36
0.176ArgTrp: 0.176 ± 0.094
1.761ArgTyr: 1.761 ± 0.357
0.0ArgXaa: 0.0 ± 0.0
Ser
5.284SerAla: 5.284 ± 0.28
2.642SerCys: 2.642 ± 0.456
6.34SerAsp: 6.34 ± 1.17
5.46SerGlu: 5.46 ± 1.473
3.698SerPhe: 3.698 ± 0.416
4.227SerGly: 4.227 ± 0.63
2.29SerHis: 2.29 ± 0.607
5.988SerIle: 5.988 ± 0.707
4.579SerLys: 4.579 ± 1.007
8.454SerLeu: 8.454 ± 3.071
1.937SerMet: 1.937 ± 0.49
3.522SerAsn: 3.522 ± 0.463
2.642SerPro: 2.642 ± 0.472
1.761SerGln: 1.761 ± 0.093
5.46SerArg: 5.46 ± 1.046
10.391SerSer: 10.391 ± 2.545
5.636SerThr: 5.636 ± 0.147
5.812SerVal: 5.812 ± 0.142
0.528SerTrp: 0.528 ± 0.142
2.29SerTyr: 2.29 ± 0.59
0.0SerXaa: 0.0 ± 0.0
Thr
2.642ThrAla: 2.642 ± 1.033
1.057ThrCys: 1.057 ± 1.032
4.051ThrAsp: 4.051 ± 0.699
4.579ThrGlu: 4.579 ± 1.007
2.642ThrPhe: 2.642 ± 0.472
5.46ThrGly: 5.46 ± 2.169
1.409ThrHis: 1.409 ± 0.425
2.642ThrIle: 2.642 ± 0.495
4.051ThrLys: 4.051 ± 0.251
5.812ThrLeu: 5.812 ± 0.422
1.409ThrMet: 1.409 ± 0.511
2.466ThrAsn: 2.466 ± 0.653
2.113ThrPro: 2.113 ± 0.084
1.409ThrGln: 1.409 ± 0.362
3.17ThrArg: 3.17 ± 1.205
4.755ThrSer: 4.755 ± 0.412
2.994ThrThr: 2.994 ± 0.917
4.403ThrVal: 4.403 ± 1.432
0.704ThrTrp: 0.704 ± 0.567
1.937ThrTyr: 1.937 ± 0.49
0.0ThrXaa: 0.0 ± 0.0
Val
3.698ValAla: 3.698 ± 0.432
1.409ValCys: 1.409 ± 0.425
2.642ValAsp: 2.642 ± 0.932
4.579ValGlu: 4.579 ± 1.214
2.113ValPhe: 2.113 ± 1.281
2.642ValGly: 2.642 ± 1.338
1.937ValHis: 1.937 ± 0.565
2.818ValIle: 2.818 ± 0.725
5.284ValLys: 5.284 ± 1.339
6.164ValLeu: 6.164 ± 1.213
2.466ValMet: 2.466 ± 0.523
3.346ValAsn: 3.346 ± 1.272
3.698ValPro: 3.698 ± 0.416
3.522ValGln: 3.522 ± 1.11
4.051ValArg: 4.051 ± 0.915
5.636ValSer: 5.636 ± 0.549
4.931ValThr: 4.931 ± 0.986
4.755ValVal: 4.755 ± 0.754
0.704ValTrp: 0.704 ± 0.592
2.466ValTyr: 2.466 ± 0.395
0.0ValXaa: 0.0 ± 0.0
Trp
0.528TrpAla: 0.528 ± 1.005
0.528TrpCys: 0.528 ± 0.142
0.352TrpAsp: 0.352 ± 0.16
1.057TrpGlu: 1.057 ± 1.032
0.704TrpPhe: 0.704 ± 0.932
1.233TrpGly: 1.233 ± 0.543
0.528TrpHis: 0.528 ± 0.395
0.704TrpIle: 0.704 ± 0.567
2.466TrpLys: 2.466 ± 1.532
1.233TrpLeu: 1.233 ± 0.443
0.352TrpMet: 0.352 ± 0.16
0.528TrpAsn: 0.528 ± 0.373
0.528TrpPro: 0.528 ± 0.142
0.704TrpGln: 0.704 ± 0.181
0.704TrpArg: 0.704 ± 0.567
0.352TrpSer: 0.352 ± 0.16
0.881TrpThr: 0.881 ± 0.862
0.704TrpVal: 0.704 ± 0.181
0.176TrpTrp: 0.176 ± 0.49
0.528TrpTyr: 0.528 ± 0.373
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.881TyrAla: 0.881 ± 0.251
1.057TyrCys: 1.057 ± 0.285
1.937TyrAsp: 1.937 ± 0.732
2.113TyrGlu: 2.113 ± 0.61
1.057TyrPhe: 1.057 ± 0.382
1.233TyrGly: 1.233 ± 0.421
1.057TyrHis: 1.057 ± 0.334
2.642TyrIle: 2.642 ± 0.495
1.233TyrLys: 1.233 ± 0.421
2.818TyrLeu: 2.818 ± 0.834
0.352TyrMet: 0.352 ± 0.16
1.233TyrAsn: 1.233 ± 0.292
0.881TyrPro: 0.881 ± 0.309
1.937TyrGln: 1.937 ± 0.857
1.233TyrArg: 1.233 ± 0.421
2.642TyrSer: 2.642 ± 0.196
2.113TyrThr: 2.113 ± 0.084
0.881TyrVal: 0.881 ± 0.472
1.057TyrTrp: 1.057 ± 0.257
0.881TyrTyr: 0.881 ± 0.288
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 3 proteins (5679 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski