Amino acid dipepetide frequency for Parry Creek virus

Apart from single amino acid frequencies one can also calculate so called amino acid dipeptide frequency.

There is 400 possibilites (441 if we consider Xaa as additional 21st amino acid representing all non-standard or unknown amino acids). Thus, if the dipeptides would be present randomly in proteins each amino acid dipeptide should be present with 0.25% frequency. As it is not the case in the nature, for better visablity all more than expected dipepetides are marked by red, and those which are underrepresented are marked by blue in the table.

All values are presented as per milles (‰), therefore need to be multiplied by 10-3.

For more information see sequence space article on Wikipedia.

AlaCysAspGluPheGlyHisIleLysLeuMetAsnProGlnArgSerThrValTrpTyrXaa
Ala
1.177AlaAla: 1.177 ± 0.576
0.471AlaCys: 0.471 ± 0.248
3.06AlaAsp: 3.06 ± 0.629
0.235AlaGlu: 0.235 ± 0.249
0.706AlaPhe: 0.706 ± 0.665
0.471AlaGly: 0.471 ± 0.435
0.235AlaHis: 0.235 ± 0.36
1.647AlaIle: 1.647 ± 0.754
1.647AlaLys: 1.647 ± 0.436
3.53AlaLeu: 3.53 ± 1.058
0.0AlaMet: 0.0 ± 0.0
1.883AlaAsn: 1.883 ± 0.558
1.412AlaPro: 1.412 ± 0.572
1.412AlaGln: 1.412 ± 0.505
1.177AlaArg: 1.177 ± 0.423
3.06AlaSer: 3.06 ± 0.785
2.353AlaThr: 2.353 ± 0.701
2.118AlaVal: 2.118 ± 0.704
1.412AlaTrp: 1.412 ± 0.591
2.118AlaTyr: 2.118 ± 0.619
0.0AlaXaa: 0.0 ± 0.0
Cys
0.941CysAla: 0.941 ± 0.377
1.177CysCys: 1.177 ± 1.314
0.235CysAsp: 0.235 ± 0.142
0.706CysGlu: 0.706 ± 0.39
0.471CysPhe: 0.471 ± 0.283
0.941CysGly: 0.941 ± 0.602
0.941CysHis: 0.941 ± 0.511
1.177CysIle: 1.177 ± 0.475
2.589CysLys: 2.589 ± 1.067
1.412CysLeu: 1.412 ± 1.497
0.235CysMet: 0.235 ± 0.142
1.412CysAsn: 1.412 ± 0.85
1.412CysPro: 1.412 ± 0.77
0.471CysGln: 0.471 ± 0.47
1.883CysArg: 1.883 ± 0.726
1.647CysSer: 1.647 ± 0.614
0.235CysThr: 0.235 ± 0.142
0.471CysVal: 0.471 ± 0.248
0.706CysTrp: 0.706 ± 0.425
0.0CysTyr: 0.0 ± 0.0
0.0CysXaa: 0.0 ± 0.0
Asp
1.177AspAla: 1.177 ± 1.135
1.177AspCys: 1.177 ± 0.258
5.413AspAsp: 5.413 ± 2.212
3.06AspGlu: 3.06 ± 0.737
3.53AspPhe: 3.53 ± 0.763
2.589AspGly: 2.589 ± 0.597
1.177AspHis: 1.177 ± 0.878
4.472AspIle: 4.472 ± 0.934
3.53AspLys: 3.53 ± 1.497
8.237AspLeu: 8.237 ± 1.646
1.647AspMet: 1.647 ± 0.652
3.766AspAsn: 3.766 ± 0.781
3.295AspPro: 3.295 ± 0.903
2.118AspGln: 2.118 ± 0.664
1.883AspArg: 1.883 ± 0.601
2.353AspSer: 2.353 ± 1.085
1.647AspThr: 1.647 ± 1.242
2.589AspVal: 2.589 ± 1.218
2.589AspTrp: 2.589 ± 0.42
4.472AspTyr: 4.472 ± 1.197
0.0AspXaa: 0.0 ± 0.0
Glu
0.941GluAla: 0.941 ± 0.306
0.941GluCys: 0.941 ± 0.877
4.472GluAsp: 4.472 ± 1.85
4.001GluGlu: 4.001 ± 1.185
4.001GluPhe: 4.001 ± 1.136
2.353GluGly: 2.353 ± 0.505
1.177GluHis: 1.177 ± 0.425
5.884GluIle: 5.884 ± 1.295
4.001GluLys: 4.001 ± 1.449
5.648GluLeu: 5.648 ± 0.971
1.177GluMet: 1.177 ± 0.475
3.53GluAsn: 3.53 ± 0.866
2.824GluPro: 2.824 ± 1.111
1.177GluGln: 1.177 ± 0.527
1.883GluArg: 1.883 ± 0.429
6.119GluSer: 6.119 ± 0.935
3.295GluThr: 3.295 ± 0.612
3.53GluVal: 3.53 ± 0.801
0.471GluTrp: 0.471 ± 0.589
2.118GluTyr: 2.118 ± 0.566
0.0GluXaa: 0.0 ± 0.0
Phe
2.118PheAla: 2.118 ± 0.758
1.177PheCys: 1.177 ± 0.64
2.824PheAsp: 2.824 ± 1.197
3.06PheGlu: 3.06 ± 1.054
2.589PhePhe: 2.589 ± 0.703
4.472PheGly: 4.472 ± 1.248
0.706PheHis: 0.706 ± 0.457
3.295PheIle: 3.295 ± 0.651
2.824PheLys: 2.824 ± 0.695
4.942PheLeu: 4.942 ± 0.779
1.412PheMet: 1.412 ± 0.678
2.118PheAsn: 2.118 ± 0.475
2.353PhePro: 2.353 ± 0.749
2.824PheGln: 2.824 ± 0.634
2.589PheArg: 2.589 ± 0.843
3.295PheSer: 3.295 ± 1.292
2.353PheThr: 2.353 ± 0.683
3.53PheVal: 3.53 ± 0.755
0.235PheTrp: 0.235 ± 0.142
1.883PheTyr: 1.883 ± 0.532
0.0PheXaa: 0.0 ± 0.0
Gly
0.941GlyAla: 0.941 ± 0.362
0.0GlyCys: 0.0 ± 0.0
4.001GlyAsp: 4.001 ± 0.909
3.766GlyGlu: 3.766 ± 1.655
3.766GlyPhe: 3.766 ± 0.558
2.353GlyGly: 2.353 ± 0.549
1.412GlyHis: 1.412 ± 1.401
4.707GlyIle: 4.707 ± 1.168
3.53GlyLys: 3.53 ± 0.662
7.767GlyLeu: 7.767 ± 0.797
1.412GlyMet: 1.412 ± 0.781
3.06GlyAsn: 3.06 ± 0.961
1.883GlyPro: 1.883 ± 0.615
2.353GlyGln: 2.353 ± 0.872
1.177GlyArg: 1.177 ± 0.966
6.59GlySer: 6.59 ± 1.224
2.589GlyThr: 2.589 ± 0.731
1.412GlyVal: 1.412 ± 0.533
0.471GlyTrp: 0.471 ± 0.257
0.706GlyTyr: 0.706 ± 0.397
0.0GlyXaa: 0.0 ± 0.0
His
0.941HisAla: 0.941 ± 0.228
0.235HisCys: 0.235 ± 0.295
0.941HisAsp: 0.941 ± 0.668
1.177HisGlu: 1.177 ± 0.538
0.706HisPhe: 0.706 ± 0.4
0.941HisGly: 0.941 ± 0.815
0.706HisHis: 0.706 ± 0.548
2.824HisIle: 2.824 ± 1.149
2.353HisLys: 2.353 ± 0.673
3.06HisLeu: 3.06 ± 1.019
1.177HisMet: 1.177 ± 0.823
0.471HisAsn: 0.471 ± 0.388
2.353HisPro: 2.353 ± 0.973
0.706HisGln: 0.706 ± 0.277
2.118HisArg: 2.118 ± 1.3
1.647HisSer: 1.647 ± 0.579
0.941HisThr: 0.941 ± 1.179
1.883HisVal: 1.883 ± 0.584
0.235HisTrp: 0.235 ± 0.142
1.177HisTyr: 1.177 ± 0.708
0.0HisXaa: 0.0 ± 0.0
Ile
1.412IleAla: 1.412 ± 0.349
2.353IleCys: 2.353 ± 0.782
6.59IleAsp: 6.59 ± 1.343
6.825IleGlu: 6.825 ± 1.069
3.53IlePhe: 3.53 ± 0.952
5.884IleGly: 5.884 ± 1.552
0.941IleHis: 0.941 ± 0.565
6.59IleIle: 6.59 ± 2.244
8.943IleLys: 8.943 ± 1.713
5.413IleLeu: 5.413 ± 1.596
2.589IleMet: 2.589 ± 0.672
4.942IleAsn: 4.942 ± 1.689
3.53IlePro: 3.53 ± 0.863
1.177IleGln: 1.177 ± 0.258
4.001IleArg: 4.001 ± 1.034
4.707IleSer: 4.707 ± 0.715
4.472IleThr: 4.472 ± 1.02
2.589IleVal: 2.589 ± 1.25
4.001IleTrp: 4.001 ± 1.109
3.53IleTyr: 3.53 ± 0.991
0.0IleXaa: 0.0 ± 0.0
Lys
1.647LysAla: 1.647 ± 0.619
1.412LysCys: 1.412 ± 1.051
3.06LysAsp: 3.06 ± 0.53
5.648LysGlu: 5.648 ± 0.937
2.824LysPhe: 2.824 ± 0.832
4.001LysGly: 4.001 ± 0.534
1.883LysHis: 1.883 ± 0.594
5.884LysIle: 5.884 ± 1.977
7.531LysLys: 7.531 ± 1.769
6.825LysLeu: 6.825 ± 1.491
2.353LysMet: 2.353 ± 0.889
5.648LysAsn: 5.648 ± 1.287
2.118LysPro: 2.118 ± 0.645
1.177LysGln: 1.177 ± 0.706
4.236LysArg: 4.236 ± 1.203
6.354LysSer: 6.354 ± 1.348
3.53LysThr: 3.53 ± 0.496
3.766LysVal: 3.766 ± 0.715
1.647LysTrp: 1.647 ± 0.803
2.824LysTyr: 2.824 ± 1.329
0.0LysXaa: 0.0 ± 0.0
Leu
4.001LeuAla: 4.001 ± 0.773
2.589LeuCys: 2.589 ± 0.692
4.942LeuAsp: 4.942 ± 1.187
7.06LeuGlu: 7.06 ± 1.032
4.236LeuPhe: 4.236 ± 0.72
5.413LeuGly: 5.413 ± 0.977
2.353LeuHis: 2.353 ± 1.078
10.826LeuIle: 10.826 ± 1.316
7.531LeuLys: 7.531 ± 0.779
10.355LeuLeu: 10.355 ± 1.423
3.06LeuMet: 3.06 ± 0.632
6.59LeuAsn: 6.59 ± 1.329
2.118LeuPro: 2.118 ± 0.516
4.001LeuGln: 4.001 ± 1.113
3.06LeuArg: 3.06 ± 0.966
8.473LeuSer: 8.473 ± 1.017
5.413LeuThr: 5.413 ± 1.692
5.413LeuVal: 5.413 ± 1.082
0.706LeuTrp: 0.706 ± 0.365
2.118LeuTyr: 2.118 ± 0.496
0.0LeuXaa: 0.0 ± 0.0
Met
0.941MetAla: 0.941 ± 0.439
0.0MetCys: 0.0 ± 0.0
0.941MetAsp: 0.941 ± 0.306
1.883MetGlu: 1.883 ± 0.597
1.412MetPhe: 1.412 ± 0.62
2.118MetGly: 2.118 ± 0.684
0.471MetHis: 0.471 ± 0.299
2.824MetIle: 2.824 ± 0.586
1.647MetLys: 1.647 ± 0.924
1.883MetLeu: 1.883 ± 0.529
0.706MetMet: 0.706 ± 0.562
0.941MetAsn: 0.941 ± 0.566
0.0MetPro: 0.0 ± 0.0
0.706MetGln: 0.706 ± 0.277
0.941MetArg: 0.941 ± 0.727
1.883MetSer: 1.883 ± 0.59
1.177MetThr: 1.177 ± 0.507
1.883MetVal: 1.883 ± 0.919
0.706MetTrp: 0.706 ± 0.355
0.941MetTyr: 0.941 ± 0.821
0.0MetXaa: 0.0 ± 0.0
Asn
2.824AsnAla: 2.824 ± 0.932
1.647AsnCys: 1.647 ± 0.614
2.353AsnAsp: 2.353 ± 0.682
4.001AsnGlu: 4.001 ± 0.778
3.53AsnPhe: 3.53 ± 1.531
2.118AsnGly: 2.118 ± 0.891
1.647AsnHis: 1.647 ± 0.625
5.648AsnIle: 5.648 ± 1.293
3.766AsnLys: 3.766 ± 0.632
8.473AsnLeu: 8.473 ± 1.061
0.941AsnMet: 0.941 ± 0.349
3.295AsnAsn: 3.295 ± 1.017
4.236AsnPro: 4.236 ± 0.773
2.824AsnGln: 2.824 ± 1.1
1.412AsnArg: 1.412 ± 0.725
4.472AsnSer: 4.472 ± 1.082
1.647AsnThr: 1.647 ± 0.758
2.589AsnVal: 2.589 ± 1.27
1.647AsnTrp: 1.647 ± 0.436
3.295AsnTyr: 3.295 ± 1.005
0.0AsnXaa: 0.0 ± 0.0
Pro
1.177ProAla: 1.177 ± 0.354
0.235ProCys: 0.235 ± 0.421
3.53ProAsp: 3.53 ± 0.683
1.647ProGlu: 1.647 ± 0.601
1.412ProPhe: 1.412 ± 0.525
2.353ProGly: 2.353 ± 0.93
2.353ProHis: 2.353 ± 0.881
4.472ProIle: 4.472 ± 0.63
2.118ProLys: 2.118 ± 0.849
4.942ProLeu: 4.942 ± 1.478
0.471ProMet: 0.471 ± 0.601
2.824ProAsn: 2.824 ± 0.771
2.824ProPro: 2.824 ± 0.626
1.647ProGln: 1.647 ± 1.318
0.941ProArg: 0.941 ± 0.349
4.472ProSer: 4.472 ± 1.069
1.883ProThr: 1.883 ± 0.736
1.412ProVal: 1.412 ± 0.692
0.706ProTrp: 0.706 ± 0.461
2.118ProTyr: 2.118 ± 0.547
0.0ProXaa: 0.0 ± 0.0
Gln
0.471GlnAla: 0.471 ± 0.257
0.706GlnCys: 0.706 ± 0.344
1.883GlnAsp: 1.883 ± 0.892
1.883GlnGlu: 1.883 ± 0.737
1.647GlnPhe: 1.647 ± 0.776
1.412GlnGly: 1.412 ± 0.623
0.471GlnHis: 0.471 ± 0.257
2.118GlnIle: 2.118 ± 0.439
3.53GlnLys: 3.53 ± 0.545
2.118GlnLeu: 2.118 ± 0.528
0.941GlnMet: 0.941 ± 0.495
1.647GlnAsn: 1.647 ± 0.361
2.118GlnPro: 2.118 ± 0.901
0.0GlnGln: 0.0 ± 0.0
0.941GlnArg: 0.941 ± 0.518
3.06GlnSer: 3.06 ± 1.051
1.883GlnThr: 1.883 ± 0.784
1.883GlnVal: 1.883 ± 0.578
0.471GlnTrp: 0.471 ± 0.283
0.706GlnTyr: 0.706 ± 0.294
0.0GlnXaa: 0.0 ± 0.0
Arg
1.883ArgAla: 1.883 ± 0.395
0.471ArgCys: 0.471 ± 0.283
2.589ArgAsp: 2.589 ± 0.699
2.353ArgGlu: 2.353 ± 0.722
3.295ArgPhe: 3.295 ± 1.119
1.647ArgGly: 1.647 ± 0.694
1.177ArgHis: 1.177 ± 0.391
3.06ArgIle: 3.06 ± 0.839
2.353ArgLys: 2.353 ± 0.55
2.589ArgLeu: 2.589 ± 1.139
0.941ArgMet: 0.941 ± 0.362
2.824ArgAsn: 2.824 ± 0.69
1.883ArgPro: 1.883 ± 0.475
0.471ArgGln: 0.471 ± 0.283
2.353ArgArg: 2.353 ± 0.813
3.53ArgSer: 3.53 ± 0.592
3.766ArgThr: 3.766 ± 0.719
2.353ArgVal: 2.353 ± 0.754
0.706ArgTrp: 0.706 ± 0.277
0.706ArgTyr: 0.706 ± 0.276
0.0ArgXaa: 0.0 ± 0.0
Ser
3.295SerAla: 3.295 ± 0.788
1.177SerCys: 1.177 ± 0.615
5.413SerAsp: 5.413 ± 0.751
4.001SerGlu: 4.001 ± 1.0
3.53SerPhe: 3.53 ± 0.695
4.707SerGly: 4.707 ± 1.879
3.766SerHis: 3.766 ± 1.065
5.884SerIle: 5.884 ± 1.163
4.707SerLys: 4.707 ± 1.165
9.414SerLeu: 9.414 ± 1.842
1.412SerMet: 1.412 ± 0.552
5.413SerAsn: 5.413 ± 0.667
3.53SerPro: 3.53 ± 1.4
2.353SerGln: 2.353 ± 0.831
3.53SerArg: 3.53 ± 1.03
5.648SerSer: 5.648 ± 1.05
4.707SerThr: 4.707 ± 1.179
4.001SerVal: 4.001 ± 1.362
2.589SerTrp: 2.589 ± 0.566
2.589SerTyr: 2.589 ± 0.392
0.0SerXaa: 0.0 ± 0.0
Thr
1.177ThrAla: 1.177 ± 0.494
1.177ThrCys: 1.177 ± 0.95
2.118ThrAsp: 2.118 ± 0.746
1.883ThrGlu: 1.883 ± 0.717
2.824ThrPhe: 2.824 ± 0.775
2.589ThrGly: 2.589 ± 0.466
2.353ThrHis: 2.353 ± 0.784
4.001ThrIle: 4.001 ± 0.973
3.06ThrLys: 3.06 ± 1.036
3.53ThrLeu: 3.53 ± 0.872
1.412ThrMet: 1.412 ± 0.669
3.766ThrAsn: 3.766 ± 0.684
1.883ThrPro: 1.883 ± 0.444
1.412ThrGln: 1.412 ± 0.5
2.353ThrArg: 2.353 ± 0.463
5.178ThrSer: 5.178 ± 0.926
2.589ThrThr: 2.589 ± 0.711
1.883ThrVal: 1.883 ± 0.928
0.941ThrTrp: 0.941 ± 0.326
2.824ThrTyr: 2.824 ± 1.045
0.0ThrXaa: 0.0 ± 0.0
Val
0.706ValAla: 0.706 ± 0.496
1.177ValCys: 1.177 ± 0.505
2.353ValAsp: 2.353 ± 0.446
2.353ValGlu: 2.353 ± 0.869
2.118ValPhe: 2.118 ± 0.556
3.06ValGly: 3.06 ± 1.48
0.471ValHis: 0.471 ± 0.483
3.53ValIle: 3.53 ± 0.877
3.295ValLys: 3.295 ± 0.699
4.707ValLeu: 4.707 ± 0.808
0.706ValMet: 0.706 ± 0.365
4.001ValAsn: 4.001 ± 1.527
1.883ValPro: 1.883 ± 0.737
1.412ValGln: 1.412 ± 0.598
2.118ValArg: 2.118 ± 0.293
4.472ValSer: 4.472 ± 0.626
2.824ValThr: 2.824 ± 0.925
2.353ValVal: 2.353 ± 1.46
1.412ValTrp: 1.412 ± 0.601
2.824ValTyr: 2.824 ± 1.784
0.0ValXaa: 0.0 ± 0.0
Trp
1.647TrpAla: 1.647 ± 0.715
0.235TrpCys: 0.235 ± 0.425
1.412TrpAsp: 1.412 ± 0.553
1.883TrpGlu: 1.883 ± 0.69
1.883TrpPhe: 1.883 ± 0.687
2.589TrpGly: 2.589 ± 0.986
0.471TrpHis: 0.471 ± 0.248
1.412TrpIle: 1.412 ± 0.473
1.883TrpLys: 1.883 ± 0.594
0.706TrpLeu: 0.706 ± 0.294
0.706TrpMet: 0.706 ± 0.4
0.706TrpAsn: 0.706 ± 0.277
0.941TrpPro: 0.941 ± 0.683
0.706TrpGln: 0.706 ± 0.335
0.706TrpArg: 0.706 ± 0.365
1.647TrpSer: 1.647 ± 0.82
0.706TrpThr: 0.706 ± 0.276
0.706TrpVal: 0.706 ± 0.362
0.706TrpTrp: 0.706 ± 0.526
0.941TrpTyr: 0.941 ± 0.816
0.0TrpXaa: 0.0 ± 0.0
Tyr
0.706TyrAla: 0.706 ± 0.335
0.706TyrCys: 0.706 ± 0.294
2.589TyrAsp: 2.589 ± 0.962
2.118TyrGlu: 2.118 ± 1.191
2.589TyrPhe: 2.589 ± 0.534
1.883TyrGly: 1.883 ± 0.832
1.883TyrHis: 1.883 ± 0.513
3.766TyrIle: 3.766 ± 0.985
3.53TyrLys: 3.53 ± 1.259
4.707TyrLeu: 4.707 ± 1.502
0.471TyrMet: 0.471 ± 0.238
3.53TyrAsn: 3.53 ± 0.695
1.177TyrPro: 1.177 ± 0.498
0.941TyrGln: 0.941 ± 0.582
1.647TyrArg: 1.647 ± 1.124
3.06TyrSer: 3.06 ± 0.735
0.941TyrThr: 0.941 ± 0.511
1.412TyrVal: 1.412 ± 0.903
0.235TyrTrp: 0.235 ± 0.425
1.177TyrTyr: 1.177 ± 0.539
0.0TyrXaa: 0.0 ± 0.0
Xaa
0.0XaaAla: 0.0 ± 0.0
0.0XaaCys: 0.0 ± 0.0
0.0XaaAsp: 0.0 ± 0.0
0.0XaaGlu: 0.0 ± 0.0
0.0XaaPhe: 0.0 ± 0.0
0.0XaaGly: 0.0 ± 0.0
0.0XaaHis: 0.0 ± 0.0
0.0XaaIle: 0.0 ± 0.0
0.0XaaLys: 0.0 ± 0.0
0.0XaaLeu: 0.0 ± 0.0
0.0XaaMet: 0.0 ± 0.0
0.0XaaAsn: 0.0 ± 0.0
0.0XaaPro: 0.0 ± 0.0
0.0XaaGln: 0.0 ± 0.0
0.0XaaArg: 0.0 ± 0.0
0.0XaaSer: 0.0 ± 0.0
0.0XaaThr: 0.0 ± 0.0
0.0XaaVal: 0.0 ± 0.0
0.0XaaTrp: 0.0 ± 0.0
0.0XaaTyr: 0.0 ± 0.0
0.0XaaXaa: 0.0 ± 0.0
Statistics based on 9 proteins (4250 amino acids)

Note: The error has been estimated with the bootstraping (x100) at the protein level

Above dipeptide statistics (among other stats for this proteome) you can download from this CSV file
See this proteome in: uniprot_link
Proteome-pI is available under Creative Commons Attribution-NoDerivs license, for more details see here

Reference: Kozlowski LP. Proteome-pI 2.0: Proteome Isoelectric Point Database Update. Nucleic Acids Res. 2021, doi: 10.1093/nar/gkab944 Contact: Lukasz P. Kozlowski