S

Sung‐Sau So

Merck & Co., Inc., Rahway, NJ, USA (United States)

Publishes on Computational Drug Discovery Methods, Protein Structure and Dynamics, Synthesis and biological activity. 45 papers and 2.4k citations.

45Publications
2.4kTotal Citations

Is this you? Claim your profile.

Add your photo, update your bio, and get notified when your ranking changes.

Top publicationsby citations

Overview of Artificial Neural Networks
Jinming Zou, Yi Han, Sung‐Sau So|Methods in molecular biology|2008
Cited by 657

The artificial neural network (ANN), or simply neural network, is a machine learning method evolved from the idea of simulating the human brain. The data explosion in modem drug discovery research requires sophisticated analysis methods to uncover the hidden causal relationships between single or multiple responses and a large set of properties. The ANN is one of many versatile tools to meet the demand in drug discovery modeling. Compared to a traditional regression approach, the ANN is capable of modeling complex nonlinear relationships. The ANN also has excellent fault tolerance and is fast and highly scalable with parallel processing. This chapter introduces the background of ANN development and outlines the basic concepts crucially important for understanding more sophisticated ANN. Several commonly used learning methods and network setups are discussed briefly at the end of the chapter.

Evolutionary Optimization in Quantitative Structure−Activity Relationship:  An Application of Genetic Neural Networks
Sung‐Sau So, Martin Karplus|Journal of Medicinal Chemistry|1996
Cited by 261

A new hybrid method (GNN) combining a genetic algorithm and an artificial neural network has been developed for quantitative structure-activity relationship (QSAR) studies. A suitable set of molecular descriptors are selected by a genetic algorithm. This set serves as input to a neural network, in which model-free mapping of multivariate data is performed. Multiple predictors are generated that are superior to results obtained from previous studies of the Selwood data set, which is used to test the method. The neural network technique provides a graphical description of the functional form of the descriptors that play an important role in determining drug activity. This can serve as an aid in future design of drug analogues. The effectiveness of GNN is tested by comparing its results with a benchmark obtained by exhaustive enumeration. Different fitness strategies that tune the evolution of genetic models are examined, and QSARs with higher predictiveness are found. From these results, a composite model is constructed by averaging predictions from several high-ranking models. The predictions of the resulting QSAR should be more reliable than those derived from a single predictor because it makes greater use of information and also permits error estimation. An analysis of the sets of descriptors selected by GNN shows that it is essential to have one each for the steric, electrostatic, and hydrophobic attributes of a drug candidate to obtain a satisfactory QSAR for this data set. This type of result is expected to be of general utility in designing and understanding QSAR.

Structure-activity relationships from molecular similarity matrices
Andrew C. Good, Sung‐Sau So, W. Graham Richards|Journal of Medicinal Chemistry|1993
Cited by 161

An alternative method for determining structure-activity correlations is presented. Ligand molecules are described using data matrices derived from the results of N by N (each molecule compared to every other) molecular similarity calculations. The matrices were analyzed using a neural network pattern recognition technique and partial least squares statistics, with the results obtained compared to those achieved using comparative molecular field analysis (CoMFA). The molecular series used in the study comprised 31 steroids. The resultant pattern recognition analysis showed clustering of compounds with high, intermediate, and low affinity into separate regions of the neuron output plots. The cross-validated correlation coefficients obtained from statistical analyses of the matrices against steroid binding data compared well with those achieved using CoMFA. These results show that data matrices derived from molecular similarity calculations can provide the basis for rapid elucidation of both qualitative and quantitative structure-activity relationships.

Three-Dimensional Quantitative Structure−Activity Relationships from Molecular Similarity Matrices and Genetic Neural Networks. 1. Method and Validations
Sung‐Sau So, Martin Karplus|Journal of Medicinal Chemistry|1997
Cited by 155Open Access

The utility of genetic neural network (GNN) to obtain quantitative structure-activity relationships (QSAR) from molecular similarity matrices is described. In this application, the corticosteroid-binding globulin (CBG) binding affinity of the well-known steroid data set is examined. Excellent predictivity can be obtained through the use of either electrostatic or shape properties alone. Statistical validation using a standard randomization test indicates that the results are not due to chance correlations. Application of GNN on the combined electrostatic and shape matrix produces a six-descriptor model with a cross-validated r2 value of 0.94. The model is superior to those obtained from partial least-squares and genetic regressions, and it also compares favorably with the results for the same data set from other established 3D QSAR methods. The theoretical basis for the use of molecular similarity in QSAR is discussed.

Application of neural networks: quantitative structure-activity relationships of the derivatives of 2,4-diamino-5-(substituted-benzyl)pyrimidines as DHFR inhibitors
Sung‐Sau So, William G. Richards|Journal of Medicinal Chemistry|1992
Cited by 138

A comparative study of quantitative structure-activity relationships involving diaminopyrimidines as DHFR inhibitors using regression analysis and the neural-network approach suggests that the neural network can outperform traditional methods. The technique permits the highlighting the functional form of those parameters which have an influence on the biological activity.