Sometimes Scientists Spread Misinformation

24 Aug

To err is human. Good scientists are aware of that, painfully so. The model scientist obsessively checks everything twice over and still keeps eyes peeled for loose ends. So it is a shock to learn that some of us are culpable for spreading misinformation.

Ken and I find that articles with serious errors, even articles based on fraudulent data, continue to be approvingly cited—cited without any mention of any concern—long after the problems have been publicized. Using a novel database of over 3,000 retracted articles and over 74,000 citations to these articles, we find that at least 31% of the citations to retracted articles happen a year after the publication of the retraction notice. And that over 90% of these citations are approving.

What gives our findings particular teeth is the role citations play in science. Many, if not most, claims in a scientific article rely on work done by others. And scientists use citations to back such claims. The readers rely on scientists to note any concerns that impinge on the underlying evidence for the claim. And when scientists cite problematic articles without noting any concerns they very plausibly misinform their readers.

Though 74,000 is a large enough number to be deeply concerning, retractions are relatively infrequent, and that may lead some people to discount these results. But citations to retracted articles post-retraction are extremely revealing precisely because retraction is such a low, low bar. Retractions generally follow convincing evidence of serious malpractice, usually fraud or grave error. Anything short of that, for example, an error in data analysis, is usually left to ‘self-correct.’ So if scientists are approvingly citing retracted articles after they have been retracted, they are failing to clear even that low bar. Such failure suggests a broader malaise.

To investigate the broader malaise, Ken and I exploited data from an article published in Nature that notes a statistical error in a series of articles published in prominent journals. Once again, we find that approving citations to the erroneous articles persist after the error has been publicized. If anything, the rate of citation to the erroneous articles is higher after the error has been publicized, and 98% of the citations are approving.

In all, it seems, we are failing.

The New Unit of Scientific Production

11 Aug

One fundamental principle of science is that there is no privileged observer. You get to question what people did. But to question, you first must know what people did. So part of good scientific practice is to make it easy for people to understand how the sausage was made—how the data were collected, transformed, and analyzed—and ideally, why you chose to make the sausage that particular way. Papers are ok places for describing all this, but we now have better tools: version controlled repositories with notebooks and readme files.

The barrier to understanding is not just lack of information but also poorly organized information. There are three different arcs of information: cross-sectional (where everything is and how the pieces relate to one another), temporal (how the pieces evolve over time), and inter-personal (who is making the changes). To be organized cross-sectionally, you need to be macro organized (where is the data, where are the scripts, what does each script do, how do I know what the data mean, etc.) and micro organized (each script has logic and organization to it, which also means following good coding style). Temporal organization in version control simply requires meaningful commit messages. And inter-personal organization requires no effort at all, beyond the logic of pull requests.
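As a concrete illustration of the macro organization, a repository might be laid out like this (the layout below is only one possibility, not a prescription):

```
project/
├── README.md           # what this is, how to run it, where the data come from
├── data/
│   ├── raw/            # data as collected; never edited by hand
│   └── processed/      # produced by the scripts; regenerated, never hand-edited
├── scripts/
│   ├── 01_clean.py     # a line at the top of each script saying what it does
│   └── 02_analyze.py
├── notebooks/
│   └── analysis.ipynb  # the 'why' behind the data and modeling choices
└── codebook.md         # what the data mean, column by column
```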

The obvious benefits of this new way are known. But what is less discussed is that it allows you to critique specific pull requests and specific decisions made in particular commits. This provides an entirely new way to make progress in science. The new unit of science also means that we don’t just dole out credit in a crude currency like journal articles; we can also pay in lower denominations. We can credit each edit, each suggestion. And why not? The third big benefit is that we can build epistemological trees where the logic of disagreement is clear.

The dead tree edition is dead. It is also time to retire the e-version of the dead tree edition.

Quality Data: Plumbing ML Data Pipelines

6 Aug

What’s the difference between a scientist and a data scientist? Scientists often collect their own data, and data scientists often use data collected by other people. That is part jest but speaks to an important point. Good scientists know their data. Good data scientists must know their data too. To help data scientists learn about the data they use, we need to build systems that give them good data about the data. But what is good data about the data? And how do we build systems that deliver that? Here’s some advice (tailored toward rectangular data for convenience):

  • From Where, How Much, and Such
    • Provenance: how were each of the columns in the data created (obtained)? If the data are derivative, find out the provenance of the original data. Be as concrete as possible, linking to scripts, related teams, and such.
    • How frequently are the data updated?
    • Cost per unit of data, e.g., a cell in rectangular data.
    Both the frequency with which the data are updated and the cost per unit of data may change over time. Provenance may change as well: a new team (or person) may start managing the data. So the person who ‘owns’ the data must come back to these questions every so often. Come up with a plan.
  • What? To know what the data mean, you need a data dictionary. A data dictionary explains the key characteristics of the data. It includes:
    1. Information about each of the columns in plain language.
    2. How were the data collected? For instance, if you conducted a survey, you need the question text and the response options (if any) that were offered, along with the ‘mode’, where the question fell in the sequence, whether it was alone on the screen, etc.
    3. Data type
    4. How (if at all) are missing values generated?
    5. For integer columns, the range, sd, mean, median, n_0s, and n_missing. For categorical columns, the number of unique values, what each label means, and a frequency table that includes n_missing (if missing values can be of multiple types, show a row for each).
    6. The number of duplicate rows in the data, whether duplicates are allowed, and why you would expect to see them.
    7. Number of rows and columns
    8. Sampling
    9. For supervised models, store correlation of y with key x_vars
  • What If? What if you have a question? Who should you bug? Who ‘owns’ the ‘column’ of data?

Store these data in JSON so that you can validate future data against it. Produce the JSON for each update. You can then flag when the data are some number of s.d. above or below the last ingest.
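To make this concrete, here is a minimal sketch in Python/pandas that builds such a dictionary and writes it out as JSON at each ingest. The file name, the column description, and the owner are hypothetical.

```python
import json
import pandas as pd

def build_data_dictionary(df: pd.DataFrame, descriptions: dict, owner: str) -> dict:
    """Summarize a rectangular dataset: shape, duplicates, and per-column stats."""
    dictionary = {
        "owner": owner,
        "n_rows": int(df.shape[0]),
        "n_cols": int(df.shape[1]),
        "n_duplicate_rows": int(df.duplicated().sum()),
        "columns": {},
    }
    for col in df.columns:
        s = df[col]
        entry = {
            "description": descriptions.get(col, ""),  # plain-language meaning
            "dtype": str(s.dtype),
            "n_missing": int(s.isna().sum()),
        }
        if pd.api.types.is_numeric_dtype(s):
            entry.update({
                "min": float(s.min()),
                "max": float(s.max()),
                "mean": float(s.mean()),
                "median": float(s.median()),
                "sd": float(s.std()),
                "n_zeros": int((s == 0).sum()),
            })
        else:
            entry.update({
                "n_unique": int(s.nunique(dropna=True)),
                # frequency table, with missing values kept as their own row
                "frequency": {str(k): int(v) for k, v in
                              s.value_counts(dropna=False).items()},
            })
        dictionary["columns"][col] = entry
    return dictionary

df = pd.read_csv("ingest_2021_08_06.csv")  # hypothetical ingest
dd = build_data_dictionary(df,
                           descriptions={"age": "age in years at survey date"},
                           owner="data-platform-team")
with open("data_dictionary_2021_08_06.json", "w") as f:
    json.dump(dd, f, indent=2, default=str)
```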

Store all this metadata with the data. For example, you can extend the dataframe class in Scala to make it so.

Auto-generate reports in markdown with each ingest.

In many ML applications, you are also ingesting data back from the user. So you need the same as above for the data you are getting from the user (and some of it at least needs to match the stored data). 

For any derived data, you need the scripts and the logic, ideally in a notebook. This is your translation function.

Where possible, follow the third normal form of databases. Only store translations when translation is expensive. Even then, think twice.

Lastly, some quality control. Periodically sit down with your team to see if you should see what you are seeing. For instance, if you are in the survey business, do the completion times make sense? If you are doing supervised learning, get a random sample of labels and assess their quality. You can also assess quality by looking at the errors your supervised model makes: are the errors because the data are mislabeled? Keep iterating. Keep improving. And keep cataloging those improvements. You should be able to ‘diff’ data collection, not just numerical summaries of the data. And with the method I highlight above, you can.
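Here is a minimal sketch of that diff, assuming the JSON dictionaries produced by the sketch above exist for two consecutive ingests (the file names are again hypothetical). It flags new columns, dropped columns, and numeric columns whose mean moved more than k standard deviations since the last ingest.

```python
import json

def diff_dictionaries(old: dict, new: dict, k: float = 3.0) -> list:
    """Compare two data dictionaries and return a list of human-readable flags."""
    flags = []
    old_cols, new_cols = set(old["columns"]), set(new["columns"])
    for col in sorted(new_cols - old_cols):
        flags.append(f"new column: {col}")
    for col in sorted(old_cols - new_cols):
        flags.append(f"dropped column: {col}")
    for col in sorted(old_cols & new_cols):
        o, n = old["columns"][col], new["columns"][col]
        if "mean" in o and "mean" in n and o.get("sd"):
            drift = abs(n["mean"] - o["mean"]) / o["sd"]
            if drift > k:
                flags.append(f"{col}: mean moved {drift:.1f} s.d. since last ingest")
    return flags

with open("data_dictionary_2021_08_05.json") as f:
    last = json.load(f)
with open("data_dictionary_2021_08_06.json") as f:
    current = json.load(f)

for flag in diff_dictionaries(last, current):
    print(flag)
```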

Optimal Sequence in Which to Service Orders

27 Jul

What is the optimal order in which to service orders assuming a fixed budget?

Let’s assume that we have to service orders o_1, …, o_n, with the orders indexed by i. Let’s also assume that for each service order, we know how the costs change over time. For simplicity, let’s assume that time is discrete and measured in days. If we service order o_i at time t, we expect the cost to be c_it. Each service order also has an expiration time, j, after which the order cannot be serviced. The cost at expiration time, j, is the cost of failure and is denoted by c_ij.

The optimal sequence of servicing orders is determined by expected losses: service first the order where the expected loss is greatest. This leaves us with the question of how to estimate the expected loss at time t. To come up with an expectation, we need to sum over a probability distribution. For order o_i at time t, we need the probability, p_is, that we would service o_i at each time s from t+1 through j. And then we multiply each p_is by the corresponding cost, c_is. So framed, the expected loss for order i at time t =
c_it – Σ_{s = t+1}^{j} p_is * c_is

However, determining p_it is not straightforward. New items are added to the queue at t+1. On the flip side, we also get to re-prioritize at t+1. The question is whether we will get to item o_i at t+1. (Under that framing, the probability is either 0 or 1.) For that, we need to forecast the kinds of items in the queue tomorrow. One simplification is to assume that the items in the queue today are the same ones that will be in the queue tomorrow. Then the problem reduces to estimating the cost of punting each item again tomorrow, sorting based on the costs at t+1, and checking whether we will get to clear the item. (We can forgo the simplification by forecasting our queue tomorrow, and each day after that until j, for each item, and calculating the costs.)

If the data are available, we can tack on the clearing time per order and get a better answer to whether we will clear o_i at time t or not.
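To make the calculation concrete, here is a minimal sketch with made-up costs and probabilities. probs[s] is read as the probability that the order ends up being serviced on day s if we do not service it today, and the expected loss is taken as the expected extra cost of not servicing the order today (the formula above, up to sign), so that servicing the order with the largest loss first gives the right sequence.

```python
def expected_loss_of_waiting(costs, probs, t):
    """Expected extra cost of not servicing the order on day t."""
    j = len(costs) - 1  # expiration day; costs[j] is c_ij, the cost of failure
    expected_future_cost = sum(probs[s] * costs[s] for s in range(t + 1, j + 1))
    return expected_future_cost - costs[t]

def prioritize(orders, t):
    """Service first the orders whose expected loss from waiting is greatest."""
    losses = {name: expected_loss_of_waiting(c, p, t) for name, (c, p) in orders.items()}
    return sorted(losses, key=losses.get, reverse=True)

# Two toy orders over a four-day horizon (t = 0, 1, 2; day 3 is expiration).
# For each order: (costs per day, probability of being serviced on each day).
orders = {
    "o_1": ([10, 12, 15, 40], [0.0, 0.5, 0.3, 0.2]),
    "o_2": ([10, 11, 12, 20], [0.0, 0.6, 0.3, 0.1]),
}
print(prioritize(orders, t=0))  # ['o_1', 'o_2']: punting o_1 is costlier
```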

Pride and Prejudice

14 Jul

It is ‘so obvious’ that policy A >> policy B that only those who don’t want to know, or those who want inferior things, would support policy B. Does this conversation remind you of any that you have had? We don’t just have such conversations about policies. We also have them about people. Way too often, we are being too harsh.

We overestimate how much we know. We ‘know know’ that we are right, we ‘know’ that there isn’t enough information in the world that will make us switch to policy B. Often, the arrogance of this belief is lost on us. As Kahneman puts it, we are ‘ignorant of our own ignorance.’ How could it be anything else? Remember the aphorism, “the more you know, the more you know you don’t know”? The aphorism may not be true but it gets the broad point right. The ignorant are overconfident. And we are ignorant. The human condition is such that it doesn’t leave much room for being anything else (see the top of this page).

Here’s one way to judge your ignorance (see here for some other ideas). Start by recounting what you know. Sit in front of a computer and type it up. Go for it. And then add a sentence about how you know it. Do you recall reading any detailed information about this person or issue? From where? Would you have bought a car if you had that much information about the car?

We don’t just overestimate what we know; we also underestimate what other people know. Anybody with different opinions must know less than I do. It couldn’t be that they know more, could it?

Both being overconfident about what we know and underestimating what other people know lead to the same thing: being too confident about the rightness of our cause and mistaking our prejudices for obvious truths.

George Carlin got it right. “Have you ever noticed that anybody driving slower than you is an idiot, and anyone going faster than you is a maniac?” It seems the way we judge drivers is how we judge everything else. Anyone who knows less than you is either being willfully obtuse or an idiot. And those who know more than you just look like ‘maniacs.’

The Base ML Model

12 Jul

The days of the artisanal ML model are mostly over. The artisanal model builds off domain “knowledge” (it can often be considerably less than that, bordering on misinformation). The artisan has long discussions with domain experts about what variables to include and how to include them in the model, often making idiosyncratic decisions about both. Or the artisan thinks deeply and draws on his own well. And then applies a couple of methods to the final feature set of 10s of variables, and out pops “the” model. This is borderline farcical when the datasets are both long and wide. For supervised problems, the low cost, scalable, common sense thing to do is to implement the following workflow:

1. Get good univariate summaries of each column in the data: mean, median, min, max, sd, and n_missing for numeric columns; the number of unique values, n_missing, and frequency counts for categorical columns; etc. Use this to diagnose and understand the data. What stuff is common? On what variables do we have bad data? (See pysum.)

2. Get good bivariate summaries. Correlations for continuous variables and differences in means for categorical variables are reasonable. Use this to understand how the variables are related. Use this to understand the data.

3. Create a dummy vector for missing values for each variable

4. Subset on non-sparse columns

5. Regress on all non-sparse columns, ideally using NN, so that you are not in the business of creating interactions and such.

I have elided a lot of detail, so let’s take a more concrete example. Say you are predicting whether someone will be diagnosed with diabetes in year y given the claims they made in years y-1, y-2, y-3, etc. Say the claim for each service and medicine is a unique code. Tokenize the claims data so that each unique code gets its own column, and filter to the non-sparse codes. How much information about time you want to preserve is up to you, but for a first cut, roll up the data so that a claim for code X made in any year is treated the same. Voila! You have your baseline model.
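For concreteness, here is a minimal sketch of such a baseline. The file and column names are hypothetical: a long claims table with one row per (member, claim code) from prior years, and a labels table with the year-y diagnosis. A regularized logistic regression stands in for the model; a neural net, as suggested above, would slot in the same way. (The missing-value dummies of step 3 are moot in this representation: a code either appears for a member or it does not.)

```python
import pandas as pd
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

claims = pd.read_csv("claims_prior_years.csv")  # columns: member_id, claim_code
labels = pd.read_csv("diabetes_year_y.csv")     # columns: member_id, diabetes (0/1)

# Tokenize: one column per unique claim code, 1 if the member ever filed it,
# rolling up time so a code filed in any prior year is treated the same.
X = pd.crosstab(claims["member_id"], claims["claim_code"]).clip(upper=1)

# Keep only non-sparse codes (here: codes filed by at least 1% of members).
X = X.loc[:, X.mean() >= 0.01]

# Align labels to the feature matrix (assumes one label per member).
y = labels.set_index("member_id").loc[X.index, "diabetes"]

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
model = LogisticRegression(max_iter=1000)
model.fit(X_train, y_train)
print("baseline accuracy:", model.score(X_test, y_test))
```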

Optimal Sequence in Which to Schedule Appointments

1 Jul

Say that you have a travel agency. Your job is to book rooms at hotels. Some hotels fill up more quickly than others, and you want to figure out which hotels to book first so that your net booking rate is as high as it can be given the staff you have.

The logic of prioritization is simple: prioritize those hotels where the expected loss if you don’t book now is the largest. The only thing we need to do is find a way to formalize the losses. Going straight to formalization is daunting. A toy example helps.

Imagine that there are two hotels, Hotel A and Hotel B. If you call 2-days and 1-day in advance, the chances of successfully booking a room are .8 and .8 at Hotel A, and .8 and .5 at Hotel B. You can only make one call a day, so it is Hotel A or Hotel B. Also assume that failing to book a room costs the same at Hotel A and at Hotel B.

If you were making a decision 1-day out on which hotel to call, the smart thing would be to choose Hotel A: the probability of making a booking is larger. But ‘larger’ can be formalized in terms of losses. At day 0, the probability goes to 0. So you stand to lose .8 with Hotel A and .5 with Hotel B. The potential loss from waiting is larger for Hotel A than for Hotel B.

If you were asked to choose 2-days out, which one should you choose? For Hotel A, if you forgo calling 2-days out, your chances of successfully booking a room the next day are .8. For Hotel B, the chances are .5. Let’s play out the two scenarios. If we book at Hotel A 2-days out and Hotel B 1-day out, our expected batting average is (.8 + .5)/2. If we choose the opposite, our batting average is (.8 + .8)/2. It makes sense to choose the latter. Framed as expected losses, Hotel A goes from .8 to .8, an expected loss of 0, while Hotel B goes from .8 to .5, an expected loss of .3. So we should book Hotel B 2-days out.

Now that we have the intuition, let’s move to 3-days, 2-days, and 1-day out, as that generalizes nicely to k-days out. To understand the logic, let’s first work out a 101 probability question. Say that you have two fair coins that you toss independently. What is the chance of getting at least one head? The possible outcomes are HH, HT, TH, and TT. The chance is 3/4. Or 1 minus the chance of getting TT (two failures): 1 – .5*.5.

The 3-days out example is next; see the table below. If you miss the chance of calling a hotel 3-days out, the expected loss is the decline in the chances of successfully booking 2-days or 1-day out. Assume that the probabilities 2-days out and 1-day out are independent, and it becomes something like the coin example: the probability of successfully booking 2-days or 1-day out is 1 – the probability of failing on both days. Calculate the expected losses for each hotel, and you have a way to decide which hotel to call on day 3.

|       | 3-day | 2-day | 1-day |
|-------|-------|-------|-------|
| Hotel A | .9    | .9    | .4    |
| Hotel B | .9    | .9    | .9    |

In our example, the chances of still booking a room if we skip the 3-days-out call come to 1 – (1/10)*(6/10) = .94 for Hotel A and 1 – (1/10)*(1/10) = .99 for Hotel B. The expected loss from waiting is therefore larger for Hotel A (.9 – .94) than for Hotel B (.9 – .99), so we should call Hotel A 3-days out before we call Hotel B.
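Here is a minimal sketch of the full rule, using the probabilities from the table and the same independence assumption as above.

```python
def loss_of_waiting(day_probs):
    """Today's booking probability minus the chance of still booking on a later day."""
    today, later = day_probs[0], day_probs[1:]
    p_fail_all_later = 1.0
    for p in later:
        p_fail_all_later *= (1 - p)
    return today - (1 - p_fail_all_later)

def which_hotel_to_call(probs):
    """Call the hotel with the largest expected loss from waiting."""
    return max(probs, key=lambda hotel: loss_of_waiting(probs[hotel]))

# Probabilities from the table, listed from 3-days out down to 1-day out.
probs = {"Hotel A": [0.9, 0.9, 0.4], "Hotel B": [0.9, 0.9, 0.9]}
print(which_hotel_to_call(probs))  # Hotel A: .9 - .94 > .9 - .99
```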

Code 44: How to Read Ahler and Sood

27 Jun

This is a follow-up to the hilarious Twitter thread about the sequence of 44s. Numbers in Perry’s 538 piece come from this paper.

First, yes, the 44s are indeed correct. (Better yet, look for yourself.) But what do the 44s refer to? 44 is the average of all the responses. When Perry writes “Republicans estimated the share at 46 percent” (we have similar language in the paper, which is regrettable as it can be easily misunderstood), it doesn’t mean that every Republican thinks so. It may not even mean that the median Republican thinks so. See OA 1.7 for medians, OA 1.8 for distributions, but see also OA 2.8.1, Table OA 2.18, OA 2.8.2, OA 2.11, and Table OA 2.23.

Key points =

1. Large majorities overestimate the share of party-stereotypical groups in the party, except for Evangelicals and Southerners.

2. Compared to what people think is the share of a group in the population, people still think the share of the group in the stereotyped party is greater. (But how much more varies a fair bit.)

3. People also generally underestimate the share of counter-stereotypical groups in the party.

Automating Understanding, Not Just ML

27 Jun

Some of the most complex parts of Machine Learning are largely automated. The modal ML person types in simple commands for very complex operations and voila! Some companies, like Microsoft (Azure) and DataRobot, also provide a UI for this. And this has generally not turned out well. Why? Because this kind of system does too little for the modal ML person and expects too much from the rest. So the modal ML person doesn’t use it. And the people who do use it, generally use it badly. The black box remains the black box. But not much is needed to place a lamp in this black box. Really, just two things are needed:

1. A data summarization and visualization engine, preferably with some chatbot feature that guides people smartly through the key points, including the problems. For instance, start with univariate summaries, highlighting ranges, missing data, sparsity, and such. Then, if it is a supervised problem, give people a bunch of loess plots or explain the ‘best fitting’ parametric approximations of the relationship with y in plain English, such as, “people who eat 1 more cookie live 5 minutes shorter on average.”

2. An explanation engine, including what the explanations of observational predictions mean. We already have reasonable implementations of this.

When you have both, you have automated complexity thoughtfully, in a way that empowers people, rather than creating a system that enables people to do fancy things badly.
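As a toy illustration of the plain-English readout described in the first item (the data and the exact numbers below are made up; the phrasing echoes the cookie example above):

```python
import numpy as np

# Hypothetical data: cookies eaten per day vs. minutes of life lost.
cookies = np.array([0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
minutes_lost = np.array([1, 6, 9, 16, 21, 24, 31, 34, 41, 44])

# Best-fitting line: minutes_lost = slope * cookies + intercept.
slope, intercept = np.polyfit(cookies, minutes_lost, 1)

# Prints: "People who eat 1 more cookie live about 5 minutes shorter on average."
print(f"People who eat 1 more cookie live about {slope:.0f} minutes shorter on average.")
```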

Talking On a Tangent

22 Jun

What is the trend over the last X months? One estimate of the ‘trend’ over the last k time periods is what I call the ‘hold up the ends’ method: look at t_k and t_0, take the difference between the two, and divide by the number of time periods. If t_k > t_0, you say that things are going up. If t_k < t_0, you say things are going down. And if they are the same, you say things are flat. But this method can elide important non-linearity. For instance, say unemployment went down in the first 9 months and then went up over the last 3 but ended with t_k < t_0. What is the trend?

If by trend we mean the average slope over the last k time periods, and if there is no measurement error, then the ‘hold up the ends’ method is reasonable. If there is measurement error, we would want to smooth the time series before we hold up the ends.

Often people also care about ‘consistency’ in the trend. One estimate of consistency is the proportion of times we get a difference of the same sign when we compare any two consecutive time periods. And often people care more about later time periods than earlier ones. One could build on that intuition by weighting later changes more.
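Here is a minimal sketch of these ideas on a made-up unemployment series. The ‘consistency’ measure is one reading of the definition above (the share of consecutive changes that move in the same direction as the end-to-end change), and the recency weighting is just one possible scheme.

```python
def hold_up_the_ends(series):
    """Average slope: (last - first) / number of elapsed periods."""
    return (series[-1] - series[0]) / (len(series) - 1)

def consistency(series):
    """Share of consecutive changes that share the sign of the end-to-end change."""
    overall = series[-1] - series[0]
    changes = [b - a for a, b in zip(series, series[1:])]
    return sum(1 for c in changes if c * overall > 0) / len(changes)

def recency_weighted_slope(series):
    """Weight later changes more: the change ending at period s gets weight s."""
    changes = [b - a for a, b in zip(series, series[1:])]
    weights = range(1, len(changes) + 1)
    return sum(w * c for w, c in zip(weights, changes)) / sum(weights)

# Unemployment falls early on, ticks up in the last three months, but ends lower.
unemployment = [6.0, 5.8, 5.6, 5.3, 5.1, 4.9, 4.8, 4.7, 4.6, 4.7, 4.9, 5.0]
print(hold_up_the_ends(unemployment))        # negative: the ends say 'down'
print(consistency(unemployment))             # well below 1: the decline was not consistent
print(recency_weighted_slope(unemployment))  # much closer to flat: later changes count more
```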